Data Science Project Ideas! (Python)
Description
In this video, we walk through eight project ideas to help you build up your data science skills. All of the resources mentioned in the video can be found here in the description.
Covid-19 Analysis (2:16):
- Johns Hopkins Data: https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series
- Code that I wrote: https://github.com/KeithGalli/Data-Science-Project-Ideas/blob/master/Covid/Covid%20Analysis.ipynb
- Kaggle: https://www.kaggle.com/covid19
- @sentdex Kaggle video: https://youtu.be/S6GVXk6kbcs
- @3Blue1Brown simulations video: https://youtu.be/gxAaO2rsdIs
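If you want a feel for the data before opening the notebook, here is a minimal sketch of a first step: turning cumulative counts into daily new cases. It is not the code from the linked notebook. It assumes the wide layout used by the Johns Hopkins time-series files (four location columns, then one column per date), and the rows in SAMPLE_CSV are made-up placeholder values, not real data. The function names are hypothetical. It sticks to the standard library so it runs anywhere; pandas would do the same job in fewer lines.

```python
import csv
import io

# Made-up sample in the wide layout of the Johns Hopkins time-series files:
# four location columns, then one cumulative-count column per date.
# The numbers are illustrative, not real data.
SAMPLE_CSV = """Province/State,Country/Region,Lat,Long,1/22/20,1/23/20,1/24/20,1/25/20
,Exampleland,10.0,20.0,0,2,5,9
North,Samplestan,30.0,40.0,1,1,4,4
South,Samplestan,31.0,41.0,0,3,3,6
"""


def cumulative_by_country(csv_text):
    """Sum the cumulative counts of all rows that share a Country/Region."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    dates = header[4:]  # everything after the four location columns
    totals = {}
    for row in reader:
        country = row[1]
        counts = [int(value) for value in row[4:]]
        previous = totals.get(country, [0] * len(dates))
        totals[country] = [a + b for a, b in zip(previous, counts)]
    return dates, totals


def daily_new(cumulative):
    """Convert a cumulative series into day-over-day increases."""
    return [cumulative[0]] + [
        today - yesterday
        for yesterday, today in zip(cumulative, cumulative[1:])
    ]


dates, totals = cumulative_by_country(SAMPLE_CSV)
for country, series in totals.items():
    print(country, dict(zip(dates, daily_new(series))))
```

To try it on real data, you could read one of the CSV files from the Johns Hopkins link above and pass its text to cumulative_by_country in place of SAMPLE_CSV.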
Board Game AI (3:16):
- Overview video: https://youtu.be/y7AKtWGOPAE
- Implementing minimax algorithm in Python: https://youtu.be/MMLtza3CZFM
- Reinforcement learning (Snake) tutorial: https://towardsdatascience.com/how-to-teach-an-ai-to-play-games-deep-reinforcement-learning-28f9b920440a
- AlphaZero chess tutorial: https://towardsdatascience.com/create-ai-for-your-own-board-game-from-scratch-alpha-zero-part-3-f22761372245
- @DeepMind AlphaGo Documentary: https://youtu.be/WXuK6gekU1Y
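To give a rough idea of what minimax looks like in code, here is a minimal sketch for tic-tac-toe. It is not the implementation from the linked videos: the game, the 9-character string board encoding, and the function names are all assumptions picked to keep the example short. The idea is that X picks the move with the highest score, O picks the move with the lowest, and scores come from searching every line of play to the end of the game.

```python
from functools import lru_cache

# Hypothetical board encoding: a 9-character string, row by row,
# using "X", "O", or "." for an empty square.
WIN_LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),  # rows
    (0, 3, 6), (1, 4, 7), (2, 5, 8),  # columns
    (0, 4, 8), (2, 4, 6),             # diagonals
]


def winner(board):
    """Return "X" or "O" if that player has three in a row, else None."""
    for a, b, c in WIN_LINES:
        if board[a] != "." and board[a] == board[b] == board[c]:
            return board[a]
    return None


@lru_cache(maxsize=None)
def minimax(board, player):
    """Score the position from X's point of view: +1 win, 0 draw, -1 loss.

    `player` is the side to move. X maximizes the score, O minimizes it.
    """
    won = winner(board)
    if won is not None:
        return 1 if won == "X" else -1
    if "." not in board:
        return 0  # board full, nobody won: draw

    next_player = "O" if player == "X" else "X"
    scores = [
        minimax(board[:i] + player + board[i + 1:], next_player)
        for i, square in enumerate(board)
        if square == "."
    ]
    return max(scores) if player == "X" else min(scores)


def best_move(board, player):
    """Return the index of an empty square with the best minimax score."""
    next_player = "O" if player == "X" else "X"
    moves = [i for i, square in enumerate(board) if square == "."]
    pick = max if player == "X" else min
    return pick(
        moves,
        key=lambda i: minimax(board[:i] + player + board[i + 1:], next_player),
    )
```

For example, best_move("XX.OO....", "X") returns the square that completes X's top row. Tic-tac-toe is small enough to search completely; for bigger games you would typically cap the search depth and score unfinished positions with a heuristic.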
Reddit, Data is Beautiful (4:50):
- Subreddit URL: https://www.reddit.com/r/dataisbeautiful/
- Population changes chart: https://www.reddit.com/r/dataisbeautiful/comments/fr8q34/change_in_population_by_county_between_2010_and/
- S&P 500 recoveries chart: https://www.reddit.com/r/dataisbeautiful/comments/frzlbt/sp_500_recovers_during_major_crashes_oc/
Text Sentiment Analysis Tool (6:48):
- My full tutorial (machine learning w/ sklearn): https://youtu.be/M9Itm95JzL0
- Learn about Transformers: http://jalammar.github.io/illustrated-transformer/
- BERT Paper: https://arxiv.org/pdf/1810.04805.pdf
- spaCy NLP library: https://explosion.ai/blog/spacy-transformers
- YouTube API: https://developers.google.com/youtube/v3/quickstart/python
Sports Analysis (8:34):
- My script to webscrape sports data: https://github.com/KeithGalli/Data-Science-Project-Ideas/blob/master/Sports/extract_data.py
- Basketball reference site that I scraped: https://www.basketball-reference.com/leagues/NBA_2020_per_game.html
Stock Trading Bot (10:12):
- Alpaca Site: https://alpaca.markets/
- Alpaca Tutorials: https://alpaca.markets/docs/get-started-with-alpaca/tutorial-videos/
- Quantopian (to learn more & backtest your trading strategies): https://www.quantopian.com/tutorials/getting-started
House Pricing Prediction (12:02):
- Link to competition: https://www.kaggle.com/c/house-prices-advanced-regression-techniques
Miscellaneous Kaggle Projects (13:06):
- Kaggle Data: https://www.kaggle.com/datasets
- Airbnb Data: https://www.kaggle.com/dgomonov/new-york-city-airbnb-open-data
- My real world data science tutorial: https://youtu.be/eMOA1pPVUc4
Hope you guys enjoyed this video! If you have any questions about any of these projects, let me know in the comments. Like & subscribe if you haven't already :).
Some skills you should hopefully take away from these projects:
- Analysis with Python Pandas Library
- Visualization with Python Matplotlib Library
- AI/Machine Learning skills with the scikit-learn library
- Regression Techniques
- Exploratory Data Analysis
---------------------------------------------
Follow me on social media!
Instagram | https://www.instagram.com/keithgalli/
Twitter | https://twitter.com/keithgalli
---------------------------------------------
~ Intro Music ~
Track: Sunflower — Soyb [Audio Library Release]
Music provided by Audio Library Plus
Watch: https://youtu.be/dG1U3NuR9Pk
Free Download / Stream: https://alplus.io/sunflower