I am making this list to make it easier to find sources for publically available data sources which upcoming data scientists/data enthusiasts could play with.
Generic datasets
- U.S Government's public data: DATA.GOV
- Indian Government's public data: DATA.GOV.IN
- Public data from World Bank: World Bank
- Data by the Reserve Bank of India: RBI
- Five Thirty Eight: F38
Large Datasets
- Amazon Web Services: AWS
- Google Datasets: Google DS
- Youtube: YT labelled video datasets
Predictive modelling and ML
- UCI Machine Learning Repository
- Kaggle
- Financial / economic data: Quandl
- Driven Data
Image classification
- Hand-written digits: MNIST
- Character recognition: Chars74k
- Face: Frontal Face images
- Generic: ImageNet
Sentiment analysis
- Twitter: Twitter sentiment analysis
Miscellaneous
Hope this helped 💖
Top comments (0)