Articles
Elite Data Science -- Datasets for Data Science and Machine Learning
Create an artificial dataset
sklearn datasets module: from sklearn import datasets. It also contains some popular reference datasets.
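A minimal sketch of both uses of the module: generating synthetic data and loading a bundled reference dataset. The choice of make_classification and load_iris, and all sizes and the random seed, are illustrative assumptions rather than anything prescribed here.

```python
from sklearn import datasets

# Generate a synthetic binary classification problem.
# n_samples, n_features, and random_state are arbitrary illustrative values.
X, y = datasets.make_classification(
    n_samples=200,
    n_features=5,
    n_informative=3,
    n_redundant=0,
    random_state=0,
)
print(X.shape, y.shape)

# Load one of the reference datasets bundled with scikit-learn (Iris).
iris = datasets.load_iris()
print(iris.data.shape)
print(list(iris.target_names))
```

Other generators in the same module, such as make_regression and make_blobs, follow the same pattern of returning arrays sized by n_samples.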
Sources of datasets
awesome-public-datasets -- A topic-centric list of high-quality open datasets.
Built-in datasets in Scikit-Learn.
BuzzFeedNews/everything -- data from BuzzFeed.
COCO -- Common Objects in Context.
Data Hub Datasets collection -- high-quality data and datasets organized by topic.
data.gov -- a large dataset aggregator and the home of the US Government’s open data.
data.world -- The Cloud-Native Data Catalog.
FiveThirtyEight -- hard data and statistical analysis to tell stories about politics, sports, societal matters and more.
Google Dataset Search.
Google Trends Datastore
Google AI Datasets -- To contribute to the broader research community, Google periodically releases data of interest to researchers in a wide range of computer science disciplines.
Kaggle Datasets.
NLP-progress.
Open Images V6
Quandl -- a good choice for testing machine learning algorithms without spending time on cleaning data.