UCI Machine Learning Adult Dataset
Business Problem: Classification (a person earns more than 50k or less) Predictor Variable: Label ; Predictors: country, age, education, occupation, marital status etc.
文章:https://towardsdatascience.com/pandas-index-explained-b131beaf6f7b
数据集地址:https://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data
Kaggle - Avazu:Click-Through Rate Prediction
Predict whether a mobile ad will be clicked
In online advertising, click-through rate (CTR) is a very important metric for evaluating ad performance. As a result, click prediction systems are essential and widely used for sponsored search and real-time bidding.
Kaggle地址:
https://www.kaggle.com/c/avazu-ctr-prediction/overview
UCI - Adult Data Set $50k
Predict whether income exceeds $50K/yr based on census data. Also known as "Census Income" dataset
https://archive.ics.uci.edu/ml/datasets/Adult
UCI - Iris Data Set
This is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced frequently to this day
https://archive.ics.uci.edu/ml/datasets/Iris
Kaggle Titanic: Machine Learning from Disaster
use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.