A Collection of Commonly Used Machine Learning Datasets

UCI Machine Learning Adult Dataset

Business problem: classification (whether a person earns more than $50K a year or not). Target variable: Label. Predictors: country, age, education, occupation, marital status, etc.

Article: https://towardsdatascience.com/pandas-index-explained-b131beaf6f7b

Dataset URL: https://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data
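The raw adult.data file has no header row, so column names have to be supplied when reading it. The following is a minimal sketch using only the Python standard library. It assumes the column order given in the dataset's documentation; the `parse_adult` helper, the derived `label` field, and the two sample rows are illustrative and not part of the dataset itself.

```python
import csv
import io

# Column names in the order documented for the dataset;
# the raw file itself carries no header row.
COLUMNS = [
    "age", "workclass", "fnlwgt", "education", "education-num",
    "marital-status", "occupation", "relationship", "race", "sex",
    "capital-gain", "capital-loss", "hours-per-week", "native-country",
    "income",
]


def parse_adult(lines):
    """Yield one dict per record, skipping blank or malformed lines."""
    # Fields are separated by a comma followed by a space,
    # so skipinitialspace=True strips the leading space from each value.
    reader = csv.reader(lines, skipinitialspace=True)
    for row in reader:
        if len(row) != len(COLUMNS):
            continue
        record = dict(zip(COLUMNS, row))
        # Hypothetical binary target: 1 if income is above $50K, else 0.
        record["label"] = 1 if record["income"] == ">50K" else 0
        yield record


# Illustrative rows in the file's format (assumed, for demonstration only).
SAMPLE = """39, State-gov, 77516, Bachelors, 13, Never-married, Adm-clerical, Not-in-family, White, Male, 2174, 0, 40, United-States, <=50K
52, Self-emp-not-inc, 209642, HS-grad, 9, Married-civ-spouse, Exec-managerial, Husband, White, Male, 0, 0, 45, United-States, >50K

"""

records = list(parse_adult(io.StringIO(SAMPLE)))
for r in records:
    print(r["age"], r["education"], r["income"], r["label"])
```

To run this against the real data, download the file from the URL above and pass the opened file object to `parse_adult` in place of the `io.StringIO` sample.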

Kaggle – Avazu: Click-Through Rate Prediction

Predict whether a mobile ad will be clicked

In online advertising, click-through rate (CTR) is a very important metric for evaluating ad performance. As a result, click prediction systems are essential and widely used for sponsored search and real-time bidding.
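As a quick illustration of the metric itself, CTR is the number of clicks divided by the number of ad impressions. The sketch below assumes each impression is recorded as a 0/1 click flag; the `click_through_rate` function and the sample flags are made up for illustration.

```python
def click_through_rate(clicks):
    """Return clicks / impressions for a sequence of 0/1 click flags."""
    impressions = len(clicks)
    if impressions == 0:
        raise ValueError("CTR is undefined with zero impressions")
    return sum(clicks) / impressions


# Made-up data: 8 impressions, 2 of which were clicked.
flags = [0, 0, 1, 0, 1, 0, 0, 0]
ctr = click_through_rate(flags)
print(f"CTR = {ctr:.2%}")
```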

Kaggle URL:

https://www.kaggle.com/c/avazu-ctr-prediction/overview

UCI – Adult Data Set $50k

Predict whether income exceeds $50K/yr based on census data. Also known as the “Census Income” dataset.

https://archive.ics.uci.edu/ml/datasets/Adult

UCI – Iris Data Set

This is perhaps the best known database to be found in the pattern recognition literature. Fisher’s paper is a classic in the field and is referenced frequently to this day.

https://archive.ics.uci.edu/ml/datasets/Iris

Kaggle Titanic: Machine Learning from Disaster

Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

https://www.kaggle.com/c/titanic