Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Welcome to the UC Irvine Machine Learning Repository!

We currently maintain 462 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. For a general overview of the Repository, please visit our About page. For information about citing data sets in publications, please read our citation policy. If you wish to donate a data set, please consult our donation policy. For any other questions, feel free to contact the Repository librarians.

Supported By:

In Collaboration With:

Latest News:
09-24-2018: Welcome to the new Repository admins Dheeru Dua and Efi Karra Taniskidou!
04-04-2013: Welcome to the new Repository admins Kevin Bache and Moshe Lichman!
03-01-2010: Note from donor regarding Netflix data
10-16-2009: Two new data sets have been added.
09-14-2009: Several data sets have been added.
03-24-2008: New data sets have been added!
06-25-2007: Two new data sets have been added: UJI Pen Characters, MAGIC Gamma Telescope


Featured Data Set:  Hepatitis

Task: Classification
Data Type: Multivariate
# Attributes: 19
# Instances: 155

From G.Gong: CMU; Mostly Boolean or numeric-valued attribute types; Includes cost data (donated by Peter Turney)
Newest Data Sets:
11-16-2018:
 Electrical Grid Stability Simulated Data
11-09-2018:
 BAUM-2
11-09-2018:
 BAUM-1
11-05-2018:
 Parkinson's Disease Classification
11-02-2018:
 Caesarian Section Classification Dataset
10-12-2018:
 Superconductivty Data
10-08-2018:
 Physical Unclonable Functions
10-04-2018:
 Drug Review Dataset (Drugs.com)
10-02-2018:
 PANDOR
10-02-2018:
 Drug Review Dataset (Druglib.com)
09-16-2018:
 Student Academics Performance
09-14-2018:
 WESAD (Wearable Stress and Affect Detection)
Most Popular Data Sets (hits since 2007):
2273217:
 Iris
1336965:
 Adult
1024925:
 Wine
880082:
 Car Evaluation
815354:
 Breast Cancer Wisconsin (Diagnostic)
800010:
 Wine Quality
784399:
 Heart Disease
745726:
 Bank Marketing
734868:
 Human Activity Recognition Using Smartphones
702474:
 Abalone
696063:
 Forest Fires
483902:
 Poker Hand

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML