Datasets
Data Science at Microsoft
- Home
- Datasets
These unique datasets, created and curated by Microsoft researchers, are for several fields of inquiry—ranging from natural language processing to computer vision. You can download these Microsoft Research datasets, free of charge, for use in your research.
Downloads
- Microsoft Research Social Media Conversation Corpus1 June 2015 · 0.11 MB · Version: 1.0
- Dataset for Inferring Missing Entity Type Instances for Knowledge Base Completion18 March 2015 · 1043.76 MB · Version: 1.0
- MSR Demosaicing Dataset28 February 2015 · 0.00 MB · Version: 1.0
- Sparse Reflections Analysis: Sensor Data from ECCV 2014 Paper26 November 2014 · 14.16 MB · Version: 1.0
- Microsoft Research Dense Visual Annotation Corpus28 August 2014 · 0.92 MB · Version: 1.0
- GeoS6 June 2014 · 7.14 MB · Version: 2.3.6
- Smart Selection Dataset10 April 2014 · 10.55 MB · Version: 1.0
- MSR 3D Video Dataset11 March 2014 · 721.23 MB · Version: 1.0
- Abstract Scene Dataset5 February 2014 · 783.05 MB · Version: 1.1
- Image Cropping Dataset24 October 2013 · 0.06 MB · Version: 1.0
- Powergrading Short Answer Grading Corpus4 October 2013 · 0.20 MB · Version: 1.0
- Lexical Semantics Dataset10 September 2013 · 0.00 MB · Version: 1
- ClueWeb 09 Labeled Near-Duplicate News Articles28 August 2013 · 14.09 MB · Version: 1.0
- Cultivating the Long tail of Environmental Observations31 July 2013 · 298.70 MB · Version: 1.0
- Avatar Dataset25 June 2013 · 3.03 MB · Version: 1.0
- Microsoft Document Aboutness Dataset19 November 2012 · 0.02 MB · Version: 0.0.1
- GeoLife GPS Trajectories9 August 2012 · 298.66 MB · Version: 1.2.2
- Question-Generation Corpus22 May 2012 · 0.43 MB · Version: 1.0
- Kinect Gesture Data Set24 April 2012 · 165.17 MB · Version: 1
- Data Set of English-Spanish Term Vectors from Wikipedia8 August 2011 · 218.44 MB · Version: 1.0.0
- Enron Stimuli for Text-Entry Experiments4 May 2011 · 0.02 MB · Version: 1.0
- Microsoft Research Video Description Corpus12 November 2010 · 2.54 MB · Version: 1.0
- Microsoft Research Action Data Set II (Part 5)3 March 2010 · 0.00 MB · Version: 1.0
- Microsoft Research Action Data Set II (Part 4)3 March 2010 · 0.00 MB · Version: 1.0
- Microsoft Research Action Data Set II (Part 3)3 March 2010 · 0.00 MB · Version: 1.0
- Microsoft Research Action Data Set II (Part 2)3 March 2010 · 0.00 MB · Version: 1.0
- Microsoft Research Action Data Set II (Part 1)3 March 2010 · 0.00 MB · Version: 1.0
- MSR Action Data Set24 July 2009 · 0.00 MB · Version: 1.0.0
- Microsoft Research Question-Answering Corpus13 November 2008 · 36.76 MB · Version: 1.0.0
- Microsoft Research MapSynthesis7 November 2007 · 0.83 MB · Version: 0.91
- Microsoft Research Asia Chinese Word-Segmentation Data Set16 August 2007 · 4.37 MB · Version: 1.0
- Simple Stereo Video Ground-Truth Data17 July 2007 · 17.84 MB · Version: 1
- NLP Data Sets for Comparative Study of Parameter-Estimation Methods2 June 2007 · 45.08 MB · Version: 1.0
- Microsoft Research Paraphrase Phrase Tables10 October 2006 · 698.27 MB · Version: 1.0
- ESL 123 Mass Noun Examples18 July 2006 · 0.20 MB · Version: 1.0
- Microsoft Research IME Corpus21 December 2005 · 4.29 MB · Version: 1.0
- Microsoft Research Paraphrase Corpus3 March 2005 · 1.30 MB · Version: 1.0
- Pitch and Voicing Estimates for Aurora 212 January 2005 · 56.33 MB · Version: 1.0
Learning resources
- Get started with Azure Machine Learning
- Get started with Power BI
- Create innovative dashboards to present your data analytics with BI Designer
- Learn about Advanced Analytics at Microsoft
- Microsoft Research Data Science Summer School
- Diversity in data science: Microsoft Research’s summer school aims high