ABC: Large Scale Labeled Data for Classification
This download contains the processed datasets used in the AAAI’19 paper ABC: Efficient Selection of Machine Learning Configuration on Large Dataset. A machine learning configuration refers to a combination of preprocessor, learner, and hyperparameters. Given…
Leadership and Advocacy
Atom: A Grammar for Unit Visualizations
SIGMOD 2018 Program Committee Chair’s Report
Generating data series query workloads
Sparse-Factor Synthetic Controls: Unit-Level Counterfactuals from High-Dimensional Data
Synthetic controls is the most common empirical strategy used to accommodate unobserved endogenous factor models. However, little consideration is given to the dimension of the relevant factors, whether the associated loadings are truly time-invariant and…
Data Platforms and Analytics – Internships
MSR Redmond is now accepting applications for 2024 internship roles in the Data Platforms and Analytics Research Area. During the internship you will have the opportunity to work with top researchers and engineers at MSR…