Research Working Group
DataPerf Working Group
-
Working Groups
- Training
- Inference
- Datasets
- Best Practices
- Research
Mission
Drive innovation in ML datasets by defining, developing, and operating benchmarks for datasets and data-centric algorithms.
Purpose
We are building DataPerf, a benchmark suite for ML datasets and algorithms for working with datasets. Historically, ML research has focused primarily on models, and simply used the largest existing dataset for common ML tasks without considering the dataset’s breadth, difficulty, and fidelity to the underlying problem. This under-focus on data has led to a range of issues, from data cascades in real applications, to saturation of existing dataset-driven benchmarks for model quality impeding research progress. In order to catalyze increased research focus on data quality and foster data excellence, we created DataPerf: a suite of benchmarks that evaluate the quality of training and test data, and the algorithms for constructing or optimizing such datasets, such as core set selection or labeling error debugging, across a range of common ML tasks such as image classification. We leverage the DataPerf benchmarks through challenges and leaderboards.
Deliverables
- Data benchmarking roadmap
- Data benchmarking rules
- Data benchmarking evaluation harnesses
- Data benchmarking reference implementations
- Leaderboards and challenges on an online platform
Meeting Schedule
Weekly on Thursday from 9:05-10:00AM Pacific.
How to Join and Access Working Group Resources
- To sign up for the group mailing list, receive the meeting invite, and access shared documents and meeting minutes:
- Associate a Google account with your organizational email address.
- Request to join the DataPerf Google Group. Requests are manually reviewed, so please be patient.
- Once your request to join the DataPerf Google Group is approved, you'll be able to access the DataPerf folder in the Public Google Drive.
- To engage in group dicussions:
- Join the group's channels on the MLCommons Discord server.
- To access the GitHub repository (public):
- If you want to contribute code, please sign our CLA first.
- Visit the GitHub repository.
DataPerf Website
Visit the DataPerf website to learn more about the group's data-centric benchmarks.
Working Group Chairs
To contact all DataPerf working group chairs email dataperf-chairs@mlcommons.org.
Lilith Bat-Leah (lilith@mlcommons.org) - LinkedIn
Lilith Bat-Leah is Vice President, Data Services at Mod Op, responsible for consulting on use cases for data analytics, data science, and machine learning. Lilith has over 11 years of experience managing, delivering, and consulting on identification, preservation, collection, processing, review, annotation, analysis, and production of data in legal proceedings. She also has experience leading research and development of AI / machine learning software. She speaks and writes about various topics such as evaluation of machine learning systems, ESI protocols, and discovery of databases. Lilith holds a BSGS in Organization Behavior from Northwestern University, where she graduated magna cum laude. She formerly served as Co-Trustee of the EDRM Analytics and Machine Learning project, as a member of the EDRM Global Advisory Council, as Vice President of the Chicago ACEDS chapter, and as President of the New York Metro ACEDS Chapter.
Praveen Paritosh (pkp@mlcommons.org) - LinkedIn
Praveen Paritosh is a senior research scientist at Google, leading research on data excellence and evaluation for AI systems. He designed the large-scale human curation systems for Freebase and the Google Knowledge Graph. He was the co-organizer and chair for the AAAI Rigorous Evaluation workshops, Crowdcamp 2016, SIGIRWebQA 2015 workshop, the Crowdsourcing at Scale 2013, the shared task challenge at HCOMP 2013, and Connecting Online Learning and Work at HCOMP 2014, CSCW 2015, and CHI 2016 toward the goal of galvanizing research at the intersection of crowdsourcing, natural language understanding, knowledge representation, and rigorous evaluations for artificial intelligence.