Research

Democratizing new technological capabilities and ensuring widespread adoption require an open approach. MLCommons regularly publishes and presents at top conferences and industry events alongside our broad community, allowing all researchers, scientists, and professionals in AI and ML to access and learn from our work.


Publications

MLCommons is a community-driven effort. We regularly co-author papers with community members to share our collective learnings with the broader community.

The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models

NeurIPS 2024 | Best Paper Award

Introducing v0.5 of the AI Safety Benchmark from MLCommons

arXiv 2024

DataPerf: Benchmarks for Data-Centric AI Development

NeurIPS 2023 | Poster

Adversarial Nibbler: A Data-Centric Challenge for Improving the Safety of Text-to-Image Models

arXiv 2023

MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence using Federated Evaluation

Nature Machine Intelligence 2023 | Journal

Speech Wikimedia: A 77 Language Multilingual Speech Dataset

arXiv 2023 | Paper

The Dollar Street Dataset: Images Representing the Geographic and Socioeconomic Diversity of the World

NeurIPS 2022 | Paper

MLPerf Mobile Inference Benchmark

MLSys 2022 | Paper

MLPerf Tiny Benchmark

NeurIPS 2021 | Paper

Benchmarking TinyML Systems: Challenges and Direction

arXiv 2021 | Paper

The People’s Speech: A Large-Scale Diverse English Speech Recognition Dataset for Commercial Usage

NeurIPS 2021 | Paper

Multilingual Spoken Words Corpus

NeurIPS 2021 | Paper

MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems

arXiv 2021 | Paper

Software/hardware co-optimization on the IPU: An MLPerf™ case study

Hot Chips 33 2021 | Paper

Data Engineering for Everyone

arXiv 2021 | Paper

LSH methods for data deduplication in a Wikipedia artificial dataset

arXiv 2021 | Paper

MLPerf Training and Inference

Hot Chips 33 2021 | Tutorial

MLPerf Training Benchmark

MLSys 2020 | Paper

MLPerf Inference Benchmark

ISCA 2020 | Paper

MLPerf: A Benchmark Suite for Machine Learning from an Academic-Industry Cooperative

Hot Chips 31 2019 | Paper

Talks

David Kanter

CASPA Spring Symposium

Driving ML Forward in Automotive

Victor Bittorf

ASPLOS 2021

What is MLCube

Tom St. John

ASPLOS 2021

MLPerf Automotive Overview

Alex Karargyris

ASPLOS 2021

Medical Imaging Benchmark using MLPerf

Murali Emani

ASPLOS 2021

MLPerf HPC: A Benchmark Suite for Large scale ML on HPC Systems

Greg Diamos

ASPLOS 2021

Data-centric Speech for Machine Learning Systems

Wookie Hong

ASPLOS 2021

Mobile AI Performance Benchmarking & Analysis with the MLPerf App

Christine Cheng

ASPLOS 2021

MLPerf Inference Benchmark Suite

Peter Mattson

ASPLOS 2021

MLPerf Training Benchmark Suite

NeurIPS 2021

Multilingual Spoken Words Corpus