MLCommons Datasets

People’s Speech

The MLCommons People’s Speech dataset is among the world’s largest English speech recognition corpora. It is licensed for academic and commercial use under CC-BY-SA and CC-BY 4.0.

About the dataset

The MLCommons People’s Speech dataset includes 30,000+ hours of transcribed English speech from a diverse set of speakers. This open dataset is large enough to train speech-to-text systems and, crucially, is available under a permissive license. Just as ImageNet catalyzed machine learning for vision, People’s Speech is intended to unleash innovation in speech research and in products available to users across the globe.

Dataset Details

  • Date: 2022-11-17
  • Hours: 30,000+
  • Examples: 23.7 million
  • Audio Format: FLAC
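
The figures above imply a rough average clip length, which can help when budgeting storage or batch sizes. The following is a minimal sketch that uses only the two numbers listed here; the constant and function names are illustrative, and because 30,000 hours is a lower bound, the result is a lower bound too.

```python
# Figures taken from the dataset details above.
# 30,000 hours is stated as a minimum ("30,000+"), so the result is a lower bound.
TOTAL_HOURS = 30_000
NUM_EXAMPLES = 23_700_000


def average_clip_seconds(total_hours: float, num_examples: int) -> float:
    """Return the mean clip duration in seconds."""
    return total_hours * 3600 / num_examples


avg = average_clip_seconds(TOTAL_HOURS, NUM_EXAMPLES)
print(f"Average clip length: at least {avg:.2f} s")
```

This is an average only; the page does not describe the distribution of clip lengths.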

Download
