MLCommons Datasets
Dollar Street
The MLCommons Dollar Street dataset is a collection of images of everyday household items from homes around the world. It visually captures the socioeconomic diversity of traditionally underrepresented populations.
About the dataset
The MLCommons Dollar Street dataset consists of public domain data, licensed for academic, commercial and non-commercial usage, under CC-BY and CC-BY-SA 4.0. The dataset was developed because similar datasets lack socioeconomic metadata and are not representative of global diversity.
It includes 38,479 images collected from 63 countries, each tagged with topics drawn from a set of 289. The metadata for each image also includes demographic information such as region, country, and total monthly household income. This supports many different use cases and can enrich image datasets for computer vision.
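To make the per-image metadata described above more concrete, here is a minimal sketch of how such records might be filtered by topic and income, and counted by region. It uses only the Python standard library. The column names (`image_id`, `region`, `country`, `monthly_income`, `topics`), the `|` topic separator, and the sample rows are all hypothetical placeholders, not the dataset's actual schema or values; check the files shipped with the dataset for the real layout.

```python
import csv
import io
from collections import defaultdict

# Hypothetical metadata layout: the column names, the "|" topic separator,
# and the rows below are illustrative placeholders, NOT the dataset's
# actual schema or values.
SAMPLE_METADATA = """image_id,region,country,monthly_income,topics
img_001,Region A,Country X,120.0,stove|kitchen
img_002,Region A,Country Y,950.0,stove
img_003,Region B,Country Z,3100.0,bed|bedroom
"""


def load_metadata(text):
    """Parse CSV text into a list of dicts with a numeric income and a topic list."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        row["monthly_income"] = float(row["monthly_income"])
        row["topics"] = row["topics"].split("|")
        rows.append(row)
    return rows


def filter_by_topic_and_income(rows, topic, max_income):
    """Return rows tagged with `topic` whose household income is at most `max_income`."""
    return [
        r for r in rows
        if topic in r["topics"] and r["monthly_income"] <= max_income
    ]


def count_by_region(rows):
    """Count how many images fall in each region."""
    counts = defaultdict(int)
    for r in rows:
        counts[r["region"]] += 1
    return dict(counts)


rows = load_metadata(SAMPLE_METADATA)
low_income_stoves = filter_by_topic_and_income(rows, "stove", 1000.0)
print([r["image_id"] for r in low_income_stoves])
print(count_by_region(rows))
```

With the real metadata file, `load_metadata` would read from disk instead of an in-memory string, and the column names would need to match whatever the dataset actually provides.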
- Read our full paper here.
- Join the Dollar Street mailing list here.
- Connect with other Dollar Street users on the Datasets working group discord server.
- Link to the original project.
Details
- Date: 2021-11-09
- Size: 101.3 GB
- Examples: 38,479
- Format: JPG and PNG