Data-centric AI Resource Hub

Find the latest developments and best practices compiled here, so you can begin your Data-centric AI journey!

What is Data-centric AI?

Data-centric AI is the discipline of systematically engineering the data used to build an AI system.

Explore Topics

Labeling and Crowdsourcing

Introduction by

Michael Bernstein

Associate Professor, Human-Computer Interaction Group, Stanford University

Learn more

Data Augmentation

Introduction by

Anima Anandkumar

Bren Professor at Caltech CMS department and a Director of machine learning research at NVIDIA

Learn more

Data in Deployment

Introduction by

D. Sculley

Director of Engineering at Google Brain

Learn more

Latest Posts

The Hugging Face 🤗 Data Measurements Tool

Data in Deployment

The Hugging Face 🤗 Data Measurements Tool

We created the Hugging Face 🤗 Data Measurements Tool, a no-code interface that helps empower members of the AI community to build, measure, and compare datasets.

Sasha Luccioni

Published 31 Mar 2022

Past and Future of data centric AI

Olga Russakovsky This video is from the NeurIPS 2021 Data-centric AI workshop proceedings.

Published 31 Mar 2022

Finding millions of label errors with Cleanlab

Curtis G. Northcutt, Anish Athalye, Jonas Mueller This video is from the NeurIPS 2021 Data-centric AI workshop proceedings.

Published 31 Mar 2022

Q&A with Morning Invited + Keynote Speakers

Sharon Zhou, Douwe Kiela, Peter Mattson, Michael Bernstein, Praveen Paritosh, Lora Aroyo This video is from the NeurIPS 2021 Data-centric AI workshop proceedings.

Published 31 Mar 2022

Q&A with Afternoon Invited + Keynote Speakers + Closing Remarks

Andrew Ng, Sharon Zhou, Alex Ratner, D. Sculley, Curtis Northcutt, Carole-Jean Wu This video is from the NeurIPS 2021 Data-centric AI workshop proceedings.

Published 31 Mar 2022