Dataset development

Blog posts tagged as Dataset development

The Emerging Task of Measuring AI Training Data
A new perspective paper argues for "measuring data" as a critical task to advance responsible AI development. Just as physical objects can be measured, data used to train AI systems should also be quantitatively analyzed to understand its composition.
Ines Almeida
13.08.23 08:14 PM
Making Data Work More Visible Through Documentation
A new study provides insights into the complex processes and people behind ML data work.
Ines Almeida
13.08.23 12:31 PM
Examining How AI Training Datasets Are Built: A Framework for More Responsible Practices
In a recent paper, researchers Mehtab Khan and Alex Hanna highlight the need for greater scrutiny, transparency, and accountability in how massive datasets for machine learning models are created.
Ines Almeida
13.08.23 11:51 AM
Machine learning models rely heavily on their training datasets and inherit those datasets' biases and limitations. This research proposes "datasheets for datasets" to increase transparency and mitigate risks.
Ines Almeida
13.08.23 11:21 AM