ML Scientist

Connecting Scholars with the Latest Academic News and Career Paths

News

Introducing Data Statements Schema Version 3 for Language Datasets

Learn about the new Data Statements Schema Version 3 for language datasets, and access the guide and resources for creating and documenting language datasets at Tech Policy Lab.

A new version of the Data Statements Schema for language datasets is now available. The schema, version 3, builds on the 2023 dissertation of Angelina McMillan-Major and serves as a guide for creating and documenting language datasets. The guide includes templates for data statements and other resources, and is now available for use.

Access the guide and resources at https://techpolicylab.uw.edu/data-statements/

Tags: Data Statements Schema, Language Datasets, Angelina McMillan-Major, Emily M. Bender, Data Collection, Data Documentation, Tech Policy Lab

Leave a Reply

Your email address will not be published. Required fields are marked *