ML Scientist

Connecting Scholars with the Latest Academic News and Career Paths

FeaturedNews

Universal Dependencies v2.16 Released with 319 Treebanks

Universal Dependencies v2.16 released with 319 treebanks across 179 languages, featuring significant updates and expansions.

The Universal Dependencies project has released version 2.16, featuring 319 treebanks across 179 languages. The release is available at https://universaldependencies.org/.

The treebanks are annotated according to version 2 of the UD guidelines and represent languages from 35 families, including Indo-European, Sino-Tibetan, and Afro-Asiatic.

  • Notable updates include significant changes in 48 treebanks, such as the addition of new treebanks for languages like Alemannic, Coptic, and Georgian.
  • The release contains 2,263,318 sentences, 36,437,487 surface tokens, and 37,158,675 syntactic words.

Tags: Universal Dependencies, treebank annotation, multilingual NLP, computational linguistics, language typology