ML Scientist

Connecting Scholars with the Latest Academic News and Career Paths

FeaturedNews

Reference Corpus of Middle High German (ReM) Version 2 Released

Version 2 of the Reference Corpus of Middle High German (ReM) is released, offering improved annotations and new formats for download.

The Reference Corpus of Middle High German (ReM) has been updated to Version 2, available for download at https://linguistics.rub.de/rem. This corpus contains over two million tokens, covering written records from 1050 to 1350.

  • Corrections and improvements to tokenization and linguistic annotations
  • New documents added to the corpus
  • Available in various formats: CorA-XML, TEI XML, GraphML, and JSON
  • Accessible via ANNIS 4 at https://newannis.linguistics.rub.de/rem

The corpus is licensed under Creative Commons Attribution-ShareAlike 4.0 (CC BY-SA 4.0).

Tags: Middle High German, Reference Corpus, ReM, Corpus Linguistics, Historical Language Research, Computational Linguistics, Digital Humanities