ML Scientist

Connecting Scholars with the Latest Academic News and Career Paths

12th Workshop on Challenges in Large Corpora Management

The 12th CMLC workshop invites submissions on managing large corpora, focusing on interoperability, machine learning, and linguistic challenges, with a deadline of 16.02.2026.

The 12th Workshop on the Challenges in the Management of Large Corpora (CMLC) will be held as part of the LREC-2026 conference in Palma, Mallorca. The workshop invites submissions on interoperability and machine learning, as well as on the linguistic, technical, and exploitation challenges of working with large corpora.

Key topics of interest include:

  • Making corpora accessible and interoperable
  • Data preparation for machine learning
  • Dealing with linguistic diversity and inclusion
  • Storage and retrieval solutions for large corpora
  • Legal and privacy issues in corpus management

Submissions are accepted through the START system. The proceedings will be published online by ELRA. The paper submission deadline is 16.02.2026; the workshop date has not yet been announced.

For more information, visit https://corpora.ids-mannheim.de/cmlc-2026.html.

Tags: corpus linguistics, natural language processing, large corpora management, LREC-2026, machine learning, linguistic diversity, corpus accessibility