12th Workshop on the Challenges in the Management of Large Corpora (CMLC)
The 12th CMLC workshop invites submissions on the management of large corpora, with a focus on interoperability, machine learning, and linguistic challenges. The paper submission deadline is 16.02.2026.
The 12th Workshop on the Challenges in the Management of Large Corpora (CMLC) will be held as part of the LREC-2026 conference in Palma, Mallorca. The workshop invites submissions on topics including interoperability, machine learning, linguistic content challenges, technical challenges, and exploitation challenges.
Key topics of interest include:
- Making corpora accessible and interoperable
- Data preparation for machine learning
- Dealing with linguistic diversity and inclusion
- Storage and retrieval solutions for large corpora
- Legal and privacy issues in corpus management
Submissions are accepted through the START system. The proceedings will be published online by ELRA. The paper submission deadline is 16.02.2026; the workshop date is to be announced.
For more information, visit https://corpora.ids-mannheim.de/cmlc-2026.html.
Tags: corpus linguistics, natural language processing, large corpora management, LREC-2026, machine learning, linguistic diversity, corpus accessibility