FIRE 2024 Task: Word-level Code-Mixed Language Identification in Dravidian Languages
The 16th meeting of Forum for Information Retrieval Evaluation (FIRE 2024) is hosting a shared task, CoLI-Dravidian, focused on Word-level Code-Mixed Language Identification in Dravidian Languages. This task aims to address the challenges of language identification in code-mixed Dravidian languages, which are widely spoken in southern India and often feature Roman or hybrid scripts on digital platforms. The shared task will provide code-mixed datasets for four languages: Kannada, Tamil, Malayalam, and Tulu, to encourage the development of advanced language identification models.
Participants can download the data and join the competition through the CodaLab link: https://codalab.lisn.upsaclay.fr/competitions/19357