CoLI-Dravidian: Word-level Code-Mixed Language Identification in Dravidian Languages Shared Task at FIRE 2024
The CoLI-Dravidian shared task at the 16th meeting of Forum for Information Retrieval Evaluation (FIRE 2024) aims to address word-level language identification challenges in code-mixed Dravidian languages. This task will be held for four languages – Kannada, Tamil, Malayalam, and Tulu. Participants will be provided with code-mixed datasets to encourage the development of advanced language identification models. A real-time leaderboard will be available, and participants can make a maximum of 10 submissions in the training phase and 5 submissions in the testing phase.
To participate, download the data from CodaLab. The important dates for the shared task are as follows:
- 1st July 2024 – test data release
- 25th July – run submission deadline
- 27th July – results declared
For more information, visit the task website at https://sites.google.com/view/coli-dravidian-2024/datasets?authuser=0.