ML Scientist

FIRE 2024 Task: Word-level Code-Mixed Language Identification in Dravidian Languages

Notice: This article was published more than six months ago. Details, links, or policies may have changed since then.

The 16th meeting of the Forum for Information Retrieval Evaluation (FIRE 2024) is hosting a shared task, CoLI-Dravidian, on word-level code-mixed language identification in Dravidian languages. The task addresses the challenges of identifying languages in code-mixed Dravidian text. Dravidian languages are widely spoken in southern India, and on digital platforms they are often written in Roman or hybrid scripts. The shared task provides code-mixed datasets for four languages (Kannada, Tamil, Malayalam, and Tulu) to encourage the development of advanced language identification models.

Participants can download the data and join the competition through the CodaLab link: https://codalab.lisn.upsaclay.fr/competitions/19357
