ML Scientist

Connecting Scholars with the Latest Academic News and Career Paths

Conference Calls

Code-Mixed Language Identification in Dravidian Languages at FIRE 2024

The 16th meeting of Forum for Information Retrieval Evaluation (FIRE 2024) is hosting a shared task, CoLI-Dravidian, focusing on Word-level Code-Mixed Language Identification in Dravidian Languages. The task involves detecting the language(s) used in a given text, particularly in code-mixed Dravidian languages, which blend local languages with English at various levels. This shared task aims to encourage the development of advanced Language Identification models for under-resourced Dravidian languages like Kannada, Tamil, Malayalam, and Tulu.

Participants will be allowed to make a maximum of 10 submissions in the training phase and 5 submissions in the testing phase through CodaLab. A real-time leaderboard will be available, and each team will have to select the best submission for ranking. The deadline for run submission is 25th July, and results will be declared on 27th July. Working notes are due on 27th August, and camera-ready copies of working notes are expected by 30th October.

To participate, download the data from CodaLab and submit your runs before the deadline. For more information, visit the task website at https://sites.google.com/view/coli-dravidian-2024/datasets?authuser=0.

Leave a Reply

Your email address will not be published. Required fields are marked *