Birzeit University’s SinaLab Unveils Qabas: A Comprehensive Open-Source Lexicographic Database for Arabic NLP Applications
Birzeit University’s SinaLab for Computational Linguistics and Artificial Intelligence has officially launched Qabas, an open-source lexicographic database for Arabic, specifically designed for Natural Language Processing (NLP) applications. Qabas is a unique resource that links its lexical entries (lemmas) with lemmas from 110 different lexicons and numerous morphologically annotated corpora, creating an extensive lexicographic graph. The project has been under development for over fourteen years and is publicly available online for both commercial and non-commercial purposes.
Prof. Mustafa Jarrar, the project’s manager and main author, emphasized the importance of making Qabas freely available as an open-source resource, allowing everyone to access and use it for innovative content and applications that benefit humanity. Qabas is the largest Arabic lexicon, encompassing about 58K lemmas, and is publicly available at https://sina.birzeit.edu/qabas.