ML Scientist

Connecting Scholars with the Latest Academic News and Career Paths

News

Introducing Ulysses Tesemõ: A Comprehensive Corpus for the Brazilian Legal Domain

Introducing Ulysses Tesemõ, a vast corpus tailored for the Brazilian legal domain. This corpus comprises over 3.5 million files, totaling a massive 30.7 GiB of raw text data. It has been gathered from a diverse range of 159 sources, including judicial, legislative, academic, news, and other related domains.

For more information, please refer to the following link: https://doi.org/10.1007/s10579-024-09762-8

Best Regards, Ellen Souza

Leave a Reply

Your email address will not be published. Required fields are marked *