ML Scientist

Connecting Scholars with the Latest Academic News and Career Paths

News

Introducing Ulysses Tesemõ: A Comprehensive Corpus for the Brazilian Legal Domain

Notice: Heads up: This article was published more than 6 months ago. Details, links, or policies may have changed since then.

Introducing Ulysses Tesemõ, a vast corpus tailored for the Brazilian legal domain. This corpus comprises over 3.5 million files, totaling a massive 30.7 GiB of raw text data. It has been gathered from a diverse range of 159 sources, including judicial, legislative, academic, news, and other related domains.

For more information, please refer to the following link: https://doi.org/10.1007/s10579-024-09762-8

Best Regards, Ellen Souza

Leave a Reply