Introducing Ulysses Tesemõ: A Comprehensive Corpus for the Brazilian Legal Domain
Notice: Heads up: This article was published more than 6 months ago. Details, links, or policies may have changed since then.
Introducing Ulysses Tesemõ, a vast corpus tailored for the Brazilian legal domain. This corpus comprises over 3.5 million files, totaling a massive 30.7 GiB of raw text data. It has been gathered from a diverse range of 159 sources, including judicial, legislative, academic, news, and other related domains.
For more information, please refer to the following link: https://doi.org/10.1007/s10579-024-09762-8
Best Regards, Ellen Souza