ML Scientist

Connecting Scholars with the Latest Academic News and Career Paths

News

New Speech Corpora Available: ÌròyìnSpeech and Slovak Autistic and Non-Autistic Child Speech Corpus (SANACS)

Two new speech corpora have been added to the ELRA Catalogue of Language Resources. The first one is ÌròyìnSpeech, a modern, high-fidelity, multi-speaker, Yorùbá read speech corpus suitable for Speech Synthesis, Automatic Speech Recognition and Computational Linguistics research. It contains 34000 read sentences, 42 hours of audio, and is available for download here: ÌròyìnSpeech.

The second corpus is the Slovak Autistic and Non-Autistic Child Speech Corpus (SANACS). It contains 67 recorded sessions of interactions between two native Slovak speakers, and is intended for research in the field of autism. More information about the corpus can be found here: SANACS Corpus.

For more information about the ELRA Catalogue of Language Resources, please visit ELRA Catalogue or contact ELDA.

Leave a Reply

Your email address will not be published. Required fields are marked *