New Speech Corpora Available: ÌròyìnSpeech and Slovak Autistic and Non-Autistic Child Speech Corpus (SANACS)

June 5, 2024

Notice: Heads up: This article was published more than 6 months ago. Details, links, or policies may have changed since then.

Two new speech corpora have been added to the ELRA Catalogue of Language Resources. The first one is ÌròyìnSpeech, a modern, high-fidelity, multi-speaker, Yorùbá read speech corpus suitable for Speech Synthesis, Automatic Speech Recognition and Computational Linguistics research. It contains 34000 read sentences, 42 hours of audio, and is available for download here: ÌròyìnSpeech.

The second corpus is the Slovak Autistic and Non-Autistic Child Speech Corpus (SANACS). It contains 67 recorded sessions of interactions between two native Slovak speakers, and is intended for research in the field of autism. More information about the corpus can be found here: SANACS Corpus.

For more information about the ELRA Catalogue of Language Resources, please visit ELRA Catalogue or contact ELDA.

ML Scientist

New Speech Corpora Available: ÌròyìnSpeech and Slovak Autistic and Non-Autistic Child Speech Corpus (SANACS)

Leave a Reply Cancel reply

Related Reading

ACL Rolling Review Discontinues MS Word Template

FRCCS 2025 Publications & ISCS 2026 Announcement

WSDM 2026: Call for Industry Day Talks – Extended Deadline

Leave a Reply Cancel reply