KUTED
Corpus: Kurdish TED (KUTED)Licence: CreativeCommons Attribution NonCommercial-ShareAlike 4.0 International License.URL: https://huggingface.co/datasets/aranemini/kurdishtedAuthor(s): Mohammad MohammadaminiAntoine LaurentDescription Kurdish TED (KUTED) is the first Speech-to-Text-Translation (S2TT) dataset for the Central Kurdish language derived from TED Talks and TEDx. The corpus consists of 91,000 pairs, encompassing 170 hours of English audio, 1.65 million English tokens, and 1.40 million Central Kurdish […]