Speech data collection in an under-resourced language within a multilingual context

Speech data collection in an under-resourced language within a multilingual context Molapo, Raymond; Barnard, Etienne; de Wet, Febe In this paper, we present an end-to-end solution to the development of an automatic speech recognition (ASR) system in typical under-resourced languages, where the target language is likely to be influenced by one more embedded foreign languages. We first describe the collection and processing of the text corpus crawled from the World Wide Web using the Rapid Language Adaptation Toolkit. In particular, we highlight the challenges faced when foreign languages are embedded within the matrix language. Thereafter, we discuss our speech data collection efforts in under-resourced environments. We finally report on a strategy called transliteration that aids to improve recognition results of our grapheme-based automatic speech recognition system in the presence of embedded language words.

Speech data collection in an under-resourced language within a multilingual context

Trending Articles

Police confirm man stabbed to death in Selsdon was Andrew David Else of Croydon

मुख मैथुन से उठाएं सेक्स का भरपूर मज़ा, जानें क्या है इसका सही तरीकामुख मैथुन...

Muloraki Au

Windows Server の Essentials エディションは、ドメインのメンバーサーバーとして利用できません。

Police charge man, 23, with assault and criminal damage following incident in...

(Notes & Audio) The 26 Promises of Allah to the Ummah

Raj Panchayat 3rd / Third Grade Teacher Revised Result 2012 Level 1-2...

Practice Sheet of Right form of verbs for HSC Students

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

I Offer a Relaxing Swedish Massage for adult males and females of all ages. :...

Drug dealing brothers caught with £74k stash in Newtown Linford home

Scanmatik 2 SM2 clone diver v2.21.22 free no pass

Notification of Pre-Mature Increment to All the Upgraded Employees since...

Hull man, 27, dies after crashing car into a tree on the A165 near Brandesburton

Brunei reaffirms healthcare commitment

Kalank - Malayalam (1CD ) - subtitles

99 God Status for Whatsapp, Facebook

Skint TV teen to be sentenced

Kanulanu Thaake Lyrics and translation | Manam (2014)

Stephanie cheung vs victoria hay vs estrina ang