CHF 63.00

Multilingual Phone Recognition in Indian Languages

English · Paperback

Usually ships in 4 to 7 working days

Description

This book presents current research and developments in multilingual speech recognition. The author describes a Multilingual Phone Recognition System (Multi-PRS) built on a common multilingual phone set derived from International Phonetic Alphabet (IPA)-based transcriptions of six Indian languages: Kannada, Telugu, Bengali, Odia, Urdu, and Assamese. The author shows how the performance of Multi-PRS can be improved using tandem features, and compares Monolingual Phone Recognition Systems (Mono-PRS) with Multi-PRS as well as baseline systems with tandem systems. Methods are proposed for predicting Articulatory Features (AFs) from spectral features using Deep Neural Networks (DNNs), and multitask learning is explored to improve AF prediction accuracy. The AFs are then used to improve the performance of Multi-PRS through two combination methods: lattice rescoring and tandem. Finally, the author develops and evaluates two multilingual phone recognition approaches: Language Identification followed by Monolingual phone recognition (LID-Mono), and a system based on the common multilingual phone set.

About the author

Dr. Manjunath K E received his PhD in multilingual speech recognition from the International Institute of Information Technology, Bangalore, India, and his MS in automatic speech recognition from the Indian Institute of Technology, Kharagpur, India. He currently works as a Scientist at the U R Rao Satellite Centre, Indian Space Research Organisation (ISRO). He has published in several international conferences and journals, and co-authored the book "Speech recognition using Articulatory and Excitation Source Features" (Springer 2017).


Product details

Author Manjunath K E
Publisher Springer, Berlin
 
Content Book
Format Paperback
Publication date 25.09.2021
Subject Natural sciences, medicine, computer science, technology > Technology > Electronics, electrical engineering, communications engineering
 
EAN 9783030807405
ISBN 978-3-030-80740-5
Number of pages 103
Illustrations XIV, 103 p., 28 illus., 9 illus. in color.
Dimensions (packaging) 15.5 x 0.6 x 23.5 cm
 
Series SpringerBriefs in Speech Technology
Topics Electronics, Computational linguistics and corpus linguistics, Audio signal processing, Deep learning, Feature fusion, International Phonetic Alphabet, Articulatory features, Indian languages
 
