Fr. 139.00

Speech and Audio Processing for Coding, Enhancement and Recognition

English · Paperback / Softback

Shipping usually within 6 to 7 weeks

Description

This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals. It covers advanced statistical and machine learning techniques for speech and speaker recognition and gives an overview of the key innovations in these areas. Key research in speech coding, speech enhancement, speech recognition, emotion recognition and speaker diarization is also presented, along with recent advances and new paradigms.




List of contents

From 'Harmonic Telegraph' to Cellular Phones
Challenges in Speech Coding Research
Recent Speech Coding Technologies and Standards
Ensemble Learning Approaches in Speech Recognition
Dynamic and Deep Networks For Speech Modeling and Recognition
Speech Based Emotion Recognition
Speaker Diarization: Challenges and Emerging Research
Maximum a posteriori spectral estimation with source log-spectral priors for multichannel speech enhancement
Modulation Processing for Speech Enhancement

About the author

Tokunbo Ogunfunmi is an Associate Professor of Electrical Engineering and an Associate Dean for Research and Faculty Development at Santa Clara University.
Roberto Togneri is a professor with the School of Electrical, Electronic and Computer Engineering at The University of Western Australia.
Madihally (Sim) Narasimha is a Senior Director of Technology at Qualcomm Inc.

Product details

Assisted by Madihally (Sim) Narasimha (Editor), Tokunbo Ogunfunmi (Editor), Roberto Togneri (Editor)
Publisher Springer, Berlin
 
Languages English
Product format Paperback / Softback
Released 01.01.2016
 
EAN 9781493948048
ISBN 978-1-4939-4804-8
No. of pages 345
Dimensions 158 mm x 236 mm x 22 mm
Weight 556 g
Illustrations X, 345 p. 79 illus., 32 illus. in color.
Subjects Natural sciences, medicine, IT, technology > Technology > Electronics, electrical engineering, communications engineering

Engineering, Multimedia Information Systems, Signal, Image and Speech Processing, Signal Processing, Speech processing systems, Digital and Analog Signal Processing, Image processing, Graphical & digital media applications, User interface design & usability, User Interfaces and Human Computer Interaction, User interfaces (Computer systems)
