Share
Fr. 46.90
Don Davis, Jun Huang, Stephen Levinson, Scott Slimon, Scott et al Slimon
Articulatory Speech Synthesis from the Fluid Dynamics of the Vocal Apparatus
English · Paperback / Softback
Shipping usually within 1 to 2 weeks (title will be printed to order)
Description
This book addresses the problem of articulatory speech synthesis based on computed vocal tract geometries and the basic physics of sound production in it. Unlike conventional methods based on analysis/synthesis using the well-known source filter model, which assumes the independence of the excitation and filter, we treat the entire vocal apparatus as one mechanical system that produces sound by means of fluid dynamics. The vocal apparatus is represented as a three-dimensional time-varying mechanism and the sound propagation inside it is due to the non-planar propagation of acoustic waves through a viscous, compressible fluid described by the Navier-Stokes equations. We propose a combined minimum energy and minimum jerk criterion to compute the dynamics of the vocal tract during articulation. Theoretical error bounds and experimental results show that this method obtains a close match to the phonetic target positions while avoiding abrupt changes in the articulatory trajectory. The vocal folds are set into aerodynamic oscillation by the flow of air from the lungs. The modulated air stream then excites the moving vocal tract. This method shows strong evidence for source-filter interaction. Based on our results, we propose that the articulatory speech production model has the potential to synthesize speech and provide a compact parameterization of the speech signal that can be useful in a wide variety of speech signal processing problems. Table of Contents: Introduction / Literature Review / Estimation of Dynamic Articulatory Parameters / Construction of Articulatory Model Based on MRI Data / Vocal Fold Excitation Models / Experimental Results of Articulatory Synthesis / Conclusion
List of contents
Introduction.- Literature Review.- Estimation of Dynamic Articulatory Parameters.- Construction of Articulatory Model Based on MRI Data.- Vocal Fold Excitation Models.- Experimental Results of Articulatory Synthesis.- Conclusion.
About the author
Stephen E. Levinson received his Ph.D. in Electrical Engineering from the University of Rhode Island, Kingston, Rhode Island in 1974. From 1966 - 1969 he was a design engineer at Electric Boat Division of General Dynamics in Groton, Connecticut. From 1974-1976 he held a J. Willard Gibbs Instructorship in Computer Science at Yale University. In 1976, he joined the technical staff of Bell Laboratories in Murray Hill, NJ where he conducted research in the areas of speech recognition and understanding. In 1979, he was a visiting researcher at the NTT Musashino Electrical Communication Laboratory in Tokyo, Japan. He held a visiting fellowship in the Engineering Department at Cambridge University in 1984, and in 1990 he became head of the Linguistics Research Department at AT&T Bell Laboratories where he directed research in Speech Synthesis, Speech Recognition, and Spoken Language Translation. He joined the Department of Electrical and Computer Engineering of the University of Illinois at Urbana-Champaign in 1997, where he teaches courses in Speech and Language Processing and leads research projects in speech synthesis and automatic language acquisition. He is also a full-time faculty member of the Beckman Institute for Advanced Science and Technology where he serves as the head of the Artificial Intelligence group. Dr. Levinson is a member of the Association for Computing Machinery, a fellow of the Institute of Electrical and Electronic Engineers, and a fellow of the Acoustical Society of America. He is a founding editor of the journal Computer Speech and Language and a former member and chair of the Industrial Advisory Board of the CAIP Center at Rutgers University. He is the author of more than 100 technical papers and holds 7 patents. His book, published in 2005 by John Wiley and Sons, Ltd., is entitled Mathematical Models for Speech Technology.Donald W. Davis, Jr. received B. S., M. S., and Ph. D. degrees in Aeronautical Engineering from Purdue University in 1970, 1975, and 1981, respectively. Currently, he is a Staff Engineer at Electric Boat Corporation where he works in the area of computational fluid dynamics (CFD). His research interests include fluid mechanics, heat transfer, computational methods, and turbulence modeling. He is also involved in applying advanced CFD tools to large, complex, industrially relevant turbulent flow problems.
Product details
| Authors | Don Davis, Jun Huang, Stephen Levinson, Scott Slimon, Scott et al Slimon |
| Publisher | Springer, Berlin |
| Original title | Articulatory Speech Synthesis from the Fluid Dynamics of the Vocal Apparatus |
| Languages | English |
| Product format | Paperback / Softback |
| Released | 01.01.2012 |
| EAN | 9783031014352 |
| ISBN | 978-3-0-3101435-2 |
| No. of pages | 104 |
| Dimensions | 191 mm x 6 mm x 235 mm |
| Illustrations | XII, 104 p. |
| Series |
Synthesis Lectures on Speech and Audio Processing |
| Subject |
Natural sciences, medicine, IT, technology
> Technology
> Electronics, electrical engineering, communications engineering
|
Customer reviews
No reviews have been written for this item yet. Write the first review and be helpful to other users when they decide on a purchase.
Write a review
Thumbs up or thumbs down? Write your own review.