Francesco Camastra, Alessandro Vinciarelli
Machine Learning for Audio, Image and Video Analysis - Theory and Applications
English · Paperback / Softback
Description
Machine Learning involves several scientific domains including mathematics, computer science, statistics and biology, and is an approach that enables computers to automatically learn from data. Focusing on complex media and how to convert raw data into useful information, this book offers both introductory and advanced material in the combined fields of machine learning and image/video processing.
The machine learning techniques presented enable readers to address many real-world problems involving complex data. Examples covering areas such as automatic speech and handwriting transcription, automatic face recognition, and semantic video segmentation are included, along with detailed introductions to algorithms and examples of their applications.
The book is organized in four parts. The first focuses on technical aspects, basic mathematical notions and elementary machine learning techniques. The second provides an extensive survey of the most relevant machine learning techniques for media processing, while the third focuses on applications and shows how the techniques are applied to actual problems. The fourth contains detailed appendices covering the main mathematical tools used throughout the text.
Students and researchers needing a solid foundation or reference, and practitioners interested in discovering more about the state of the art, will find this book invaluable. Examples and problems are based on data and software packages publicly available on the web.
List of contents
From Perception to Computation.- Audio Acquisition, Representation and Storage.- Image and Video Acquisition, Representation and Storage.- Machine Learning.- Machine Learning.- Bayesian Theory of Decision.- Clustering Methods.- Foundations of Statistical Learning and Model Selection.- Supervised Neural Networks and Ensemble Methods.- Kernel Methods.- Markovian Models for Sequential Data.- Feature Extraction Methods and Manifold Learning Methods.- Applications.- Speech and Handwriting Recognition.- Automatic Face Recognition.- Video Segmentation and Keyframe Extraction.
Summary
This book illustrates how to deal with complex media and convert raw data into useful information. Students and researchers needing a solid foundation or reference, and practitioners interested in discovering more about the state of the art, will find this book invaluable.
Report
From the reviews:
"A book that focuses on the intersection and interaction of these two fast-growing areas could not be better timed. ... the book is organized into three major parts that cover audio and video processing, machine learning, and applications. ... On the whole, this is a valuable and timely reference book for those interested in machine learning or audio, video, and image processing, although the need for a well-integrated book on this topic still remains." (M. Sasikumar, ACM Computing Reviews, December, 2008)
"...this book, unlike most other books in this field, not only introduces a few widely used techniques in audio and image analysis, but also discusses the latest advancements in the field. ...Distinct from other books, it also points out several public software packages and benchmark data sets that encourage the reader to have a hands-on experience on how machine-learning techniques work to analyze audio and visual content. Its comprehensive coverage on recent development in this research area makes it easy for experienced researchers to further explore the latest techniques. ...it is ideal as a textbook or supplemental material for senior graduate courses or advanced topic seminars." (Jie Yu, Journal of Electronic Imaging, Vol. 18, Apr-Jun 2009)
Product details
Authors | Francesco Camastra, Alessandro Vinciarelli |
Publisher | Springer, Berlin |
Languages | English |
Product format | Paperback / Softback |
Released | 26.10.2010 |
EAN | 9781849966993 |
ISBN | 978-1-84996-699-3 |
No. of pages | 494 |
Illustrations | 6 tables |
Series | Advanced Information and Knowledge Processing |
Subject | Natural sciences, medicine, IT, technology > IT, data processing > IT |