Fr. 116.00

Machine Learning for Audio, Image and Video Analysis - Theory and Applications

English · Hardback

Shipping usually within 2 to 3 weeks (title will be printed to order)

Description

Read more

This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book.
Divided into three main parts, From Perception to Computation introduces methodologies aimed at representing the data in forms suitable for computer processing, especially when it comes to audio and images. Whilst the second part, Machine Learning includes an extensive overview of statistical techniques aimed at addressing three main problems, namely classification (automatically assigning a data sample to one of the classes belonging to a predefined set), clustering (automatically grouping data samples according to the similarity of their properties) and sequence analysis (automatically mapping a sequence of observations into a sequence of human-understandable symbols). The third partApplications shows how the abstract problems defined in the second part underlie technologies capable to perform complex tasks such as the recognition of hand gestures or the transcription of handwritten data.
Machine Learning for Audio, Image and Video Analysis is suitable for students to acquire a solid background in machine learning as well as for practitioners to deepen their knowledge of the state-of-the-art. All application chapters are based on publicly available data and free software packages, thus allowing readers to replicate the experiments.

List of contents

Introduction.- Part I: From Perception to Computation.- Audio Acquisition, Representation and Storage.- Image and Video Acquisition, Representation and Storage.- Part II: Machine Learning.- Machine Learning.- Bayesian Theory of Decision.- Clustering Methods.- Foundations of Statistical Learning and Model Selection.- Supervised Neural Networks and Ensemble Methods.- Kernel Methods.- Markovian Models for Sequential Data.- Feature Extraction Methods and Manifold Learning Methods.- Part III: Applications.- Speech and Handwriting Recognition.- Speech and Handwriting Recognition.- Video Segmentation and Keyframe Extraction.- Real-Time Hand Pose Recognition.- Automatic Personality Perception.- Part IV: Appendices.- Appendix A: Statistics.- Appendix B: Signal Processing.- Appendix C: Matrix Algebra.- Appendix D: Mathematical Foundations of Kernel Methods.- Index.

Summary

This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book.
Divided into three main parts, From Perception to Computation introduces methodologies aimed at representing the data in forms suitable for computer processing, especially when it comes to audio and images. Whilst the second part, Machine Learning includes an extensive overview of statistical techniques aimed at addressing three main problems, namely classification (automatically assigning a data sample to one of the classes belonging to a predefined set), clustering (automatically grouping data samples according to the similarity of their properties) and sequence analysis (automatically mapping a sequence of observations into a sequence of human-understandable symbols). The third partApplications shows how the abstract problems defined in the second part underlie technologies capable to perform complex tasks such as the recognition of hand gestures or the transcription of handwritten data.
Machine Learning for Audio, Image and Video Analysis is suitable for students to acquire a solid background in machine learning as well as for practitioners to deepen their knowledge of the state-of-the-art. All application chapters are based on publicly available data and free software packages, thus allowing readers to replicate the experiments.

Additional text

“This nice book of over 560 pages is really useful for students, researchers, practitioners, and anybody who is interested in machine learning and related subjects.” (Michael M. Dediu, Mathematical Reviews, May, 2017)

Report

"This nice book of over 560 pages is really useful for students, researchers, practitioners, and anybody who is interested in machine learning and related subjects." (Michael M. Dediu, Mathematical Reviews, May, 2017)

Product details

Authors Francesc Camastra, Francesco Camastra, Alessandro Vinciarelli
Publisher Springer, Berlin
 
Languages English
Product format Hardback
Released 01.01.2015
 
EAN 9781447167341
ISBN 978-1-4471-6734-1
No. of pages 561
Dimensions 156 mm x 35 mm x 243 mm
Weight 1033 g
Illustrations XVI, 561 p. 119 illus.
Series Advanced Information and Knowledge Processing
Advanced Information and Knowledge Processing
Subject Natural sciences, medicine, IT, technology > IT, data processing > IT

Customer reviews

No reviews have been written for this item yet. Write the first review and be helpful to other users when they decide on a purchase.

Write a review

Thumbs up or thumbs down? Write your own review.

For messages to CeDe.ch please use the contact form.

The input fields marked * are obligatory

By submitting this form you agree to our data privacy statement.