<p><P>Machine Learning involves several scientific domains including mathematics, computer science, statistics and biology, and is an approach that enables computers to automatically learn from data. Focusing on complex media and how to convert raw data into useful information, this book offers both
Machine learning for audio, image and video analysis : theory and applications
โ Scribed by Camastra, Francesco; Vinciarelli, Alessandro
- Publisher
- Springer
- Year
- 2015
- Tongue
- English
- Leaves
- 564
- Series
- Advanced information and knowledge processing
- Edition
- 2ed.
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book.
Divided into three main parts, From Perception to Computation introduces methodologies aimed at representing the data in forms suitable for computer processing, especially when it comes to audio and images. Whilst the second part, Machine Learning includes an extensive overview of statistical techniques aimed at addressing three main problems, namely classification (automatically assigning a data sample to one of the classes belonging to a predefined set), clustering (automatically grouping data samples according to the similarity of their properties) and sequence analysis (automatically mapping a sequence of observations into a sequence of human-understandable symbols). The third part Applications shows how the abstract problems defined in the second part underlie technologies capable to perform complex tasks such as the recognition of hand gestures or the transcription of handwritten data.
Machine Learning for Audio, Image and Video Analysis is suitable for students to acquire a solid background in machine learning as well as for practitioners to deepen their knowledge of the state-of-the-art. All application chapters are based on publicly available data and free software packages, thus allowing readers to replicate the experiments.
โฆ Table of Contents
Front Matter....Pages i-xvi
Introduction....Pages 1-10
Front Matter....Pages 11-11
Audio Acquisition, Representation and Storage....Pages 13-55
Image and Video Acquisition, Representation and Storage....Pages 57-96
Front Matter....Pages 97-97
Machine Learning....Pages 99-106
Bayesian Theory of Decision....Pages 107-129
Clustering Methods....Pages 131-167
Foundations of Statistical Learning and Model Selection....Pages 169-190
Supervised Neural Networks and Ensemble Methods....Pages 191-227
Kernel Methods....Pages 229-293
Markovian Models for Sequential Data....Pages 295-340
Feature Extraction Methods and Manifold Learning Methods....Pages 341-386
Front Matter....Pages 387-387
Speech and Handwriting Recognition....Pages 389-419
Automatic Face Recognition....Pages 421-448
Video Segmentation and Keyframe Extraction....Pages 449-465
Real-Time Hand Pose Recognition....Pages 467-484
Automatic Personality Perception....Pages 485-498
Back Matter....Pages 499-561
โฆ Subjects
Pattern Recognition; Image Processing and Computer Vision; Multimedia Information Systems
๐ SIMILAR VOLUMES
<p><P>Machine Learning involves several scientific domains including mathematics, computer science, statistics and biology, and is an approach that enables computers to automatically learn from data. Focusing on complex media and how to convert raw data into useful information, this book offers both
<p><p>This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book.<br>Divide
Focusing on complex media and how to convert raw data into useful information, this book offers both introductory and advanced material in the combined fields of machine learning and image/video processing. It is organized into three parts. The first focuses on technical aspects, basic mathematical
This book is divided into three parts: From Perception to Computation - Shows how the physical supports our auditory and visual perceptions. In other words, it shows how acoustic waves and electromagnetic radiation are converted into objects that can be manipulated by a computer. Machine Learning