๐”– Scriptorium
โœฆ   LIBER   โœฆ

๐Ÿ“

Machine Learning for Audio, Image and Video Analysis: Theory and Applications

โœ Scribed by Francesco Camastra PhD, Alessandro Vinciarelli PhD (auth.)


Publisher
Springer-Verlag London
Year
2008
Tongue
English
Leaves
487
Series
Advanced Information and Knowledge Processing
Edition
1
Category
Library

โฌ‡  Acquire This Volume

No coin nor oath required. For personal study only.

โœฆ Synopsis


Machine Learning involves several scientific domains including mathematics, computer science, statistics and biology, and is an approach that enables computers to automatically learn from data. Focusing on complex media and how to convert raw data into useful information, this book offers both introductory and advanced material in the combined fields of machine learning and image/video processing.

The machine learning techniques presented enable readers to address many real world problems involving complex data. Examples covering areas such as automatic speech and handwriting transcription, automatic face recognition, and semantic video segmentation are included, along with detailed introductions to algorithms and examples of their applications.

The book is organized in four parts: The first focuses on technical aspects, basic mathematical notions and elementary machine learning techniques. The second provides an extensive survey of most relevant machine learning techniques for media processing, while the third part focuses on applications and shows how techniques are applied in actual problems. The fourth part contains detailed appendices that provide notions about the main mathematical instruments used throughout the text.

Students and researchers needing a solid foundation or reference, and practitioners interested in discovering more about the state-of-the-art will find this book invaluable. Examples and problems are based on data and software packages publicly available on the web.

โœฆ Table of Contents


Front Matter....Pages I-XVI
Introduction....Pages 1-10
Audio Acquisition, Representation and Storage....Pages 13-50
Image and Video Acquisition, Representation and Storage....Pages 51-80
Machine Learning....Pages 83-89
Bayesian Theory of Decision....Pages 91-115
Clustering Methods....Pages 117-148
Foundations of Statistical Learning and Model Selection....Pages 149-172
Supervised Neural Networks and Ensemble Methods....Pages 173-209
Kernel Methods....Pages 211-263
Markovian Models for Sequential Data....Pages 265-303
Feature Extraction Methods and Manifold Learning Methods....Pages 305-341
Speech and Handwriting Recognition....Pages 345-379
Automatic Face Recognition....Pages 381-411
Video Segmentation and Keyframe Extraction....Pages 413-430
Back Matter....Pages 431-494

โœฆ Subjects


Pattern Recognition; Image Processing and Computer Vision; Multimedia Information Systems


๐Ÿ“œ SIMILAR VOLUMES


Machine Learning for Audio, Image and Vi
โœ Francesco Camastra PhD, Alessandro Vinciarelli PhD (auth.) ๐Ÿ“‚ Library ๐Ÿ“… 2008 ๐Ÿ› Springer-Verlag London ๐ŸŒ English

<p><P>Machine Learning involves several scientific domains including mathematics, computer science, statistics and biology, and is an approach that enables computers to automatically learn from data. Focusing on complex media and how to convert raw data into useful information, this book offers both

Machine Learning for Audio, Image and Vi
โœ Francesco Camastra, Alessandro Vinciarelli (auth.) ๐Ÿ“‚ Library ๐Ÿ“… 2015 ๐Ÿ› Springer-Verlag London ๐ŸŒ English

<p><p>This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book.<br>Divide

Machine learning for audio, image and vi
โœ Camastra, Francesco; Vinciarelli, Alessandro ๐Ÿ“‚ Library ๐Ÿ“… 2015 ๐Ÿ› Springer ๐ŸŒ English

<p>This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book.<br>Divided i

Machine learning for audio, image and vi
โœ Camastra F., Vinciarelli A. ๐Ÿ“‚ Library ๐Ÿ“… 2008 ๐Ÿ› Springer ๐ŸŒ English

Focusing on complex media and how to convert raw data into useful information, this book offers both introductory and advanced material in the combined fields of machine learning and image/video processing. It is organized into three parts. The first focuses on technical aspects, basic mathematical

Machine Learning for Audio, Image and Vi
โœ Francesco Camastra, Alessandro Vinciarelli ๐Ÿ“‚ Library ๐Ÿ“… 2007 ๐Ÿ› Springer ๐ŸŒ English

This book is divided into three parts: From Perception to Computation - Shows how the physical supports our auditory and visual perceptions. In other words, it shows how acoustic waves and electromagnetic radiation are converted into objects that can be manipulated by a computer. Machine Learning