Machine Learning for Audio, Image and Video Analysis: Theory and Applications (Advanced Information and Knowledge Processing)

✍ Scribed by Francesco Camastra, Alessandro Vinciarelli

Publisher: Springer
Year: 2007
Tongue: English
Leaves: 484
Series: Advanced Information and Knowledge Processing
Edition: 1
Category: Library

No coin nor oath required. For personal study only.

✦ Synopsis

This book is divided into three parts:

From Perception to Computation - Shows how the physical supports our auditory and visual perceptions. In other words, it shows how acoustic waves and electromagnetic radiation are converted into objects that can be manipulated by a computer.

Machine Learning - Provides a rather deep survey of the main techniques used in machine learning. These chapters cover most of the algorithms applied in systems for audio, image, and video analysis. At this point, all of the algorithms are general pattern recognition techniques that could apply to any field.

Applications - This section presents examples of applications using the techniques presented in part two. There is a chapter each dedicated to speech and handwriting recognition, face recognition, and video segmentation and keyframe extraction. Each chapter shows an overall system where analysis and machine learning components interact in order to accomplish a given task. Whenever possible the chapters of this part present results obtained using publicly available data and software package. This enables the reader to perform experiments similar to those presented in this book.

The beginning of each chapter starts with what the reader should understand before getting started, such as calculus and chapter four in the case of chapter eleven. That is followed with a list of what the reaer should know after reading the chapter. I'd say parts one and two are quite good, but things break down a bit in part three. Granted, the subject of each of the three chapters in the final section is complex, but a few more figures and labeled algorithmic steps and maybe a little less prose might have made the specific matters of each task at hand clearer.

📜 SIMILAR VOLUMES

Machine Learning for Audio, Image and Vi

📁 Machine Learning for Audio, Image and Video Analysis: Theory and Applications

✍ Francesco Camastra PhD, Alessandro Vinciarelli PhD (auth.) 📂 Library 📅 2008 🏛 Springer-Verlag London 🌐 English

Machine Learning involves several scientific domains including mathematics, computer science, statistics and biology, and is an approach that enables computers to automatically learn from data. Focusing on complex media and how to convert raw data into useful information, this book offers both

Machine Learning for Audio, Image and Vi

📁 Machine Learning for Audio, Image and Video Analysis: Theory and Applications

✍ Francesco Camastra PhD, Alessandro Vinciarelli PhD (auth.) 📂 Library 📅 2008 🏛 Springer-Verlag London 🌐 English

Machine Learning for Audio, Image and Vi

📁 Machine Learning for Audio, Image and Video Analysis - Theory and Applications

✍ Nicu Sebe, Ira Cohen, Ashutosh Garg, Thomas S. Huang 📂 Library 📅 2008 🏛 Springer 🌐 English

Machine Learning for Audio, Image and Vi

📁 Machine Learning for Audio, Image and Video Analysis: Theory and Applications

✍ Francesco Camastra, Alessandro Vinciarelli (auth.) 📂 Library 📅 2015 🏛 Springer-Verlag London 🌐 English

This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book. Divide

Machine learning for audio, image and vi

📁 Machine learning for audio, image and video analysis : theory and applications

✍ Camastra, Francesco; Vinciarelli, Alessandro 📂 Library 📅 2015 🏛 Springer 🌐 English

This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book. Divided i

Machine learning for audio, image and vi

📁 Machine learning for audio, image and video analysis

✍ Camastra F., Vinciarelli A. 📂 Library 📅 2008 🏛 Springer 🌐 English

Focusing on complex media and how to convert raw data into useful information, this book offers both introductory and advanced material in the combined fields of machine learning and image/video processing. It is organized into three parts. The first focuses on technical aspects, basic mathematical