Skip to main content

2015 | Buch

Machine Learning for Audio, Image and Video Analysis

Theory and Applications

insite
SUCHEN

Über dieses Buch

This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book.
Divided into three main parts, From Perception to Computation introduces methodologies aimed at representing the data in forms suitable for computer processing, especially when it comes to audio and images. Whilst the second part, Machine Learning includes an extensive overview of statistical techniques aimed at addressing three main problems, namely classification (automatically assigning a data sample to one of the classes belonging to a predefined set), clustering (automatically grouping data samples according to the similarity of their properties) and sequence analysis (automatically mapping a sequence of observations into a sequence of human-understandable symbols). The third part Applications shows how the abstract problems defined in the second part underlie technologies capable to perform complex tasks such as the recognition of hand gestures or the transcription of handwritten data.

Machine Learning for Audio, Image and Video Analysis is suitable for students to acquire a solid background in machine learning as well as for practitioners to deepen their knowledge of the state-of-the-art. All application chapters are based on publicly available data and free software packages, thus allowing readers to replicate the experiments.

Inhaltsverzeichnis

Frontmatter
Chapter 1. Introduction
Abstract
There are two fundamental questions that should be answered before buying, and even more before reading, a book. \(\bullet \) Why should one read the book?, \(\bullet \) What is the book about?
Francesco Camastra, Alessandro Vinciarelli

From Perception to Computation

Frontmatter
Chapter 2. Audio Acquisition, Representation and Storage
Abstract
What the reader should know to understand this chapter \(\bullet \) Basic notions of physics. \(\bullet \) Basic notions of calculus (trigonometry, logarithms, exponentials, etc.)
Francesco Camastra, Alessandro Vinciarelli
Chapter 3. Image and Video Acquisition, Representation and Storage
Abstract
What the reader should know to understand this chapter \(\bullet \) Elementary notions of optics and physics. \(\bullet \) Basic notions of mathematics.
Francesco Camastra, Alessandro Vinciarelli

Machine Learning

Frontmatter
Chapter 4. Machine Learning
Abstract
What the reader should know after reading in this chapter \(\bullet \) Supervised learning, \(\bullet \) Unsupervised learning, \(\bullet \) Semi-supervised learning, \(\bullet \) Reinforcement learning.
Francesco Camastra, Alessandro Vinciarelli
Chapter 5. Bayesian Theory of Decision
Abstract
What the reader should know to understand this chapter \(\bullet \) Basic notions of statistics and probability theory (see Appendix A). \(\bullet \) Calculus notions are an advantage.
Francesco Camastra, Alessandro Vinciarelli
Chapter 6. Clustering Methods
Abstract
What the reader should know to understand this chapter \(\bullet \) Basic notions of calculus and linear algebra. \(\bullet \) Basic notions of machine learning. \(\bullet \) Programming skills to implement some computer projects proposed in the Problems section.
Francesco Camastra, Alessandro Vinciarelli
Chapter 7. Foundations of Statistical Learning and Model Selection
Abstract
What the reader should know to understand this chapter \(\bullet \) Basic notions of machine learning. \(\bullet \) Notions of calculus. \(\bullet \) Chapter 5.
Francesco Camastra, Alessandro Vinciarelli
Chapter 8. Supervised Neural Networks and Ensemble Methods
Abstract
What the reader should know to understand this chapter \(\bullet \) Fundamentals of machine learning (Chap. 4). \(\bullet \) Statistics (Appendix A).
Francesco Camastra, Alessandro Vinciarelli
Chapter 9. Kernel Methods
Abstract
Notions of calculus. What the reader should know to understand this chapter \(\bullet \) Notions of calculus. \(\bullet \) Chapters 56, and 7. \(\bullet \) Although the reading of Appendix D is not mandatory, it represents an advantage for the chapter understanding.
Francesco Camastra, Alessandro Vinciarelli
Chapter 10. Markovian Models for Sequential Data
Abstract
What the reader should know to understand this chapter \(\bullet \) Bayes decision theory (Chap. 5). \(\bullet \) Lagrange multipliers and conditional optimization problems (Chap. 9). \(\bullet \) Probability and statistics (Appendix A).
Francesco Camastra, Alessandro Vinciarelli
Chapter 11. Feature Extraction Methods and Manifold Learning Methods
Abstract
What the reader needs to understand this chapter. \(\bullet \) Notions of calculus. \(\bullet \) The fourth chapter.
Francesco Camastra, Alessandro Vinciarelli

Applications

Frontmatter
Chapter 12. Speech and Handwriting Recognition
Abstract
What the reader should know to understand this chapter \(\bullet \) Hidden Markov models (Chap. 10). \(\bullet \) Language models (Chap. 10). \(\bullet \) Bayes decision theory (Chap. 3).
Francesco Camastra, Alessandro Vinciarelli
Chapter 13. Automatic Face Recognition
Abstract
What the reader should know to understand this chapter \(\bullet \) Basic notions of image processing (Chap. 3). \(\bullet \) Support vectors machines and kernel methods (Chap. 9). \(\bullet \) Principal component analysis (Chap. 11).
Francesco Camastra, Alessandro Vinciarelli
Chapter 14. Video Segmentation and Keyframe Extraction
Abstract
What the reader should know to understand this chapter \(\bullet \) Basic notions of image processing (Chap. 3). \(\bullet \) Clustering techniques (Chap. 6).
Francesco Camastra, Alessandro Vinciarelli
Chapter 15. Real-Time Hand Pose Recognition
Abstract
What the reader should know to understand this chapter \(\bullet \) Color Models (Chap. 3). \(\bullet \) Learning Vector Quantization (Chap. 8).
Francesco Camastra, Alessandro Vinciarelli
Chapter 16. Automatic Personality Perception
Abstract
What the reader should know to understand this chapter \(\bullet \) Basic notions of speech processing (Chap. 2). \(\bullet \) Classification techniques (Chap. 8).
Francesco Camastra, Alessandro Vinciarelli
Backmatter
Metadaten
Titel
Machine Learning for Audio, Image and Video Analysis
verfasst von
Francesco Camastra
Alessandro Vinciarelli
Copyright-Jahr
2015
Verlag
Springer London
Electronic ISBN
978-1-4471-6735-8
Print ISBN
978-1-4471-6734-1
DOI
https://doi.org/10.1007/978-1-4471-6735-8

Premium Partner