Machine Learning for Audio, Image and Video Analysis | springerprofessional.de

Springer Professional

nach oben

2015 | Buch

Kapitel lesen Erstes Kapitel lesen

Machine Learning for Audio, Image and Video Analysis

Theory and Applications

verfasst von: Francesco Camastra, Alessandro Vinciarelli

Verlag: Springer London

Buchreihe : Advanced Information and Knowledge Processing

Enthalten in: Springer Professional "Wirtschaft+Technik" , Springer Professional "Technik" , Springer Professional "Wirtschaft"

Einloggen, um Zugang zu erhalten

Über dieses Buch

This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book.
Divided into three main parts, From Perception to Computation introduces methodologies aimed at representing the data in forms suitable for computer processing, especially when it comes to audio and images. Whilst the second part, Machine Learning includes an extensive overview of statistical techniques aimed at addressing three main problems, namely classification (automatically assigning a data sample to one of the classes belonging to a predefined set), clustering (automatically grouping data samples according to the similarity of their properties) and sequence analysis (automatically mapping a sequence of observations into a sequence of human-understandable symbols). The third part Applications shows how the abstract problems defined in the second part underlie technologies capable to perform complex tasks such as the recognition of hand gestures or the transcription of handwritten data.

Machine Learning for Audio, Image and Video Analysis is suitable for students to acquire a solid background in machine learning as well as for practitioners to deepen their knowledge of the state-of-the-art. All application chapters are based on publicly available data and free software packages, thus allowing readers to replicate the experiments.

Anzeige

Inhaltsverzeichnis

Frontmatter

Chapter 1. Introduction

Abstract

There are two fundamental questions that should be answered before buying, and even more before reading, a book. \(\bullet \) Why should one read the book?, \(\bullet \) What is the book about?

Francesco Camastra, Alessandro Vinciarelli

From Perception to Computation

Frontmatter

Chapter 2. Audio Acquisition, Representation and Storage

Abstract

What the reader should know to understand this chapter \(\bullet \) Basic notions of physics. \(\bullet \) Basic notions of calculus (trigonometry, logarithms, exponentials, etc.)

Francesco Camastra, Alessandro Vinciarelli

Chapter 3. Image and Video Acquisition, Representation and Storage

Abstract

What the reader should know to understand this chapter \(\bullet \) Elementary notions of optics and physics. \(\bullet \) Basic notions of mathematics.

Francesco Camastra, Alessandro Vinciarelli

Machine Learning

Frontmatter

Chapter 4. Machine Learning

Abstract

What the reader should know after reading in this chapter \(\bullet \) Supervised learning, \(\bullet \) Unsupervised learning, \(\bullet \) Semi-supervised learning, \(\bullet \) Reinforcement learning.

Francesco Camastra, Alessandro Vinciarelli

Chapter 5. Bayesian Theory of Decision

Abstract

What the reader should know to understand this chapter \(\bullet \) Basic notions of statistics and probability theory (see Appendix A). \(\bullet \) Calculus notions are an advantage.

Francesco Camastra, Alessandro Vinciarelli

Chapter 6. Clustering Methods

Abstract

What the reader should know to understand this chapter \(\bullet \) Basic notions of calculus and linear algebra. \(\bullet \) Basic notions of machine learning. \(\bullet \) Programming skills to implement some computer projects proposed in the Problems section.

Francesco Camastra, Alessandro Vinciarelli

Chapter 7. Foundations of Statistical Learning and Model Selection

Abstract

What the reader should know to understand this chapter \(\bullet \) Basic notions of machine learning. \(\bullet \) Notions of calculus. \(\bullet \) Chapter 5.

Francesco Camastra, Alessandro Vinciarelli

Chapter 8. Supervised Neural Networks and Ensemble Methods

Abstract

What the reader should know to understand this chapter \(\bullet \) Fundamentals of machine learning (Chap. 4). \(\bullet \) Statistics (Appendix A).

Francesco Camastra, Alessandro Vinciarelli

Chapter 9. Kernel Methods

Abstract

Notions of calculus. What the reader should know to understand this chapter \(\bullet \) Notions of calculus. \(\bullet \) Chapters 5, 6, and 7. \(\bullet \) Although the reading of Appendix D is not mandatory, it represents an advantage for the chapter understanding.

Francesco Camastra, Alessandro Vinciarelli

Chapter 10. Markovian Models for Sequential Data

Abstract

What the reader should know to understand this chapter \(\bullet \) Bayes decision theory (Chap. 5). \(\bullet \) Lagrange multipliers and conditional optimization problems (Chap. 9). \(\bullet \) Probability and statistics (Appendix A).

Francesco Camastra, Alessandro Vinciarelli

Chapter 11. Feature Extraction Methods and Manifold Learning Methods

Abstract

What the reader needs to understand this chapter. \(\bullet \) Notions of calculus. \(\bullet \) The fourth chapter.

Francesco Camastra, Alessandro Vinciarelli

Applications

Frontmatter

Chapter 12. Speech and Handwriting Recognition

Abstract

What the reader should know to understand this chapter \(\bullet \) Hidden Markov models (Chap. 10). \(\bullet \) Language models (Chap. 10). \(\bullet \) Bayes decision theory (Chap. 3).

Francesco Camastra, Alessandro Vinciarelli

Chapter 13. Automatic Face Recognition

Abstract

What the reader should know to understand this chapter \(\bullet \) Basic notions of image processing (Chap. 3). \(\bullet \) Support vectors machines and kernel methods (Chap. 9). \(\bullet \) Principal component analysis (Chap. 11).

Francesco Camastra, Alessandro Vinciarelli

Chapter 14. Video Segmentation and Keyframe Extraction

Abstract

What the reader should know to understand this chapter \(\bullet \) Basic notions of image processing (Chap. 3). \(\bullet \) Clustering techniques (Chap. 6).

Francesco Camastra, Alessandro Vinciarelli

Chapter 15. Real-Time Hand Pose Recognition

Abstract

What the reader should know to understand this chapter \(\bullet \) Color Models (Chap. 3). \(\bullet \) Learning Vector Quantization (Chap. 8).

Francesco Camastra, Alessandro Vinciarelli

Chapter 16. Automatic Personality Perception

Abstract

What the reader should know to understand this chapter \(\bullet \) Basic notions of speech processing (Chap. 2). \(\bullet \) Classification techniques (Chap. 8).

Francesco Camastra, Alessandro Vinciarelli

Backmatter

Titel: Machine Learning for Audio, Image and Video Analysis
verfasst von: Francesco Camastra
Alessandro Vinciarelli
Copyright-Jahr: 2015
Verlag: Springer London
Electronic ISBN: 978-1-4471-6735-8
Print ISBN: 978-1-4471-6734-1
DOI: https://doi.org/10.1007/978-1-4471-6735-8

Premium Partner