Search results for Computational statistics, machine learning and information science

Frontmatter
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp i-iv
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

11 - Sparse and Misaligned Data
from Part IV - Handling Diverse Data Formats
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 245-265
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

There is no silver bullet: no model can fit all data. Hence, special data requires special algorithms. In this chapter, we deal with two types of special data: sparse data and sequences that can be aligned to each other. We will not dive deep into sparsity learning, which is very complex. Rather, we introduce key concepts: sparsity inducing loss functions, dictionary learning, and what exactly the word sparsity means. For the second part in this chapter, we introduce dynamic time warping (DTW), which deals with sequences that can be aligned with each other (but there are sequences that cannot be aligned, which we will discuss in the next chapter). We use our old tricks: ideas, visualizations, formalizations, to reach the DTW solution. The key idea behind its success is divide-and-conquer and the key technology is dynamic programming.

13 - The Normal Distribution
from Part V - Advanced Topics
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 293-315
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

The normal distribution is the most widely used continuous distribution, but many of its relevant properties are a little bit advanced for an undergraduate course. Hence, Part IV introduces some of these advanced topics. This chapter devotes itself to properties of normal distributions: single- and multivariate normal distributions, moment and canonical parameterizations, sum and product, geometry and the Mahalanobis distance, and conditional distributions. We also show that with these properties, some algorithms will become much easier to understand. We use parameter estimation and the Kalman filter as two such examples.

15 - Convolutional Neural Networks
from Part V - Advanced Topics
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 333-364
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

We cannot miss deep learning in a modern pattern recognition textbook, and we introduce CNN (convolutional neural networks) in this chapter. Although the mathematical derivation of CNN, especially the back-propagation process and gradient computation, is complex, we use a lot of useful tools to help readers understand what exactlyis going on in a CNN. Hence, this chapter focuses on accessibility rather than completeness. In its exercise problems, we introduce more relevant topics and methods.

6 - Fisher’s Linear Discriminant
from Part II - Domain-Independent Feature Extraction
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 123-140
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Unlike PCA, which is unsupervised, FLD uses labels associated with data points, and no doubt it may get better linear features and accuracy than PCA. We start by illustrating this motivation, and practice the problem-solving framework by gradually developing the correct mathematical formulation behind the relatively simple idea behind Fisher's linear discriminant (FLD). We discuss various practical issues: the solution for the binary case, the scenario where this solution breaks down, and how to generalize from tasks with only two categories to many categories.

List of Tables
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp xi-xii
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

8 - Probabilistic Methods
from Part III - Classifiers and Tools
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 173-195
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter is a succinct introduction to basic probabilistic methods for pattern recognition and machine learning. One focus is to clearly present the exact meanings of different terms, including the taxonomy of different probabilistic methods. We present a basic introduction to maximum likelihood and maximum a posteriori estimation, and a very brief example to showcase the concept of Bayesian estimation. For the nonparametric world, we start from the drawbacks of parametric methods, gradually analyzing the properties preferred for a nonparametric one, and finally reach the kernel density estimation, a typical nonparametric method.

Part IV - Handling Diverse Data Formats
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 243-244
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Notation
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp xvi-xvi
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

1 - Introduction
from Part I - Introduction and Overview
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 3-14
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter is an overall introduction to the definition of pattern recognition, its relationship with machine learning and other relevant subject areas, and the main components and development process inside a pattern recognition system. This introduction is started by considering an autonomous driving example.

Part I - Introduction and Overview
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 1-2
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Bibliography
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 365-378
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Plates
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 385-400
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

14 - The Basic Idea behind Expectation-Maximization
from Part V - Advanced Topics
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 316-332
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Parameter estimation is generally difficult, requiring advanced methods such as the expectation-maximization (EM). This chapter focuses on the ideas behind EM, rather than its complex mathematical properties or proofs. We use the Gaussian mixture model (GMM) as an illustrative example to find what leads us to the EM algorithms, e.g., complete and incomplete data likelihood, concave and nonconcave loss functions, and observed and hidden variables. We then derive the EM algorithm in general and its application to GMM.

3 - Overview of a Pattern Recognition System
from Part I - Introduction and Overview
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 44-62
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter presents a simple but working face recognition system, which is based on the nearest neighbor search algorithm. Albeit simple, it is a complete pattern recognition pipeline. We can then examine every component in it, and analyze potential difficulties and pitfalls one may encounter. Furthermore, we introduce a problem-solving framework, which will be useful in the rest of this book and in solving other tasks.

List of Figures
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp ix-x
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

9 - Distance Metrics and Data Transformations
from Part III - Classifiers and Tools
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 196-218
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter is not about one particular method (or a family of methods). Instead, it provides a set of tools useful for better pattern recognition, especially for real-world applications. They include the definition of distance metrics, vector norms, a brief introduction to the idea of distance metric learning, and power mean kernels (which is a family of useful metrics). We also establish by examples that proper normalizations of our data are essential, and introduce a few data normalization and transformation methods.

Part V - Advanced Topics
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 291-292
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

7 - Classifiers and Tools
from Part III - Classifiers and Tools
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 143-172
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Starting from this chapter, Part III introduces several commonly used algorithms in pattern recognition and machine learning. Support vector machines (SVM) starts from a simple and beautiful idea: large margin. We first show that in order to find such an idea, we may need to simplify our problem setup by assuming a linearly separable binary one. Then we visualize and calculate the margin to reach the SVM formulation, which is complex and difficult to optimize. We practice the simplification procedure again until the formulation becomes viable, briefly mention the primal--dual relationship, but do not go into details of its optimization. We show that the simplification assumptions (linear, separable, and binary) can be relaxed such that SVM will solve more difficult tasks---and the key ideas here are also useful in other tasks: slack variables and kernel methods.

Part III - Classifiers and Tools
Jianxin Wu, Nanjing University, China
Book:

Essentials of Pattern Recognition

Published online:

08 December 2020

Print publication:

19 November 2020, pp 141-142
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Computational statistics, machine learning and information science

Refine search

Refine search

Actions for selected content:

1004 results in Computational statistics, machine learning and information science

Frontmatter

11 - Sparse and Misaligned Data

Summary

13 - The Normal Distribution

Summary

15 - Convolutional Neural Networks

Summary

6 - Fisher’s Linear Discriminant

Summary

List of Tables

8 - Probabilistic Methods

Summary

Part IV - Handling Diverse Data Formats

Notation

1 - Introduction

Summary

Part I - Introduction and Overview

Bibliography

Plates

14 - The Basic Idea behind Expectation-Maximization

Summary

3 - Overview of a Pattern Recognition System

Summary

List of Figures

9 - Distance Metrics and Data Transformations

Summary

Part V - Advanced Topics

7 - Classifiers and Tools

Summary

Part III - Classifiers and Tools

Computational statistics, machine learning and information science

Refine search

Refine search

Actions for selected content:

Save Search

1004 results in Computational statistics, machine learning and information science

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary