To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
This chapter is not about one particular method (or a family of methods). Instead, it provides a set of tools useful for better pattern recognition, especially for real-world applications. They include the definition of distance metrics, vector norms, a brief introduction to the idea of distance metric learning, and power mean kernels (which is a family of useful metrics). We also establish by examples that proper normalizations of our data are essential, and introduce a few data normalization and transformation methods.
Starting from this chapter, Part III introduces several commonly used algorithms in pattern recognition and machine learning. Support vector machines (SVM) starts from a simple and beautiful idea: large margin. We first show that in order to find such an idea, we may need to simplify our problem setup by assuming a linearly separable binary one. Then we visualize and calculate the margin to reach the SVM formulation, which is complex and difficult to optimize. We practice the simplification procedure again until the formulation becomes viable, briefly mention the primal--dual relationship, but do not go into details of its optimization. We show that the simplification assumptions (linear, separable, and binary) can be relaxed such that SVM will solve more difficult tasks---and the key ideas here are also useful in other tasks: slack variables and kernel methods.
Information theory is developed in the communications community, but it turns out to be very useful for pattern recognition. In this chapter, we start with an example to develop the ideas of uncertainty and its measurement, i.e., entropy. A few core results in information theory are introduced: entropy, joint and conditional entropy, mutual information, and their relationships. We then move to differential entropy for continuous random variables and find distributions with maximum entropy under certain constraints, which are useful for pattern recognition. Finally, we introduce the applications of information theory in our context: maximum entropy learning, minimum cross entropy, feature selection, and decision trees (a widely used family of models for pattern recognition and machine learning).
Data-driven computational neuroscience facilitates the transformation of data into insights into the structure and functions of the brain. This introduction for researchers and graduate students is the first in-depth, comprehensive treatment of statistical and machine learning methods for neuroscience. The methods are demonstrated through case studies of real problems to empower readers to build their own solutions. The book covers a wide variety of methods, including supervised classification with non-probabilistic models (nearest-neighbors, classification trees, rule induction, artificial neural networks and support vector machines) and probabilistic models (discriminant analysis, logistic regression and Bayesian network classifiers), meta-classifiers, multi-dimensional classifiers and feature subset selection methods. Other parts of the book are devoted to association discovery with probabilistic graphical models (Bayesian networks and Markov networks) and spatial statistics with point processes (complete spatial randomness and cluster, regular and Gibbs processes). Cellular, structural, functional, medical and behavioral neuroscience levels are considered.