Natural organisms inhabit a dynamical environment and arguably a large part of natural intelligence is in modelling causal relations and consequences of actions. In this sense, modelling temporal data is of fundamental interest. In a more artificial environment, there are many instances where predicting the future is of interest, particularly in areas such as finance and also in tracking of moving objects.
In Part IV, we discuss some of the classical models of timeseries that may be used to represent temporal data and also to make predictions of the future. Many of these models are well known in different branches of science from physics to engineering and are heavily used in areas such as speech recognition, financial prediction and control. We also discuss some more sophisticated models in Chapter 25, which may be skipped at first reading.
As an allusion to the fact that natural organisms inhabit a temporal world, we also address in Chapter 26 some basic models of how information processing might be achieved in distributed systems.
When the distribution is multiply connected it would be useful to have a generic inference approach that is efficient in its reuse of messages. In this chapter we discuss an important structure, the junction tree, that by clustering variables enables one to perform message passing efficiently (although the structure on which the message passing occurs may consist of intractably large clusters). The most important thing is the junction tree itself, based on which different message-passing procedures can be considered. The junction tree helps forge links with the computational complexity of inference in fields from computer science to statistics and physics.
Clustering variables
In Chapter 5 we discussed efficient inference for singly connected graphs, for which variable elimination and message-passing schemes are appropriate. In the multiply connected case, however, one cannot in general perform inference by passing messages only along existing links in the graph. The idea behind the Junction Tree Algorithm (JTA) is to form a new representation of the graph in which variables are clustered together, resulting in a singly connected graph in the cluster variables (albeit on a different graph). The main focus of the development will be on marginal inference, though similar techniques apply to other inference tasks, such as finding the most probable state of the distribution.
At this stage it is important to point out that the JTA is not a magic method to deal with intractabilities resulting from multiply connected graphs; it is simply a way to perform correct inference on a multiply connected graph by transforming to a singly connected structure.
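To give a feel for the clustering idea before it is developed formally, consider a minimal example (the potentials here are generic placeholders). A distribution on a single loop,

\[ p(a,b,c,d) \propto \phi(a,b)\,\phi(b,c)\,\phi(c,d)\,\phi(d,a), \]

is multiply connected. Adding the chord b-d (a step known as triangulation) produces cliques {a,b,d} and {b,c,d}, on which the distribution can be rewritten as

\[ p(a,b,c,d) = \frac{p(a,b,d)\,p(b,c,d)}{p(b,d)}. \]

The two clusters, joined through the separator {b,d}, form a singly connected structure on which message passing applies; the price is that we now manipulate tables over three variables rather than two.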
Hidden Markov models assume that the underlying process is discrete; linear dynamical systems that the underlying process is continuous. However, there are scenarios in which the underlying system might jump from one continuous regime to another. In this chapter we discuss a class of models that can be used in this situation. Unfortunately the technical demands of this class of models are somewhat more involved than in previous chapters, although the models are correspondingly more powerful.
Introduction
Complex timeseries which are not well described globally by a single linear dynamical system may be divided into segments, each modelled by a potentially different LDS. Such models can handle situations in which the underlying model ‘jumps’ from one parameter setting to another. For example, a single LDS might well represent the normal flows in a chemical plant. When a break in a pipeline occurs, the dynamics of the system changes from one set of linear flow equations to another. This scenario can be modelled using a set of two linear systems, each with different parameters. The discrete latent variable at each time step, s_t ∈ {normal, pipe broken}, indicates which of the LDSs is most appropriate at the current time. This is called a Switching LDS (SLDS) and is used in many disciplines, from econometrics to machine learning [12, 63, 59, 235, 324, 189].
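In one common parameterisation (the notation here is illustrative rather than fixed by the chapter), the joint distribution factorises as

\[ p(s_{1:T}, h_{1:T}, v_{1:T}) = \prod_{t=1}^{T} p(s_t \mid s_{t-1})\, p(h_t \mid h_{t-1}, s_t)\, p(v_t \mid h_t, s_t), \]

where each conditional is linear-Gaussian given the switch state:

\[ h_t = A(s_t)\,h_{t-1} + \eta_t(s_t), \qquad v_t = B(s_t)\,h_t + \epsilon_t(s_t), \]

with h_t the continuous hidden state, v_t the observation, and η, ε zero-mean Gaussian noise whose covariances may also depend on s_t. At t = 1 the transition terms are replaced by priors.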
Probabilistic models explicitly take into account uncertainty and deal with our imperfect knowledge of the world. Such models are of fundamental significance in Machine Learning since our understanding of the world will always be limited by our observations and understanding. We will focus initially on using probabilistic models as a kind of expert system.
In Part I, we assume that the model is fully specified. That is, given a model of the environment, how can we use it to answer questions of interest? We will relate the complexity of inferring quantities of interest to the structure of the graph describing the model. In addition, we will describe operations in terms of manipulations on the corresponding graphs. As we will see, provided the graphs are simple tree-like structures, most quantities of interest can be computed efficiently.
Part I deals with manipulating mainly discrete variable distributions and forms the background to all the later material in the book.
In Part II we address how to learn a model from data. In particular we will discuss learning a model as a form of inference on an extended distribution, now taking into account the parameters of the model.
Learning a model or model parameters from data forces us to deal with uncertainty since with only limited data we can never be certain which is the ‘correct’ model. We also address how the structure of a model, not just its parameters, can in principle be learned.
In Part II we show how learning can be achieved under simplifying assumptions, such as maximum likelihood, which sets parameters to those that would most likely reproduce the observed data. We also discuss the problems that arise when, as is often the case, there is missing data.
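In symbols, maximum likelihood selects the parameters

\[ \theta^{*} = \operatorname{argmax}_{\theta}\; p(\mathcal{D} \mid \theta), \]

that is, the setting under which the observed data \(\mathcal{D}\) is most probable.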
Together with Part I, Part II prepares the basic material required to embark on understanding models in machine learning, having the tools required to learn models from data and subsequently query them to answer questions of interest.
Sampling methods are popular and well known for approximate inference. In this chapter we give an introduction to the less well known class of deterministic approximation techniques. These have been spectacularly successful in branches of the information sciences and many have their origins in the study of large-scale physical systems.
Introduction
Deterministic approximate inference methods are an alternative to the sampling techniques discussed in Chapter 27. Drawing exact independent samples is typically computationally intractable and assessing the quality of the sample estimates is difficult. In this chapter we discuss some alternatives. The first, Laplace's method, is a simple perturbation technique. The second class of methods are those that produce rigorous bounds on quantities of interest. Such methods are interesting since they provide certain knowledge – it may be sufficient, for example, to show that a marginal probability is greater than 0.1 in order to make an informed decision. A further class of methods are the consistency methods, such as loopy belief propagation. Such methods have revolutionised certain fields, including error correction [197]. It is important to bear in mind that no single approximation technique, deterministic or stochastic, is going to beat all others on all problems, given the same computational resources. In this sense, insight as to the properties of the various approximations is useful in matching an approximation method to the problem at hand.
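To give a flavour of the perturbation idea behind Laplace's method: writing p(x) = e^{-E(x)}/Z for x ∈ R^D and expanding the ‘energy’ E to second order around a mode x* (where the gradient vanishes),

\[ E(x) \approx E(x^{*}) + \tfrac{1}{2}(x - x^{*})^{\mathsf{T}} H (x - x^{*}), \qquad H = \nabla\nabla E(x^{*}), \]

yields a Gaussian approximation with mean x* and covariance H^{-1}, together with the normalisation estimate

\[ Z \approx e^{-E(x^{*})} \sqrt{\frac{(2\pi)^{D}}{\det H}}. \]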
We can now make a first connection between probability and graph theory. A belief network introduces structure into a probabilistic model by using graphs to represent independence assumptions among the variables. Probability operations such as marginalising and conditioning then correspond to simple operations on the graph, and details about the model can be ‘read’ from the graph. There is also a benefit in terms of computational efficiency. Belief networks cannot capture all possible relations among variables. However, they are natural for representing ‘causal’ relations, and they are a part of the family of graphical models we study further in Chapter 4.
The benefits of structure
It's tempting to think of feeding a mass of undigested data and probability distributions into a computer and getting back good predictions and useful insights in extremely complex environments. Unfortunately, such a naive approach is likely to fail. The number of possible ways the variables can interact is extremely large, so that without some sensible assumptions we are unlikely to make a useful model. Independently specifying all the entries of a table p(x1, …, xN) over binary variables xi takes O(2^N) space, which is impractical for more than a handful of variables. This is clearly infeasible in many machine learning and related application areas where we need to deal with distributions on potentially hundreds if not millions of variables. Structure is also important for the computational tractability of inferring quantities of interest.
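The saving that structure buys can be made precise. A belief network factorises the joint distribution into local conditional distributions,

\[ p(x_1, \ldots, x_N) = \prod_{i=1}^{N} p\big(x_i \mid \mathrm{pa}(x_i)\big), \]

where pa(x_i) denotes the parents of x_i in the graph. If each binary variable has at most K parents, the tables require only O(N 2^K) entries rather than O(2^N), an exponential saving whenever K ≪ N.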
From Part One - Frameworks for Scaling Up Machine Learning
By Mihai Budiu, Dennis Fetterly, Michael Isard, Frank McSherry, and Yuan Yu, Microsoft Research, Mountain View, CA, USA
This chapter describes DryadLINQ, a general-purpose system for large-scale data-parallel computing, and illustrates its use on a number of machine learning problems.
The main motivation behind the development of DryadLINQ was to make it easier for nonspecialists to write general-purpose, scalable programs that can operate on very large input datasets. In order to appeal to nonspecialists, we designed the programming interface to use a high level of abstraction that insulates the programmer from most of the detail and complexity of parallel and distributed execution. In order to support general-purpose computing, we embedded these high-level abstractions in .NET, giving developers access to full-featured programming languages with rich type systems and proven mechanisms (such as classes and libraries) for managing complex, long-lived, and geographically distributed software projects. In order to support scalability over very large data and compute clusters, the DryadLINQ compiler generates code for the Dryad runtime, a well-tested and highly efficient distributed execution engine.
As machine learning moves into the industrial mainstream and operates over diverse data types including documents, images, and graphs, it is increasingly appealing to move away from domain-specific languages like MATLAB and toward general-purpose languages that support rich types and standardized libraries. The examples in this chapter demonstrate that a general-purpose language such as C# supports effective, concise implementations of standard machine learning algorithms and that DryadLINQ efficiently scales these implementations to operate over hundreds of computers and very large datasets, limited primarily by disk capacity.
From Part Two - Supervised and Unsupervised Learning Algorithms
By Joseph Gonzalez, Yucheng Low, and Carlos Guestrin, Carnegie Mellon University, Pittsburgh, PA, USA
Probabilistic graphical models are used in a wide range of machine learning applications. From reasoning about protein interactions (Jaimovich et al., 2006) to stereo vision (Sun, Shum, and Zheng, 2002), graphical models have facilitated the application of probabilistic methods to challenging machine learning problems. A core operation in probabilistic graphical models is inference – the process of computing the probability of an event given particular observations. Although inference is NP-complete in general, there are several popular approximate inference algorithms that typically perform well in practice. Unfortunately, the approximate inference algorithms are still computationally intensive and therefore can benefit from parallelization. In this chapter, we parallelize loopy belief propagation (loopy BP for short), which is used in a wide range of ML applications (Jaimovich et al., 2006; Sun et al., 2002; Lan et al., 2006; Baron, Sarvotham, and Baraniuk, 2010; Singla and Domingos, 2008).
We begin by briefly reviewing the sequential BP algorithm as well as the necessary background in probabilistic graphical models. We then present a collection of parallel shared memory BP algorithms that demonstrate the importance of scheduling in parallel BP. Next, we develop the Splash BP algorithm, which combines new scheduling ideas to address the limitations of existing sequential BP algorithms and achieve theoretically optimal parallel performance. Finally, we present how to efficiently implement loopy BP algorithms in the distributed parallel setting by addressing the challenges of distributed state and load balancing.
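As a point of reference for the scheduling discussion, the following is a minimal sketch of the synchronous (fixed-schedule) loopy BP baseline on a pairwise model; the data structures and names are ours, and the chapter's contribution lies precisely in replacing this rigid schedule with adaptive ones such as Splash.

    import numpy as np

    def loopy_bp(node_pot, edge_pot, edges, n_iters=50):
        """Synchronous loopy BP on a pairwise model.

        node_pot : dict  node -> 1-D potential array
        edge_pot : dict  (i, j) -> 2-D potential, rows index x_i, cols index x_j
        edges    : list of undirected edges (i, j)
        Returns approximate marginals for every node.
        """
        # Directed messages m[(i, j)](x_j), initialised uniform.
        msgs = {}
        for i, j in edges:
            msgs[(i, j)] = np.ones(len(node_pot[j]))
            msgs[(j, i)] = np.ones(len(node_pot[i]))

        nbrs = {v: [] for v in node_pot}
        for i, j in edges:
            nbrs[i].append(j)
            nbrs[j].append(i)

        for _ in range(n_iters):
            new = {}
            for (i, j) in msgs:
                # Product of node potential and all incoming messages except from j.
                belief = node_pot[i].copy()
                for k in nbrs[i]:
                    if k != j:
                        belief *= msgs[(k, i)]
                pot = edge_pot[(i, j)] if (i, j) in edge_pot else edge_pot[(j, i)].T
                m = belief @ pot            # sum over x_i
                new[(i, j)] = m / m.sum()   # normalise for numerical stability
            msgs = new                      # synchronous: all messages updated at once

        marginals = {}
        for v in node_pot:
            b = node_pot[v].copy()
            for k in nbrs[v]:
                b *= msgs[(k, v)]
            marginals[v] = b / b.sum()
        return marginals

Because every message in a round depends only on the previous round's messages, the inner loop over directed edges is embarrassingly parallel, which is what makes the synchronous schedule a natural, if suboptimal, starting point for parallelization.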
In this chapter, we address distributed learning algorithms for statistical latent variable models, with a focus on topic models. Many high-dimensional datasets, such as text corpora and image databases, are too large to allow one to learn topic models on a single computer. Moreover, a growing number of applications require that inference be fast or in real time, motivating the exploration of parallel and distributed learning algorithms.
We begin by reviewing topic models such as Latent Dirichlet Allocation and Hierarchical Dirichlet Processes. We discuss parallel and distributed algorithms for learning these models and show that these algorithms can achieve substantial speedups without sacrificing model quality. Next we discuss practical guidelines for running our algorithms within various parallel computing frameworks and highlight complementary speedup techniques. Finally, we generalize our distributed approach to handle Bayesian networks.
Several of the results in this chapter have appeared in previous papers in the specific context of topic modeling. The goal of this chapter is to present a comprehensive overview of distributed inference algorithms and to extend the general ideas to a broader class of Bayesian networks.
Latent Variable Models
Latent variable models are a class of statistical models that explain observed data with latent (or hidden) variables. Topic models and hidden Markov models are two examples of such models, where the latent variables are the topic assignment variables and the hidden states, respectively. Given observed data, the goal is to perform Bayesian inference over the latent variables and use the learned model to make inferences or predictions.
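Formally, with observed data x and latent variables z, such a model defines a joint distribution p(x, z) = p(x | z) p(z); the observed-data likelihood is obtained by marginalising,

\[ p(x) = \sum_{z} p(x \mid z)\, p(z), \]

and Bayesian inference over the latent variables amounts to computing the posterior

\[ p(z \mid x) = \frac{p(x \mid z)\, p(z)}{p(x)}. \]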
Automatic speech recognition (ASR) allows multimedia content to be transcribed from acoustic waveforms into word sequences. It is an exemplar of a class of machine learning applications where increasing compute capability is enabling new industries such as automatic speech analytics. Speech analytics helps customer service call centers search through recorded content, track service quality, and provide early detection of service issues. Fast and efficient ASR enables the economical application of a wide range of text-based data analytics to multimedia content, opening the door to many possibilities.
In this chapter, we describe our approach for scalable parallelization of the most challenging component of ASR: the speech inference engine. This component takes a sequence of audio features extracted from a speech waveform as input, compares them iteratively to a speech model, and produces the most likely interpretation of the speech waveform as a word sequence. The speech model is a database of acoustic characteristics, word pronunciations, and phrases from a particular language. Speech models for natural languages are represented with large irregular graphs consisting of millions of states and arcs. Referencing these models involves accessing an unpredictable data working set guided by “what was said” in the speech input. The inference process is highly challenging to parallelize efficiently.
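As a toy illustration of what the inference engine computes (the real engine traverses much larger, irregular graphs under pruning strategies), the most likely state sequence for a small HMM-style model can be found with the Viterbi recursion; everything below is a simplified sketch of ours, not the system described in the chapter.

    import numpy as np

    def viterbi(log_init, log_trans, log_emit, obs):
        """Most likely hidden state sequence for a small HMM.

        log_init  : (S,)    log initial state probabilities
        log_trans : (S, S)  log_trans[i, j] = log p(state j | state i)
        log_emit  : (S, V)  log emission probabilities
        obs       : sequence of observation indices
        """
        S = len(log_init)
        T = len(obs)
        delta = log_init + log_emit[:, obs[0]]   # best log-score ending in each state
        back = np.zeros((T, S), dtype=int)       # backpointers

        for t in range(1, T):
            scores = delta[:, None] + log_trans  # (S, S): prev state -> next state
            back[t] = scores.argmax(axis=0)
            delta = scores.max(axis=0) + log_emit[:, obs[t]]

        # Trace back the best path from the highest-scoring final state.
        path = [int(delta.argmax())]
        for t in range(T - 1, 0, -1):
            path.append(int(back[t][path[-1]]))
        return path[::-1]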
We demonstrate that parallelizing an application is much more than recoding the program in another language. It requires careful consideration of data, task, and runtime concerns to successfully exploit the full parallelization potential of an application.
This book attempts to aggregate state-of-the-art research in parallel and distributed machine learning. We believe that parallelization provides a key pathway for scaling up machine learning to large datasets and complex methods. Although large-scale machine learning has been increasingly popular in both industrial and academic research communities, there has been no singular resource covering the variety of approaches recently proposed. We did our best to assemble the most representative contemporary studies in one volume. While each contributed chapter concentrates on a distinct approach and problem, together with their references they provide a comprehensive view of the field.
We believe that the book will be useful to the broad audience of researchers, practitioners, and anyone who wants to grasp the future of machine learning. To smooth the ramp-up for beginners, the first five chapters provide introductory material on machine learning algorithms and parallel computing platforms. Although the book gets deeply technical in some parts, the reader is assumed to have only basic prior knowledge of machine learning and parallel/distributed computing, along with college-level mathematical maturity. We hope that an engineering undergraduate who is familiar with the notion of a classifier and has had some exposure to threads, MPI, or MapReduce will be able to understand the majority of the book's content. We also hope that a seasoned expert will find this book full of new, interesting ideas to inspire future research in the area.
Mining frequent subtrees in a database of rooted and labeled trees is an important problem in many domains, ranging from phylogenetic analysis to biochemistry and from linguistic parsing to XML data analysis. In this work, we revisit this problem and develop an architecture-conscious solution targeting emerging multicore systems. Specifically, we identify a sequence of memory-related optimizations that significantly improve the spatial and temporal locality of a state-of-the-art sequential algorithm – alleviating the effects of memory latency. Additionally, these optimizations are shown to reduce the pressure on the front-side bus, an important consideration in the context of large-scale multicore architectures. We then demonstrate that these optimizations, although necessary, are not sufficient for efficient parallelization on multicores, primarily because of parametric and data-driven factors that make load balancing a significant challenge. To address this challenge, we present a methodology that adaptively and automatically modulates the type and granularity of the work being shared among different cores. The resulting algorithm achieves near-perfect parallel efficiency on up to 16 processors on challenging real-world applications. The optimizations we present have general-purpose utility, and a key outcome is the development of a general-purpose scheduling service for moldable task scheduling on emerging multicore systems.
The field of knowledge discovery is concerned with extracting actionable knowledge from data efficiently. Although most of the early work in this field focused on mining simple transactional datasets, recently there has been a significant shift toward analyzing data with complex structure such as trees and graphs.
Computer vision is a challenging application area for learning algorithms. For instance, the task of object detection, which remains largely unsolved, is a critical problem for many systems such as mobile robots. In order to interact with the world, robots must be able to locate and recognize large numbers of objects accurately and at reasonable speeds. Unfortunately, off-the-shelf computer vision algorithms do not yet achieve sufficiently high detection performance for these applications. A key difficulty with many existing algorithms is that they are unable to take advantage of large numbers of examples. As a result, they must rely heavily on prior knowledge and hand-engineered features that account for the many kinds of errors that can occur. In this chapter, we present two methods for improving performance by scaling up learning algorithms to large datasets: (1) using graphics processing units (GPUs) and distributed systems to scale up the standard components of computer vision algorithms and (2) using GPUs to automatically learn high-quality feature representations using deep belief networks (DBNs). These methods are capable of not only achieving high performance but also removing much of the need for hand-engineering common in computer vision algorithms.
The fragility of many vision algorithms comes from their lack of knowledge about the multitude of visual phenomena that occur in the real world. Whereas humans can intuit information about depth, occlusion, lighting, and even motion from still images, computer vision algorithms generally lack the ability to deal with these phenomena without being engineered to account for them in advance.
Facing the problem of clustering a multimillion-data-point collection, a machine learning practitioner may choose to apply the simplest clustering method possible, because it is hard to believe that fancier methods can be applicable to datasets of such scale. Whoever is about to adopt this approach should first weigh the following considerations:
Simple clustering methods are rarely effective. Indeed, four decades of research would not have been spent on data clustering if a simple method could solve the problem. Moreover, even the simplest methods may run for long hours on a modern PC, given a large-scale dataset. For example, consider a simple online clustering algorithm (which, we believe, is machine learning folklore): first initialize k clusters with one data point per cluster, then iteratively assign the remaining data points to their closest clusters (in Euclidean space); a sketch of this algorithm is given after these considerations. If k is small enough, we can run this algorithm on one machine, because it is unnecessary to keep the entire dataset in RAM. However, besides being slow, it will produce low-quality results, especially when the data is high-dimensional.
State-of-the-art clustering methods can scale well, which we aim to justify in this chapter.
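To make the first consideration concrete, here is a sketch of the folklore online algorithm described above (our own rendering; a single pass over the data, so the full dataset never needs to be in RAM):

    import numpy as np

    def online_cluster(points, k):
        """Folklore online clustering: seed k clusters with the first k points,
        then assign each remaining point to its nearest centroid (Euclidean),
        updating that centroid as the running mean of its members."""
        centroids = [np.asarray(p, dtype=float) for p in points[:k]]
        counts = [1] * k
        labels = list(range(k))

        for p in points[k:]:
            p = np.asarray(p, dtype=float)
            dists = [np.linalg.norm(p - c) for c in centroids]
            j = int(np.argmin(dists))
            counts[j] += 1
            centroids[j] += (p - centroids[j]) / counts[j]  # incremental mean
            labels.append(j)
        return labels, centroids

Even in this streaming form the result depends on the order in which points arrive, and nearest-centroid assignments in Euclidean space degrade as dimensionality grows, which is exactly the quality problem noted above.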
With the deployment of large computational facilities (such as Amazon.com's EC2, IBM's BlueGene, and HP's XC), the parallel computing paradigm is probably the only currently available option for tackling gigantic data processing tasks. Parallel methods are becoming an integral part of any data processing system and are thus receiving special attention (e.g., universities introduce parallel methods into their core curricula; see Johnson et al., 2008).