The term machine learning refers to the automated detection of meaningful patterns in data. In the past couple of decades it has become a common tool in almost any task that requires information extraction from large data sets. We are surrounded by machine learning–based technology: Search engines learn how to bring us the best results (while placing profitable ads), antispam software learns to filter our email messages, and credit card transactions are secured by software that learns how to detect fraud. Digital cameras learn to detect faces, and intelligent personal assistant applications on smartphones learn to recognize voice commands. Cars are equipped with accident-prevention systems built using machine learning algorithms. Machine learning is also widely used in scientific applications such as bioinformatics, medicine, and astronomy.
One common feature of all of these applications is that, in contrast to more traditional uses of computers, the patterns that need to be detected are too complex for a human programmer to provide an explicit, fine-detailed specification of how such tasks should be executed. Taking our cue from intelligent beings, many of whose skills are acquired or refined by learning from experience rather than by following explicit instructions, machine learning tools are concerned with endowing programs with the ability to “learn” and adapt.
The first goal of this book is to provide a rigorous, yet easy-to-follow, introduction to the main concepts underlying machine learning: What is learning?
In the previous chapter we described the SVM paradigm for learning halfspaces in high dimensional feature spaces. This enables us to enrich the expressive power of halfspaces by first mapping the data into a high dimensional feature space, and then learning a linear predictor in that space. This is similar to the AdaBoost algorithm, which learns a composition of a halfspace over base hypotheses. While this approach greatly extends the expressiveness of halfspace predictors, it raises both sample complexity and computational complexity challenges. In the previous chapter we tackled the sample complexity issue using the concept of margin. In this chapter we tackle the computational complexity challenge using the method of kernels.
We start the chapter by describing the idea of embedding the data into a high dimensional feature space. We then introduce the idea of kernels. A kernel is a type of similarity measure between instances. The special property of kernel similarities is that they can be viewed as inner products in some Hilbert space (or Euclidean space of some high dimension) into which the instance space is virtually embedded. We introduce the “kernel trick,” which enables computationally efficient implementation of learning without explicitly handling the high dimensional representation of the domain instances. Kernel-based learning algorithms, and in particular kernel-SVM, are very useful and popular machine learning tools. Their success may be attributed both to their flexibility in accommodating domain-specific prior knowledge and to a well-developed set of efficient implementation algorithms.
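To make the “inner product in an embedded space” view concrete, the following minimal sketch (an illustration, not taken from the text) checks numerically that the degree-2 polynomial kernel value (1 + ⟨x, x′⟩)² coincides with an ordinary inner product after an explicit feature map ψ from R² to R⁶. The kernel trick consists of evaluating only the kernel expression and never constructing ψ.

```python
import numpy as np

def poly_kernel(x, xp):
    # Degree-2 polynomial kernel k(x, x') = (1 + <x, x'>)^2,
    # evaluated directly in the original 2-dimensional instance space.
    return (1.0 + np.dot(x, xp)) ** 2

def feature_map(x):
    # Explicit embedding psi(x) into R^6 whose standard inner product
    # reproduces the kernel above; the kernel trick avoids ever forming it.
    x1, x2 = x
    return np.array([1.0,
                     np.sqrt(2.0) * x1,
                     np.sqrt(2.0) * x2,
                     x1 * x1,
                     x2 * x2,
                     np.sqrt(2.0) * x1 * x2])

x, xp = np.array([0.3, -1.2]), np.array([2.0, 0.5])
assert np.isclose(poly_kernel(x, xp), np.dot(feature_map(x), feature_map(xp)))
```

For higher polynomial degrees, or for Gaussian kernels, the explicit embedding becomes enormous or infinite dimensional, which is exactly why evaluating the kernel directly is the computationally viable route.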
In this chapter we introduce convex learning problems. Convex learning comprises an important family of learning problems, mainly because most of what we can learn efficiently falls into it. We have already encountered linear regression with the squared loss and logistic regression, which are convex problems, and indeed they can be learned efficiently. We have also seen nonconvex problems, such as halfspaces with the 0-1 loss, which is known to be computationally hard to learn in the unrealizable case.
In general, a convex learning problem is a problem whose hypothesis class is a convex set, and whose loss function is a convex function for each example. We begin the chapter with some required definitions of convexity. Besides convexity, we will define Lipschitzness and smoothness, which are additional properties of the loss function that facilitate successful learning. We next turn to defining convex learning problems and demonstrate the necessity for further constraints such as boundedness and Lipschitzness or smoothness. We define these more restricted families of learning problems and claim that convex-smooth-bounded and convex-Lipschitz-bounded problems are learnable. These claims will be proven in the next two chapters, in which we will present two learning paradigms that successfully learn all problems that are either convex-Lipschitz-bounded or convex-smooth-bounded.
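As a brief reminder of these notions (stated here in standard form, not necessarily in the text's exact notation): a set C is convex if αu + (1 − α)v ∈ C for every u, v ∈ C and α ∈ [0, 1]; a function f is convex if f(αu + (1 − α)v) ≤ αf(u) + (1 − α)f(v) for all such u, v, α; f is ρ-Lipschitz if |f(u) − f(v)| ≤ ρ‖u − v‖; and f is β-smooth if its gradient is β-Lipschitz.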
Finally, in Section 12.3, we show how one can handle some nonconvex problems by minimizing “surrogate” loss functions that are convex (instead of the original nonconvex loss function). Surrogate convex loss functions give rise to efficient solutions but might increase the risk of the learned predictor.
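A canonical example of such a surrogate, given here purely as an illustration, is learning halfspaces with the 0−1 loss: the hinge loss ℓ(w, (x, y)) = max{0, 1 − y⟨w, x⟩} is convex in w and upper bounds the 0−1 loss, so minimizing it is computationally tractable, at the price that the resulting guarantee bounds the hinge risk rather than the 0−1 risk directly.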
Boosting is an algorithmic paradigm that grew out of a theoretical question and became a very practical machine learning tool. The boosting approach uses a generalization of linear predictors to address two major issues that have been raised earlier in the book. The first is the bias-complexity tradeoff. We have seen (in Chapter 5) that the error of an ERM learner can be decomposed into a sum of approximation error and estimation error. The more expressive the hypothesis class the learner is searching over, the smaller the approximation error is, but the larger the estimation error becomes. A learner is thus faced with the problem of picking a good tradeoff between these two considerations. The boosting paradigm allows the learner to have smooth control over this tradeoff. The learning starts with a basic class (that might have a large approximation error), and as it progresses the class that the predictor may belong to grows richer.
The second issue that boosting addresses is the computational complexity of learning. As seen in Chapter 8, for many interesting concept classes the task of finding an ERM hypothesis may be computationally infeasible. A boosting algorithm amplifies the accuracy of weak learners. Intuitively, one can think of a weak learner as an algorithm that uses a simple “rule of thumb” to output a hypothesis that comes from an easy-to-learn hypothesis class and performs just slightly better than a random guess.
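As a deliberately minimal sketch of how such amplification can be organized, the following Python loop implements the standard AdaBoost-style weight update. The callable weak_learner and the {−1, +1} label encoding are assumptions of this sketch, not notation taken from the text.

```python
import numpy as np

def adaboost(X, y, weak_learner, T):
    """Sketch of the AdaBoost weight-update loop.

    weak_learner(X, y, dist) is a hypothetical callable returning a hypothesis
    h (a function from an example to {-1, +1}) whose weighted error under the
    distribution dist is slightly below 1/2; y is a NumPy array of +/-1 labels.
    """
    m = len(y)
    dist = np.full(m, 1.0 / m)            # uniform initial distribution over examples
    hypotheses, weights = [], []
    for _ in range(T):
        h = weak_learner(X, y, dist)
        preds = np.array([h(x) for x in X])
        eps = dist[preds != y].sum()      # weighted training error of h
        w = 0.5 * np.log((1.0 - eps) / eps)
        dist = dist * np.exp(-w * y * preds)
        dist /= dist.sum()                # re-normalize to a distribution
        hypotheses.append(h)
        weights.append(w)
    # Final predictor: sign of the weighted vote of the weak hypotheses.
    return lambda x: np.sign(sum(w * h(x) for w, h in zip(weights, hypotheses)))
```

Each round re-weights the training examples so that the next weak hypothesis is forced to concentrate on the examples the current combination gets wrong; this is the mechanism by which the accuracy of the simple “rules of thumb” is amplified.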
The subject of this book is automated learning, or, as we will more often call it, Machine Learning (ML). That is, we wish to program computers so that they can “learn” from input available to them. Roughly speaking, learning is the process of converting experience into expertise or knowledge. The input to a learning algorithm is training data, representing experience, and the output is some expertise, which usually takes the form of another computer program that can perform some task. Seeking a formal-mathematical understanding of this concept, we will have to be more explicit about what we mean by each of the involved terms: What is the training data our programs will access? How can the process of learning be automated? How can we evaluate the success of such a process (namely, the quality of the output of a learning program)?
WHAT IS LEARNING?
Let us begin by considering a couple of examples from naturally occurring animal learning. Some of the most fundamental issues in ML already arise in that context, with which we are all familiar.
Bait Shyness – Rats Learning to Avoid Poisonous Baits: When rats encounter food items with a novel look or smell, they will first eat very small amounts, and subsequent feeding will depend on the flavor of the food and its physiological effect. If the food produces an ill effect, the novel food will often be associated with the illness, and subsequently the rats will not eat it.