This chapter is not intended to be a complete discussion of machine learning. We concentrate on a small number of ideas, and emphasize how to deal with very large data sets. Especially important is how we exploit parallelism to build models of the data. We consider the classical "perceptron" approach to learning a data classifier, where a hyperplane that separates two classes is sought. Then, we look at more modern techniques involving support-vector machines. Like perceptrons, these methods look for a hyperplane that best divides the classes, so that few, if any, members of the training set lie close to that hyperplane. We next consider nearest-neighbor techniques, where an example is classified according to the class(es) of its nearest neighbors in some space. We end with a discussion of decision trees, which are branching programs for predicting the class of an example.
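To make the perceptron idea concrete, here is a minimal sketch of the classical training loop, assuming NumPy, labels in {-1, +1}, a bias term folded into the feature vector as a constant-1 component, and a learning rate `eta` chosen by us; the chapter develops the algorithm, and how to parallelize it, in detail.

```python
import numpy as np

def perceptron_train(X, y, eta=0.1, epochs=100):
    """Learn a separating hyperplane w, so that sign(w . x)
    predicts the class of example x.

    X : (n, d) array of training examples (append a constant 1
        feature to each row if a bias term is wanted).
    y : length-n vector of labels in {-1, +1}.
    """
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        updated = False
        for xi, yi in zip(X, y):
            # A point is misclassified when yi * (w . xi) <= 0;
            # nudge the hyperplane toward the misclassified point.
            if yi * np.dot(w, xi) <= 0:
                w += eta * yi * xi
                updated = True
        if not updated:
            # No mistakes in a full pass: the training set is
            # perfectly separated and the algorithm has converged.
            break
    return w
```

If the two classes are linearly separable, this loop is guaranteed to terminate; if they are not, it runs for the full `epochs` passes, which is one reason the chapter turns to support-vector machines next.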