Scaling up Machine Learning: Parallel and Distributed Approaches

Edited by Ron Bekkerman, LinkedIn Corporation, Mountain View, California, Mikhail Bilenko, Microsoft Research, Redmond, Washington, John Langford, Yahoo! Research, New York

Publisher: Cambridge University Press
Publication date: 05 February 2012; 30 December 2011
ISBN: 9781139042918; 9780521192248; 9781108461740
DOI: https://doi.org/10.1017/CBO9781139042918
Dimensions: 253 x 215 mm
Weight & Pages: 1kg, 492 Pages
Dimensions: 254 x 178 mm
Weight & Pages: 1kg, 491 Pages

Subjects: Computer Science, Pattern Recognition and Machine Learning, Distributed, Networked and Mobile Computing

This book presents an integrated collection of representative approaches for scaling up machine learning and data mining methods on parallel and distributed computing platforms. Demand for parallelizing learning algorithms is highly task-specific: in some settings it is driven by the enormous dataset sizes, in others by model complexity or by real-time performance requirements. Making task-appropriate algorithm and platform choices for large-scale machine learning requires understanding the benefits, trade-offs and constraints of the available options. Solutions presented in the book cover a range of parallelization platforms from FPGAs and GPUs to multi-core systems and commodity clusters, concurrent programming frameworks including CUDA, MPI, MapReduce and DryadLINQ, and learning settings (supervised, unsupervised, semi-supervised and online learning). Extensive coverage of parallelization of boosted trees, SVMs, spectral clustering, belief propagation and other popular learning algorithms, and deep dives into several applications, make the book equally useful for researchers, students and practitioners.

‘One of the landmark achievements of our time is the ability to extract value from large volumes of data. Engineering and algorithmic developments on this front have gelled substantially in recent years, and are quickly being reduced to practice in widely available, reusable forms. This book provides a broad and timely snapshot of the state of developments in scalable machine learning, which should be of interest to anyone who wishes to understand and extend the state of the art in analyzing data.’

Joseph M. Hellerstein - University of California, Berkeley

‘This is a book that every machine learning practitioner should keep in their library.’

Yoram Singer - Google Inc.

‘The contributions in this book run the gamut from frameworks for large-scale learning to parallel algorithms to applications, and contributors include many of the top people in this burgeoning subfield. Overall this book is an invaluable resource for anyone interested in the problem of learning from and working with big datasets.’

William W. Cohen - Carnegie Mellon University, Pennsylvania

‘This unique, timely book provides a 360-degree view and understanding of both conceptual and practical issues that arise when implementing leading machine learning algorithms on a wide range of parallel and high-performance computing platforms. It will serve as an indispensable handbook for the practitioner of large-scale data analytics and a guide to dealing with BIG data and making sound choices for efficiently applying learning algorithms to them. It can also serve as the basis for an attractive graduate course on parallel/distributed machine learning and data mining.’

Joydeep Ghosh - University of Texas
