Mining of Massive Datasets

Publisher: Cambridge University Press

Publication date: 05 June 2012; 27 October 2011

ISBN: 9781139058452

DOI: https://doi.org/10.1017/CBO9781139058452

Subjects: Knowledge Management, Databases and Data Mining, Computer Science, Computational Statistics, Machine Learning and Information Science, Statistics and Probability

The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and that can be applied to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. The PageRank idea and related tricks for organizing the Web are covered next. Other chapters cover the problems of finding frequent itemsets and clustering. The final chapters cover two applications, recommendation systems and Web advertising, each vital in e-commerce. Written by two authorities in database and Web technologies, this book is essential reading for students and practitioners alike.

Accessibility Information

Accessibility compliance for the PDF of this book is currently unknown and may be updated in the future.

Contents

Frontmatter (pp. i-iv)

Contents (pp. v-viii)

Preface (pp. ix-x)

1 - Data Mining (pp. 1-17)

2 - Large-Scale File Systems and Map-Reduce (pp. 18-52)

3 - Finding Similar Items (pp. 53-107)

4 - Mining Data Streams (pp. 108-138)

5 - Link Analysis (pp. 139-175)

6 - Frequent Itemsets (pp. 176-212)

7 - Clustering (pp. 213-251)

8 - Advertising on the Web (pp. 252-276)

9 - Recommendation Systems (pp. 277-309)

Index (pp. 310-315)
