Skip to main content Accessibility help
×
Hostname: page-component-76fb5796d-25wd4 Total loading time: 0 Render date: 2024-04-25T11:53:54.783Z Has data issue: false hasContentIssue false

Preface

Published online by Cambridge University Press:  08 August 2009

Ronen Feldman
Affiliation:
Bar-Ilan University, Israel
James Sanger
Affiliation:
ABS Ventures, Boston, Massachusetts
Get access

Summary

The information age has made it easy to store large amounts of data. The proliferation of documents available on the Web, on corporate intranets, on news wires, and elsewhere is overwhelming. However, although the amount of data available to us is constantly increasing, our ability to absorb and process this information remains constant. Search engines only exacerbate the problem by making more and more documents available in a matter of a few key strokes.

Text mining is a new and exciting research area that tries to solve the information overload problem by using techniques from data mining, machine learning, natural language processing (NLP), information retrieval (IR), and knowledge management. Text mining involves the preprocessing of document collections (text categorization, information extraction, term extraction), the storage of the intermediate representations, the techniques to analyze these intermediate representations (such as distribution analysis, clustering, trend analysis, and association rules), and visualization of the results.

This book presents a general theory of text mining along with the main techniques behind it. We offer a generalized architecture for text mining and outline the algorithms and data structures typically used by text mining systems.

The book is aimed at the advanced undergraduate students, graduate students, academic researchers, and professional practitioners interested in complete coverage of the text mining field. We have included all the topics critical to people who plan to develop text mining systems or to use them.

Type
Chapter
Information
The Text Mining Handbook
Advanced Approaches in Analyzing Unstructured Data
, pp. x - xii
Publisher: Cambridge University Press
Print publication year: 2006

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

  • Preface
  • Ronen Feldman, Bar-Ilan University, Israel, James Sanger, ABS Ventures, Boston, Massachusetts
  • Book: The Text Mining Handbook
  • Online publication: 08 August 2009
  • Chapter DOI: https://doi.org/10.1017/CBO9780511546914.001
Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

  • Preface
  • Ronen Feldman, Bar-Ilan University, Israel, James Sanger, ABS Ventures, Boston, Massachusetts
  • Book: The Text Mining Handbook
  • Online publication: 08 August 2009
  • Chapter DOI: https://doi.org/10.1017/CBO9780511546914.001
Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

  • Preface
  • Ronen Feldman, Bar-Ilan University, Israel, James Sanger, ABS Ventures, Boston, Massachusetts
  • Book: The Text Mining Handbook
  • Online publication: 08 August 2009
  • Chapter DOI: https://doi.org/10.1017/CBO9780511546914.001
Available formats
×