Hostname: page-component-6766d58669-l4t7p Total loading time: 0 Render date: 2026-05-15T21:44:00.581Z Has data issue: false hasContentIssue false

Data mining for building knowledge bases: techniques, architectures and applications

Published online by Cambridge University Press:  31 March 2016

Alfred Krzywicki
Affiliation:
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia e-mail: alfredk@cse.unsw.edu.au, wobcke@cse.unsw.edu.au, mike@cse.unsw.edu.au, jcalvo@cse.unsw.edu.au, compton@cse.unsw.edu.au
Wayne Wobcke
Affiliation:
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia e-mail: alfredk@cse.unsw.edu.au, wobcke@cse.unsw.edu.au, mike@cse.unsw.edu.au, jcalvo@cse.unsw.edu.au, compton@cse.unsw.edu.au
Michael Bain
Affiliation:
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia e-mail: alfredk@cse.unsw.edu.au, wobcke@cse.unsw.edu.au, mike@cse.unsw.edu.au, jcalvo@cse.unsw.edu.au, compton@cse.unsw.edu.au
John Calvo Martinez
Affiliation:
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia e-mail: alfredk@cse.unsw.edu.au, wobcke@cse.unsw.edu.au, mike@cse.unsw.edu.au, jcalvo@cse.unsw.edu.au, compton@cse.unsw.edu.au
Paul Compton
Affiliation:
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia e-mail: alfredk@cse.unsw.edu.au, wobcke@cse.unsw.edu.au, mike@cse.unsw.edu.au, jcalvo@cse.unsw.edu.au, compton@cse.unsw.edu.au

Abstract

Data mining techniques for extracting knowledge from text have been applied extensively to applications including question answering, document summarisation, event extraction and trend monitoring. However, current methods have mainly been tested on small-scale customised data sets for specific purposes. The availability of large volumes of data and high-velocity data streams (such as social media feeds) motivates the need to automatically extract knowledge from such data sources and to generalise existing approaches to more practical applications. Recently, several architectures have been proposed for what we call knowledge mining: integrating data mining for knowledge extraction from unstructured text (possibly making use of a knowledge base), and at the same time, consistently incorporating this new information into the knowledge base. After describing a number of existing knowledge mining systems, we review the state-of-the-art literature on both current text mining methods (emphasising stream mining) and techniques for the construction and maintenance of knowledge bases. In particular, we focus on mining entities and relations from unstructured text data sources, entity disambiguation, entity linking and question answering. We conclude by highlighting general trends in knowledge mining research and identifying problems that require further research to enable more extensive use of knowledge bases.

Information

Type
Articles
Copyright
© Cambridge University Press, 2016 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable