Hostname: page-component-6766d58669-bp2c4 Total loading time: 0 Render date: 2026-05-22T09:23:28.103Z Has data issue: false hasContentIssue false

Keyword extraction from emails*

Published online by Cambridge University Press:  09 September 2016

S. LAHIRI
Affiliation:
University of Michigan, Ann Arbor, MI, USA 48109 e-mail: lahiri@umich.edu, mihalcea@umich.edu
R. MIHALCEA
Affiliation:
University of Michigan, Ann Arbor, MI, USA 48109 e-mail: lahiri@umich.edu, mihalcea@umich.edu
P.-H. LAI
Affiliation:
Samsung Research America, Richardson, TX, USA 75082 e-mail: s.lai@sra.samsung.com

Abstract

Emails constitute an important genre of online communication. Many of us are often faced with the daunting task of sifting through increasingly large amounts of emails on a daily basis. Keywords extracted from emails can help us combat such information overload by allowing a systematic exploration of the topics contained in emails. Existing literature on keyword extraction has not covered the email genre, and no human-annotated gold standard datasets are currently available. In this paper, we introduce a new dataset for keyword extraction from emails, and evaluate supervised and unsupervised methods for keyword extraction from emails. The results obtained with our supervised keyword extraction system (38.99% F-score) improve over the results obtained with the best performing systems participating in the SemEval 2010 keyword extraction task.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2016 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable