Natural Language Engineering: Volume 17 - Issue 4

Machine learning for query formulation in question answering
CHRISTOF MONZ
Published online by Cambridge University Press:

05 January 2011, pp. 425-454
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Research on question answering dates back to the 1960s but has more recently been revisited as part of TREC's evaluation campaigns, where question answering is addressed as a subarea of information retrieval that focuses on specific answers to a user's information need. Whereas document retrieval systems aim to return the documents that are most relevant to a user's query, question answering systems aim to return actual answers to a users question. Despite this difference, question answering systems rely on information retrieval components to identify documents that contain an answer to a user's question. The computationally more expensive answer extraction methods are then applied only to this subset of documents that are likely to contain an answer. As information retrieval methods are used to filter the documents in the collection, the performance of this component is critical as documents that are not retrieved are not analyzed by the answer extraction component. The formulation of queries that are used for retrieving those documents has a strong impact on the effectiveness of the retrieval component. In this paper, we focus on predicting the importance of terms from the original question. We use model tree machine learning techniques in order to assign weights to query terms according to their usefulness for identifying documents that contain an answer. Term weights are learned by inspecting a large number of query formulation variations and their respective accuracy in identifying documents containing an answer. Several linguistic features are used for building the models, including part-of-speech tags, degree of connectivity in the dependency parse tree of the question, and ontological information. All of these features are extracted automatically by using several natural language processing tools. Incorporating the learned weights into a state-of-the-art retrieval system results in statistically significant improvements in identifying answer-bearing documents.

Dependency-based n-gram models for general purpose sentence realisation
YUQING GUO, HAIFENG WANG, JOSEF VAN GENABITH
Published online by Cambridge University Press:

29 November 2010, pp. 455-483
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
This paper presents a general-purpose, wide-coverage, probabilistic sentence generator based on dependency n-gram models. This is particularly interesting as many semantic or abstract syntactic input specifications for sentence realisation can be represented as labelled bi-lexical dependencies or typed predicate-argument structures. Our generation method captures the mapping between semantic representations and surface forms by linearising a set of dependencies directly, rather than via the application of grammar rules as in more traditional chart-style or unification-based generators. In contrast to conventional n-gram language models over surface word forms, we exploit structural information and various linguistic features inherent in the dependency representations to constrain the generation space and improve the generation quality. A series of experiments shows that dependency-based n-gram models generalise well to different languages (English and Chinese) and representations (LFG and CoNLL). Compared with state-of-the-art generation systems, our general-purpose sentence realiser is highly competitive with the added advantages of being simple, fast, robust and accurate.

BLANC: Implementing the Rand index for coreference evaluation
M. RECASENS, E. HOVY
Published online by Cambridge University Press:

06 December 2010, pp. 485-510
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
This paper addresses the current state of coreference resolution evaluation, in which different measures (notably, MUC, B3, CEAF, and ACE-value) are applied in different studies. None of them is fully adequate, and their measures are not commensurate. We enumerate the desiderata for a coreference scoring measure, discuss the strong and weak points of the existing measures, and propose the BiLateral Assessment of Noun-Phrase Coreference, a variation of the Rand index created to suit the coreference task. The BiLateral Assessment of Noun-Phrase Coreference rewards both coreference and non-coreference links by averaging the F-scores of the two types, does not ignore singletons – the main problem with the MUC score – and does not inflate the score in their presence – a problem with the B3 and CEAF scores. In addition, its fine granularity is consistent over the whole range of scores and affords better discrimination between systems.

Assessing user simulation for dialog systems using human judges and automatic evaluation measures
HUA AI, DIANE LITMAN
Published online by Cambridge University Press:

01 February 2011, pp. 511-540
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
While different user simulations are built to assist dialog system development, there is an increasing need to quickly assess the quality of the user simulations reliably. Previous studies have proposed several automatic evaluation measures for this purpose. However, the validity of these evaluation measures has not been fully proven. We present an assessment study in which human judgments are collected on user simulation qualities as the gold standard to validate automatic evaluation measures. We show that a ranking model can be built using the automatic measures to predict the rankings of the simulations in the same order as the human judgments. We further show that the ranking model can be improved by using a simple feature that utilizes time-series analysis.

Learning opinions in user-generated web content
M. SOKOLOVA, G. LAPALME
Published online by Cambridge University Press:

11 March 2011, pp. 541-567
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
The user-generated Web content has been intensively analyzed in Information Extraction and Natural Language Processing research. Web-posted reviews of consumer goods are studied to find customer opinions about the products. We hypothesize that nonemotionally charged descriptions can be applied to predict those opinions. The descriptions may include indicators of product size (tall), commonplace (some), frequency of happening (often), and reviewer certainty (maybe). We first construct patterns of how the descriptions are used in consumer-written texts and then represent individual reviews through these patterns. We propose a semantic hierarchy that organizes individual words into opinion types. We run machine learning algorithms on five data sets of user-written product reviews: four are used in classification experiments, another one for regression and classification. The obtained results support the use of non-emotional descriptions in opinion learning.

NLE volume 17 issue 4 Cover and Front matter
Published online by Cambridge University Press:

14 September 2011, pp. f1-f2
- Article
- - You have access
- PDF
- Export citation

NLE volume 17 issue 4 Cover and Back matter
Published online by Cambridge University Press:

14 September 2011, pp. b1-b3
- Article
- - You have access
- PDF
- Export citation

Natural Language Engineering

Refine listing

Actions for selected content:

Volume 17 - Issue 4 - October 2011

Articles

Machine learning for query formulation in question answering

Dependency-based n-gram models for general purpose sentence realisation

BLANC: Implementing the Rand index for coreference evaluation

Assessing user simulation for dialog systems using human judges and automatic evaluation measures

Learning opinions in user-generated web content

Front Cover (OFC, IFC) and matter

NLE volume 17 issue 4 Cover and Front matter

Back Cover (IBC, OBC) and matter

NLE volume 17 issue 4 Cover and Back matter

Natural Language Engineering

Refine listing

Actions for selected content:

Save Search

Volume 17 - Issue 4 - October 2011

Articles

Machine learning for query formulation in question answering

Dependency-based n-gram models for general purpose sentence realisation

BLANC: Implementing the Rand index for coreference evaluation

Assessing user simulation for dialog systems using human judges and automatic evaluation measures

Learning opinions in user-generated web content

Front Cover (OFC, IFC) and matter

NLE volume 17 issue 4 Cover and Front matter

Back Cover (IBC, OBC) and matter

NLE volume 17 issue 4 Cover and Back matter