Hostname: page-component-6766d58669-tq7bh Total loading time: 0 Render date: 2026-05-24T18:23:34.174Z Has data issue: false hasContentIssue false

Adaptive graph walk-based similarity measures for parsed text

Published online by Cambridge University Press:  11 February 2013

EINAT MINKOV
Affiliation:
Department of Information Systems, University of Haifa, Haifa, Israel e-mail: einatm@is.haifa.ac.il
WILLIAM W. COHEN
Affiliation:
School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA e-mail: wcohen@cs.cmu.edu

Abstract

We consider a dependency-parsed text corpus as an instance of a labeled directed graph, where nodes represent words and weighted directed edges represent the syntactic relations between them. We show that graph walks, combined with existing techniques of supervised learning that model local and global information about the graph walk process, can be used to derive a task-specific word similarity measure in this graph. We also propose and evaluate a new learning method in this framework, a path-constrained graph walk variant, in which the walk process is guided by high-level knowledge about meaningful edge sequences (paths) in the graph. Empirical evaluation on the tasks of named entity coordinate term extraction and general word synonym extraction show that this framework is preferable to, or competitive with, vector-based models when learning is applied, and using small to moderate size text corpora.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2013 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable