Hostname: page-component-8448b6f56d-42gr6 Total loading time: 0 Render date: 2024-04-18T00:25:21.300Z Has data issue: false hasContentIssue false

Correcting real-word spelling errors by restoring lexical cohesion

Published online by Cambridge University Press:  28 February 2005

GRAEME HIRST
Affiliation:
Department of Computer Science, University of Toronto, Toronto, Ontario, Canada M5S 3G4 e-mail: gh@cs.toronto.edu, abm@cs.toronto.edu
ALEXANDER BUDANITSKY
Affiliation:
Department of Computer Science, University of Toronto, Toronto, Ontario, Canada M5S 3G4 e-mail: gh@cs.toronto.edu, abm@cs.toronto.edu

Abstract

Spelling errors that happen to result in a real word in the lexicon cannot be detected by a conventional spelling checker. We present a method for detecting and correcting many such errors by identifying tokens that are semantically unrelated to their context and are spelling variations of words that would be related to the context. Relatedness to context is determined by a measure of semantic distance initially proposed by Jiang and Conrath (1997). We tested the method on an artificial corpus of errors; it achieved recall of 23–50% and precision of 18–25%.

Type
Papers
Copyright
© 2005 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)