Hostname: page-component-89b8bd64d-72crv Total loading time: 0 Render date: 2026-05-09T12:43:57.945Z Has data issue: false hasContentIssue false

Improving syntactic rule extraction through deleting spurious links with translation span alignment

Published online by Cambridge University Press:  06 September 2013

JINGBO ZHU
Affiliation:
Natural Language Processing Laboratory, Northeastern University, Shenyang 110819, China e-mail: zhujingbo@mail.neu.edu.cn, liqiangneu@gmail.com, xiaotong@mail.neu.edu.cn
QIANG LI
Affiliation:
Natural Language Processing Laboratory, Northeastern University, Shenyang 110819, China e-mail: zhujingbo@mail.neu.edu.cn, liqiangneu@gmail.com, xiaotong@mail.neu.edu.cn
TONG XIAO
Affiliation:
Natural Language Processing Laboratory, Northeastern University, Shenyang 110819, China e-mail: zhujingbo@mail.neu.edu.cn, liqiangneu@gmail.com, xiaotong@mail.neu.edu.cn

Abstract

Most statistical machine translation systems typically rely on word alignments to extract translation rules. This approach would suffer from a practical problem that even one spurious word alignment link can prevent some desirable translation rules from being extracted. To address this issue, this paper presents two approaches, referred to as sub-tree alignment and phrase-based forced decoding methods, to automatically learn translation span alignments from parallel data. Then, we improve the translation rule extraction by deleting spurious links and inserting new links based on bilingual translation span correspondences. Some comparison experiments are designed to demonstrate the effectiveness of the proposed approaches.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2013 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable