Skip to main content Accessibility help
×
Hostname: page-component-848d4c4894-pftt2 Total loading time: 0 Render date: 2024-06-02T19:36:06.485Z Has data issue: false hasContentIssue false

9 - Reordering Techniques in Japanese and English Machine Translation

Published online by Cambridge University Press:  10 June 2019

Meng Ji
Affiliation:
University of Sydney
Michael Oakes
Affiliation:
University of Wolverhampton
Get access

Summary

English and Japanese have very different word order and form one of the most difficult language pairs for translating each other. In statistical machine translation, it is well known that the translation accuracy between distant languages such as English and Japanese, is significantly lower than that between similar languages such as English and French. In this article, we describe a series of works done at our research groups in NTT to solve the problems in statistical machine translation between language pairs with very different word order such as Japanese and English.

Type
Chapter
Information
Advances in Empirical Translation Studies
Developing Translation Resources and Technologies
, pp. 164 - 176
Publisher: Cambridge University Press
Print publication year: 2019

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Bahdanau, Dzmitry, Cho, Kyunghyun and Bengio, Yoshua (2014). Neural machine translation by jointly learning to align and translate. In Proceedings of the 3rd International Conference on Learning Representations. CoRR, abs/1409.0473.Google Scholar
Brown, Peter F., Della Pietra, Stephen A., Della Pietra, Vincent J. and Mercer, Robert L. (1993). The mathematics of machine translation: Parameter estimation, Computational Linguistics 19(2), 263311.Google Scholar
Cho, Kyunghyun, van Merrienboer, Bart, Gulcehre, Caglar, Bahdanau, Dzmitry, Bougares, Fethi, Schwenk, Holger and Bengio, Yoshua (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 17241734.Google Scholar
Dan, Han, Sudoh, Katsuhito, Wu, Xianchao, Doh, K., Tsukada, Hajime and Nagata, Masaaki (2012). Head finalization reordering for Chinese-to-Japanese machine translation. In Proceedings of Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July, pp. 5766.Google Scholar
Goto, Isao, Lu, Bin, Chow, Ka Po, Sumita, Eiichiro and Tsou, Benjamin K. (2013). Overview of the Patent Machine Translation Task at the NTCIR-10 Workshop. Proceedings of NTCIR-10 Workshop Meeting, NII, Tokyo, Japan, 18–21 June, pp. 559578.Google Scholar
Hayashi, Katsuhiko, and Nagata, Masaaki (2016). Empty element recovery by spinal parser operations. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 95100.Google Scholar
Hoshino, Sho, Miyao, Yusuke,Sudoh, Katsuhito, Hayashi, Katsuhiko and Nagata, Masaaki (2015). Discriminative preordering meets Kendall’s maximization. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, pp. 139144.Google Scholar
Hutchins, W. John (2010). Machine translation: A concise history, Journal of Translation Studies 13(12), 2970.Google Scholar
Isozaki, Hideki, Hirao, Tsutomu, Duh, Kevin, Sudoh, Katsuhito and Tsukada, Hajime (2010). Automatic evaluation of translation quality for distant language pairs. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing EMNLP 2010, MIT, Massachusetts, 9–11 October, pp. 944952.Google Scholar
Isozaki, Hideki, Sudoh, Katsuhito, Tsukada, Hajime and Duh, Kevin (2012). HPSG-based preprocessing for English-to-Japanese translation, ACM Transactions on Asian Language Information Processing 11(3) (September), Article 8, 8:11–8:16.Google Scholar
Koehn, Philipp, Och, Franz Josef and Marcu, Daniel (2003). Statistical phrase-based translation. Proceedings of the Joint Conference on Human Language Technologies and the Annual Meeting of the North American Chapter of the Association of Computational Linguistics. Edmonton, May–June, pp. 4854.Google Scholar
Luong, Thang, Pham, Hieu and Manning, Christopher D. (2015). Effective approaches to attention-based neural machine translation. In Proceedings of the 2015 Empirical Methods in Natural Language Processing. Lisbon: The Association for Computational Linguistics, pp. 14121421.Google Scholar
Papineni, Kishore, Roukos, Salim, Ward, Todd and Zhu, Wei-Jing (2002). BLEU: A method for automatic evaluation of machine translation. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, PA, pp. 311318.Google Scholar
RIBES (n.d.) Rank-Based Intuitive Bilingual Evaluation Score software. Available at: www.kecl.ntt.co.jp/icl/lirg/ribes/index.html.Google Scholar
Sudoh, Katsuhito and Nagata, Masaaki (2016). Chinese-to-Japanese patent machine translation based on syntactic pre-ordering for WAT 2016. In Proceedings of the 3rd Workshop on Asian Translation, Osaka, Japan, pp. 211215.Google Scholar
Sudoh, Katsuhito, Duh, Kevin, Tsukada, Hajime, Nagata, Masaaki, Xianchao, Wu, Matsuzaki, Takuya and Tsujii, Jun’ichi (2011). NTT-UT statistical machine translation in NTCIR-9 PatentMT. In Proceedings of NTCIR-9 Workshop Meeting. Tokyo: National Institute of Informatics, pp. 585592.Google Scholar
Sudoh, Katsuhito, Suzuki, Jun, Akiba, Yasuhiro, Tsukada, Hajime and Nagata, Masaaki (2014). Statistical machine translation system for patent sentences from English, Chinese, Korean to Japanese. In Proceedings of the 20th Annual Meeting of the Association for Natural Language Processing. Tokyo: The Association for Natural Language Processing, pp. 606609 (Japanese).Google Scholar
Suzuki, Jun, Isozaki, Hideki, Carreras, Xavier and Collins, Michael (2009). An empirical study of semi-supervised structured conditional models for dependency parsing. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore: Association for Computational Linguistics, pp. 551560.Google Scholar
Suzuki, Jun, Duh, Kevin and Nagata, Masaaki (2012). A joint natural language analysis method using extended Lagrangian relaxation. In Proceedings of the 18th Annual Meeting of the Association for Natural Language Processing. Hiroshima: The Association for Natural Language Processing of Japan, pp. 12841287 (in Japanese).Google Scholar
Takeno, Shunsuke, Nagata, Masaaki and Yamamoto, Kazuhide (2015). Empty category detection using path features and distributed case frames. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 13351340.Google Scholar
Takeno, Shunsuke, Nagata, Masaaki and Yamamoto, Kazuhide (2016). Integrating empty category detection into preordering machine translation. In Proceedings of the 3rd Workshop on Asian Translation, pp. 157165.Google Scholar
Takeno, Shunsuke, Nagata, Masaaki and Yamamoto, Kazuhide (2017). Controlling target features in neural machine translation via prefix constraints. In Proceedings of the 4th Workshop on Asian Translation. Osaka: Association of Computational Linguistics, pp. 5563.Google Scholar
Tanaka, Takaaki and Nagata, Masaaki (2013). Constructing a practical constituent parser from a Japanese treebank with function labels. In Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically Rich Languages. Seattle, WA: Association for Computational Linguistics, pp. 108118.Google Scholar
Tanaka, Takaaki and Nagata, Masaaki (2015). Word-based Japanese typed dependency parsing with grammatical function analysis. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Beijing: The Association for Computational Linguistics, pp. 237242.Google Scholar
Tanaka, Takaaki, Miyao, Yusuke, Asahara, Masayuki, Uematsu, Sumire, Kanayama, Hiroshi, Mori, Shinsuke and Yuji, Matsumoto (2016). Universal dependencies for Japanese. In Proceedings of the Tenth International Conference on Language Resources and Evaluation. Portorož, Slovenia: Association of Computational Linguistics.Google Scholar
Xia, Fei and McCord, Michael (2004). Improving a statistical MT system with automatically learned rewrite patterns. Proceedings of COLING 2004. Geneva: Association of Computational Linguistics, pp. 508514.Google Scholar

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×