This introductory text to statistical machine translation (SMT) provides all of the theories and methods needed to build a statistical machine translator, such as Google Language Tools and Babelfish. In general, statistical techniques allow automatic translation systems to be built quickly for any language-pair using only translated texts and generic software. With increasing globalization, statistical machine translation will be central to communication and commerce. Based on courses and tutorials, and classroom-tested globally, it is ideal for instruction or self-study, for advanced undergraduates and graduate students in computer science and/or computational linguistics, and researchers in natural language processing. The companion website provides open-source corpora and tool-kits.Read more
- The first introductory guide to this burgeoning field - takes readers step by step through theory and methods
- Class tested by the author at universities and conference tutorials
- Accompanying website provides additional exercises and links to further resources
Reviews & endorsements
"Philipp Koehn has provided the first comprehensive text for this rapidly growing field of statistical machine translation. This book is an invaluable resource for students, researcher, and software developers, providing a lucid and detailed presentation of all the important ideas needed to understand or create a state-of-the-art statistical machine translation system."
Robert C. Moore, Microsoft ResearchSee more reviews
"This is an excellent introduction for someone interested in statistical translation. It is quite readable..."
Jeffrey Putnam, Computing Reviews
Not yet reviewed
Be the first to review
Review was not posted due to profanity×
- Date Published: January 2010
- format: Hardback
- isbn: 9780521874151
- length: 446 pages
- dimensions: 254 x 179 x 25 mm
- weight: 1.02kg
- contains: 24 b/w illus. 70 exercises
- availability: Available
Table of Contents
Part I. Foundations:
2. Words, sentences, corpora
3. Probability theory
Part II. Core Methods:
4. Word-based models
5. Phrase-based models
7. Language models
Part III. Advanced Topics:
9. Discriminative training
10. Integrating linguistic information
11. Tree-based models
Find resources associated with this titleYour search for '' returned .
Type Name Unlocked * Format Size
This title is supported by one or more locked resources. Access to locked resources is granted exclusively by Cambridge University Press to instructors whose faculty status has been verified. To gain access to locked resources, instructors should sign in to or register for a Cambridge user account.
Please use locked resources responsibly and exercise your professional discretion when choosing how you share these materials with your students. Other instructors may wish to use locked resources for assessment purposes and their usefulness is undermined when the source files (for example, solution manuals or test banks) are shared online or via social networks.
Supplementary resources are subject to copyright. Instructors are permitted to view, print or download these resources for use in their teaching, but may not change them or use them for commercial gain.
If you are having problems accessing these resources please contact firstname.lastname@example.org.
Instructors have used or reviewed this title for the following courses
- Artificial Intelligence
- Language Engineering
- Machine Translation
- Natural Language Processing
- Natural Language Processing Techniques
- Statistical Machine Translation
- Statistical Natural Language Processing
Sorry, this resource is locked
Please register or sign in to request access. If you are having problems accessing these resources please email email@example.comRegister Sign in
You are now leaving the Cambridge University Press website. Your eBook purchase and download will be completed by our partner www.ebooks.com. Please see the permission section of the www.ebooks.com catalogue page for details of the print & copy limits on our eBooks.Continue ×