{"id":16092,"date":"2016-04-27T10:00:00","date_gmt":"2016-04-27T09:00:00","guid":{"rendered":"http:\/\/blog.journals.cambridge.org\/?p=16092"},"modified":"2016-04-27T14:30:52","modified_gmt":"2016-04-27T13:30:52","slug":"checking-in-on-grammar-checking","status":"publish","type":"post","link":"https:\/\/www.cambridge.org\/core\/blog\/2016\/04\/27\/checking-in-on-grammar-checking\/","title":{"rendered":"Checking in on grammar checking"},"content":{"rendered":"<div id=\"bsf_rt_marker\"><\/div><p><a href=\"http:\/\/journals.cambridge.org\/nle\/grammar\" target=\"_blank\"><strong>\u2018Checking in on Grammar Checking\u2019<\/strong><\/a> by Robert Dale is the latest Industry Watch column to be published in the journal <a href=\"http:\/\/journals.cambridge.org\/action\/displayJournal?jid=NLE\" target=\"_blank\"><em>Natural Language Engineering<\/em><\/a>.<\/p>\n<p>Reflecting back to 2004, industry expert Robert Dale reminds us of a time when Microsoft Word was the dominant software used for grammar checking. Bringing us up-to-date in 2016, Dale discusses the evolution, capabilities and current marketplace for grammar checking and its diverse range of users:\u00a0from academics, men on dating websites to the fifty top celebrities on Twitter.<\/p>\n<p>Below is an extract from the article, which is available to read in full <strong><a href=\"http:\/\/journals.cambridge.org\/nle\/grammar\" target=\"_blank\">here.<\/a><\/strong><\/p>\n<blockquote><p><strong>An appropriate time to re\ufb02ect<\/strong><br \/>\nI am writing this piece on a very special day. It\u2019s National Grammar Day, \u2018observed\u2019 (to use Wikipedia\u2019s crowdsourced choice of words) in the US on March 4th. The word \u2018observed\u2019 makes me think of citizens across the land going about their business throughout the day quietly and with a certain reverence; determined, on this day of all days, to ensure that their subjects agree with their verbs, to not their in\ufb01nitives split, and to avoid using prepositions to end their sentences with. I can\u2019t see it, really. I suspect that, for most people, National Grammar Day ranks some distance behind National Hug Day (January 21st) and National Cat Day (October 29th). And, at least in Poland and Lithuania, it has to compete with St Casimir\u2019s Day, also celebrated on March 4th. I suppose we could do a study to see whether Polish and Lithuanian speakers have poorer grammar than Americans on that day, but I doubt we\u2019d \ufb01nd a signi\ufb01cant di\ufb00erence. So National Grammar Day might not mean all that much to most people, but it does feel like an appropriate time to take stock of where the grammar checking industry has got to. I last wrote a piece on commercial grammar checkers for the Industry Watch column over 10 years ago (Dale 2004). At the time, there really was no alternative to the grammar checker in Microsoft Word. What\u2019s changed in the interim? And does anyone really need a grammar checker when so much content these days consists of generated-on-a-whim tweets and SMS messages?<\/p>\n<p><strong>The evolution of grammar checking<\/strong><br \/>\nGrammar checking software has evolved through three distinct paradigms. First-generation tools were based on simple pattern matching and string replacement, using tables of suspect strings and their corresponding corrections. For example, we might search a text for any occurrences of the string isnt and suggest replacing them by isn\u2019t. The basic technology here was pioneered by Bell Labs in the UNIX Writer\u2019s Workbench tools (Macdonald 1983) in the late 1970s and early 1980s, and was widely used in a range of more or less derivative commercial software products that appeared on the market in the early \u201980s. Anyone who can remember that far back might dimly recall using programs like RightWriter on the PC and Grammatik on the Mac. Second-generation tools embodied real syntactic processing. IBM\u2019s Epistle (Heidorn et al. 1982) was the \ufb01rst really visible foray into this space, and key members of the team that built that application went on to develop the grammar checker that, to this day, resides inside Microsoft Word (Heidorn 2000). These systems rely on large rule-based descriptions of permissible syntax, in combination with a variety of techniques for detecting ungrammatical elements and posing potential corrections for those errors. Perhaps not surprisingly, the third generation of grammar-checking software is represented by solutions that make use of statistical language models in one way or another. The most impressive of these is Google\u2019s context-aware spell checker (Whitelaw et al. 2009)\u2014when you start taking context into account, the boundary between spell checking and grammar checking gets a bit fuzzy. Google\u2019s entrance into a marketplace is enough to make anyone go weak at the knees, but there are other third-party developers brave enough to explore what\u2019s possible in this space. A recent attempt that looks interesting is Deep Grammar (www.deepgrammar.com). We might expect to \ufb01nd that modern grammar checkers draw on techniques from each of these three paradigms. You can get a long way using simple table lookup for common errors, so it would be daft to ignore that fact, but each generation adds the potential for further coverage and capability.<\/p><\/blockquote>\n<p>The remainder of the article discusses the following:<\/p>\n<ul>\n<li>Today&#8217;s grammar-checking marketplace<\/li>\n<li>Capabilities<\/li>\n<li>Who needs a grammar checker?<\/li>\n<\/ul>\n<p><a href=\"http:\/\/journals.cambridge.org\/nle\/grammar\" target=\"_blank\">&#8216;Checking in on grammar checking&#8217;<\/a> is an Open Access article.\u00a0You may also be interested in complimentary access to a <a href=\"http:\/\/journals.cambridge.org\/action\/displaySpecialArticle?jid=NLE&amp;bespokeId=14784\" target=\"_blank\">collection of related articles about grammar<\/a> published in Natural Language Engineering. These papers are fully available until 30th June 2016.<\/p>\n<p>Other recent Industry Watch articles by Robert Dale:<\/p>\n<ul>\n<li><a href=\"http:\/\/journals.cambridge.org\/nle\/translation\" target=\"_blank\">How to make money in the translation business<\/a><\/li>\n<li><a href=\"http:\/\/journals.cambridge.org\/nle\/cloud\" target=\"_blank\">NLP Meets the Cloud<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Reflecting back to 2004, industry expert Robert Dale reminds us of a time when Microsoft Word was the dominant software used for grammar checking. Bringing us up-to-date in 2016, Dale discusses the evolution, capabilities and current marketplace for grammar checking and its diverse range of users: from academics, men on dating websites to the fifty top celebrities on Twitter.<\/p>\n","protected":false},"author":341,"featured_media":16112,"comment_status":"open","ping_status":"open","sticky":true,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1593,1],"tags":[735,1836,1835,608,544],"coauthors":[],"class_list":["post-16092","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-computer-science","category-news","tag-computer-science","tag-grammar","tag-industry-watch","tag-language","tag-natural-language-engineering"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/posts\/16092","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/users\/341"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/comments?post=16092"}],"version-history":[{"count":0,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/posts\/16092\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/media\/16112"}],"wp:attachment":[{"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/media?parent=16092"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/categories?post=16092"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/tags?post=16092"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/coauthors?post=16092"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}