Identifying off-topic student essays without topic-specific training data
Published online by Cambridge University Press: 22 May 2006
Educational assessment applications, as well as other natural-language interfaces, need some mechanism for validating user responses. If the input provided to the system is infelicitous or uncooperative, the proper response may be simply to reject it, to route it to a bin for special processing, or to ask the user to modify the input. If problematic user input is instead handled as if it were normal input, this may degrade users' confidence in the software, or suggest ways in which they might try to “game” the system. Our specific task in this domain is the identification of student essays that are “off-topic”, that is, not written to the test question topic. Identification of off-topic essays is of great importance for the commercial essay evaluation system Criterion℠. The methods previously used for this task required 200–300 human-scored essays for training. However, there are situations in which no essays are available for training, such as when users (teachers) spontaneously write a new topic for their students. For such cases, we need a system that works reliably without training data. This paper describes an algorithm that detects when a student's essay is off-topic without requiring a set of topic-specific essays for training. The new system is comparable in performance to previous models that require topic-specific training essays, and it provides more detailed information about the way in which an essay diverges from the requested essay topic.
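The abstract does not spell out the algorithm itself, so the sketch below is only an illustrative baseline consistent with the stated idea, not necessarily the authors' method: compare a bag-of-words cosine similarity between the essay and its target prompt against its similarity to a pool of prompts from other topics, and flag the essay as off-topic when some other prompt matches it better than its own. All function names, prompts, and essays here are invented toy examples.

```python
import math
import re
from collections import Counter

def bow(text):
    """Lowercased bag-of-words term-frequency vector for a text."""
    return Counter(re.findall(r"[a-z']+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two Counter term vectors."""
    num = sum(a[w] * b[w] for w in set(a) & set(b))
    den = (math.sqrt(sum(v * v for v in a.values()))
           * math.sqrt(sum(v * v for v in b.values())))
    return num / den if den else 0.0

def flag_off_topic(essay, target_prompt, reference_prompts):
    """Flag an essay as off-topic if it is more similar to some
    reference prompt than to its own target prompt.  Returns a
    (flag, similarity-by-prompt) pair, so the caller can also see
    *how* the essay diverges from the requested topic."""
    essay_vec = bow(essay)
    sims = {p: cosine(essay_vec, bow(p))
            for p in [target_prompt] + reference_prompts}
    best = max(sims, key=sims.get)
    return best != target_prompt, sims

# Illustrative toy data (invented for this sketch):
TARGET = "Should students be required to wear school uniforms?"
REFERENCE = ["Describe a pet that has been important in your life."]
```

Because this baseline needs only the prompt texts themselves, it requires no topic-specific training essays, which is the property the abstract emphasizes; the returned per-prompt similarities also give some diagnostic detail about where the essay's vocabulary actually points.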
© 2006 Cambridge University Press