Hostname: page-component-89b8bd64d-shngb Total loading time: 0 Render date: 2026-05-07T04:57:16.127Z Has data issue: false hasContentIssue false

Automated evaluation of the quality of ideas in compositions based on concept maps

Published online by Cambridge University Press:  24 May 2021

Li-Ping Yang
Affiliation:
Collaborative Innovation Center of Assessment for Basic Education Quality, Beijing Normal University, Beijing, P. R. China
Tao Xin*
Affiliation:
Collaborative Innovation Center of Assessment for Basic Education Quality, Beijing Normal University, Beijing, P. R. China
Fang Luo
Affiliation:
Department of Psychology, Beijing Normal University, Beijing, P. R. China
Sheng Zhang
Affiliation:
Collaborative Innovation Center of Assessment for Basic Education Quality, Beijing Normal University, Beijing, P. R. China
Xue-Tao Tian
Affiliation:
School of Computer and Information Technology, Beijing Jiaotong University, Beijing, P. R. China
*
*Corresponding author. E-mails: xintao@bnu.edu.cn
Rights & Permissions [Opens in a new window]

Abstract

Nowadays, automated essay evaluation (AEE) systems play an important role in evaluating essays and have been successfully used in large-scale writing assessments. However, existing AEE systems mostly focus on grammar or shallow content measurements rather than higher-order traits such as ideas. This paper proposes a new formulation of graph-based features for concept maps using word embeddings to evaluate the quality of ideas for Chinese compositions. The concept map derived from the student’s composition is composed of the concepts appearing in the essay and the co-occurrence relationship between the concepts. By utilizing real compositions written by eighth-grade students from a large-scale assessment, the scoring accuracy of the computer evaluation system (named AECC-I: Automated Evaluation for Chinese Compositions—Ideas) is higher than the baselines. The results indicate that the proposed method deepens the construct-relevant coverage of automatic ideas evaluation in compositions and that it can provide constructive feedback for students.

Information

Type
Article
Copyright
© The Author(s), 2021. Published by Cambridge University Press
Figure 0

Table 1. Ideas’ scoring guide for Chinese composition in ATQE

Figure 1

Figure 1. The process of constructing a concept map from a composition.

Figure 2

Figure 2. The composition is transformed into a concept map (Chinese characters have been translated into English in the graph on the right.

Figure 3

Table 2. A framework of features extracted by computer

Figure 4

Table 3. Description of the training set and test set

Figure 5

Table 4. The R2 for multiple linear regression and QWK value (Quadratic-weighted kappa between automated and human scoring) for the feature sets for ideas scoring on the test set

Figure 6

Table 5. The exact (E) agreements and exact-plus-adjacent (E+A) agreements for the six datasets

Figure 7

Figure 3. Comparison of the score distributions between automated and human scoring on two prompts.

Figure 8

Table 6. Correlation and partial correlation (controlling for length) of concept map features with idea scores

Figure 9

Table 7. Factor pattern after Promax rotation for the four factors on prompts 1 and 2 and the factor names are F1 (main idea), F. (local support), F3 (idea development), and F4 (similarity)

Figure 10

Figure 4. Relative feature importance is expressed as a percent of the total weights from regression for the predicted ideas scores.

Figure 11

Table 8. The regression table for the two prompts with the standardized coefficients and p-values

Figure 12

Table 9. Factor scores for simulated compositions

Figure 13

Figure 5. The changes of three-factor scores in different types of simulated composition.

Figure 14

Figure 6. Concept maps made from compositions with idea qualities ranging from excellent to off-topic.

Figure 15

Table B.1.