Hostname: page-component-5db58dd55d-8mwbx Total loading time: 0 Render date: 2026-06-02T13:32:54.708Z Has data issue: false hasContentIssue false

Unfolding the Network of Peer Grades: A Latent Variable Approach

Published online by Cambridge University Press:  16 June 2025

Giuseppe Mignemi
Affiliation:
Department of Decision Sciences, Bocconi University, Milan, Italy Department of Statistics, London School of Economics and Political Science, London, UK
Yunxiao Chen*
Affiliation:
Department of Statistics, London School of Economics and Political Science, London, UK
Irini Moustaki
Affiliation:
Department of Statistics, London School of Economics and Political Science, London, UK
*
Corresponding author: Yunxiao Chen; Email: y.chen186@lse.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Peer grading is an educational system in which students assess each other’s work. It is commonly applied under Massive Open Online Course (MOOC) and offline classroom settings. With this system, instructors receive a reduced grading workload, and students enhance their understanding of course materials by grading others’ work. Peer grading data have a complex dependence structure, for which all the peer grades may be dependent. This complex dependence structure is due to a network structure of peer grading, where each student can be viewed as a vertex of the network, and each peer grade serves as an edge connecting one student as a grader to another student as an examinee. This article introduces a latent variable model framework for analyzing peer grading data and develops a fully Bayesian procedure for its statistical inference. This framework has several advantages. First, when aggregating multiple peer grades, the average score and other simple summary statistics fail to account for grader effects and, thus, can be biased. The proposed approach produces more accurate model parameter estimates and, therefore, more accurate aggregated grades by modeling the heterogeneous grading behavior with latent variables. Second, the proposed method provides a way to assess each student’s performance as a grader, which may be used to identify a pool of reliable graders or generate feedback to help students improve their grading. Third, our model may further provide insights into the peer grading system by answering questions such as whether a student who performs better in coursework also tends to be a more reliable grader. Finally, thanks to the Bayesian approach, uncertainty quantification is straightforward when inferring the student-specific latent variables as well as the structural parameters of the model. The proposed method is applied to two real-world datasets.

Information

Type
Application and Case Studies - Original
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 Network diagram representing the network structure of peer grading data. Note: Each circle is a vertex of the network and represents a student. The arrows are the peer grades, which serve as edges connecting two students; their direction indicates whether the student receives or gives the grade.

Figure 1

Figure 2 Path diagram representing the network structure of peer grading data. Note: The latent variables of four independent students are represented as an example. Students’ grades, reported in the squared box, refer to two assessments, as the subscripts indicated. The curve double-arrows stand for correlation; the straight (solid and dotted) lines represent the effect of the respective latent variable. For the sake of readability, we prefer to adopt the solid lines for the effect of variables referring to the role of the examinee (i.e., $\alpha , \eta ^2$), whereas the dotted lines refer to the effect of the latent variables associated with the role of grader (i.e., $\beta , \phi ^2$).

Figure 2

Figure 3 Path diagram representing the network structure of peer grading data for a single assessment. Note: The latent variables of three independent students are represented as an example. The box indicates the students’ grades for a single assessment. The double arrows represent correlation, while the straight (solid and dotted) lines represent the effect of the respective latent variable. The meaning of the arrows is consistent with those of Figure 2. The solid line represents the effect of the latent variable related to the role of the examinee (i.e., $\theta $). The dotted lines refer to the effect of the latent variables associated with the grader role (i.e., $\beta , \phi ^2$).

Figure 3

Table 1 Multiple-assessments example: Four model specifications are compared using a leave-one-out cross-validation approach

Figure 4

Table 2 Multiple-assessments example: Model M4 estimated structural parameters

Figure 5

Figure 4 Multiple-assessments example: Posterior distribution of the true score of the first assessment (a), mean bias (b), and reliability (c) of student $i=1$. Note: The black dotted lines indicate the $95\%$ quantile-based credible interval and the posterior mean of each estimated parameter.

Figure 6

Table 3 Single assessment example: Four model specifications are compared using a leave-one-out cross-validation approach

Figure 7

Table 4 Single assessment example: Model M4 estimated structural parameters

Figure 8

Figure 5 Single assessment example: Posterior distribution of the true score (a), mean bias (b), reliability (c) of student $i=1$. Note: The black dotted lines indicate the $95\%$ quantile-based credible interval and the posterior mean of each estimated parameter.

Figure 9

Figure B1 Estimated linear latent growth of a random sample of four students. Note: The red and the black dotted lines are the true linear growth and the estimated one, respectively; the red bands are the $95\%$ credible intervals of this trend.

Figure 10

Table C2 Estimated structural parameters

Figure 11

Figure C2 Priors placed under different scenarios on $\sigma _1, \dots ,\sigma _4, \mu _3,\mu _4$. Note: The blue, orange, and green solid lines indicate, respectively, the half-Cauchy, the inverse-gamma and the exponential priors.

Figure 12

Table C3 Root mean square error (RMSE) and mean absolute error (MAE) related to students’ true scores and structural parameters under different scenarios across 10 independent datasets

Supplementary material: File

Mignemi et al. supplementary material

Mignemi et al. supplementary material
Download Mignemi et al. supplementary material(File)
File 218 KB