Modeling Learning in Doubly Multilevel Binary Longitudinal Data Using Generalized Linear Mixed Models: An Application to Measuring and Explaining Word Learning

Sun-Joo Cho; Amanda P. Goodwin

doi:10.1007/s11336-016-9496-y

Modeling Learning in Doubly Multilevel Binary Longitudinal Data Using Generalized Linear Mixed Models: An Application to Measuring and Explaining Word Learning

Published online by Cambridge University Press: 01 January 2025

Sun-Joo Cho and

Amanda P. Goodwin

Show author details

Sun-Joo Cho*: Affiliation:
Vanderbilt University’s Peabody College
Amanda P. Goodwin: Affiliation:
Vanderbilt University’s Peabody College
*: Correspondence should be made to Sun-Joo Cho, Vanderbilt University’s Peabody College, Nashville, TN, USA. Email: sj.cho@vanderbilt.edu. http://www.vanderbilt.edu/psychological_sciences/bio/sun-joo-cho

Article contents

Abstract
Introduction
Measuring and Explaining Word Learning
Modeling
Illustration
Simulation Study
Discussion
Footnotes
References

Rights & Permissions

Abstract

When word learning is supported by instruction in experimental studies for adolescents, word knowledge outcomes tend to be collected from complex data structure, such as multiple aspects of word knowledge, multilevel reader data, multilevel item data, longitudinal design, and multiple groups. This study illustrates how generalized linear mixed models can be used to measure and explain word learning for data having such complexity. Results from this application provide deeper understanding of word knowledge than could be attained from simpler models and show that word knowledge is multidimensional and depends on word characteristics and instructional contexts.

Keywords

binary longitudinal data doubly multilevel data generalized linear mixed models learning psycholinguistic data word learning

Information

Type: Original paper
Information: Psychometrika , Volume 82 , Issue 3 , September 2017 , pp. 846 - 870

DOI: https://doi.org/10.1007/s11336-016-9496-y [Opens in a new window]
Copyright: Copyright © 2016 The Psychometric Society

1. Introduction

Reading theories suggest that the more students know about words, the easier it is to comprehend text (Perfetti, Reference Perfetti2007). Yet word learning is a monumental task because readers can encounter about 180,000 different words in academic texts (Graves, Reference Graves2007). While many of these words are learned through exposure like through speech and reading, many must be taught via vocabulary instruction (Graves, Reference Graves2007). Therefore, word learning is a primary focus for reading researchers.

Word learning is difficult to study because words vary in how easy or hard they are to learn due to different word characteristics (Nagy, Anderson, & Herman, Reference Nagy, Anderson and Herman1987) and readers also differ in how easily they learn words (Perfetti, Reference Perfetti2007). For example, a word like statistician would be harder than a word like mathematician because it is less frequent and is made up of less frequent parts that are less likely to be recognized like statistic versus math (Schreuder & Baayan, Reference Schreuder, Baayan and Feldman1995). Also, as documented by the Matthew effect (Stanovich, Reference Stanovich1986), both words would be harder to learn for a reader with less vocabulary knowledge, like a child who did not know the term math. This is because children learn words by linking new lexical representations to lexical representations already in their lexicon (Perfetti, Reference Perfetti2007). English language learners, who are learning English as a second language and therefore have fewer English lexical representations, may therefore have additional challenges learning English words. Instruction also plays a role, with research suggesting that teaching students about morphological principles, like how to use affixes and root words to figure out the meanings of words, can support word learning (Goodwin & Ahn, Reference Goodwin and Ahn2010; Reference Goodwin and Ahn2013). Overall, reading researchers face multiple sources of variability when considering word learning; and reading researchers need statistical models that can take into account the complex nature of word learning.

In this article, we present a case study that highlights the complexity of word learning. This case study examines the word learning of 202 adolescents who are part of an intervention aimed at building vocabulary knowledge and reading skills. The students differ in word reading, reading comprehension, morphological awareness, and language background and the words differ in frequency, length, and transparency. Also, learning occurs as part of two different instructional conditions. As we will explain in detail in the next section, our case study includes the complex data structures that reading researchers often encounter when studying word learning. These include (a) multiple measures of aspects of word knowledge (e.g., multiple choice, self-report, and related words production), (b) multilevel reader data (e.g., readers nested within teachers), (c) multilevel item data (e.g., items nested within wordsFootnote 1), (d) longitudinal data structure (e.g., pretest and posttest), and (e) multiple groups (e.g., a comparison group vs. an intervention group). We specify and illustrate how generalized linear mixed models can be used to measure and explain word learning for doubly multilevel binary longitudinal data. We use the term doubly multilevel when there is multilevel structure on both the reader and item side which are cross-classified. This multilevel design is also called a multilevel double mixed design in the literature (González, De Boeck, & Tuerlinckx, Reference González, De Boeck and Tuerlinckx2014).

We provide the method and case study as an alternative to prior studies in reading education that have used overall performance on literacy measures to investigate group differences or the effects of word characteristics. For example, Lesaux, Kieffer, Kelley, and Russ Harris (Reference Lesaux, Kieffer, Kelley and Russ Harris2014) examined whether academic vocabulary instruction improved various literacy skills including vocabulary knowledge for adolescent learners. Their study had a similar complex design as described above, but a key difference was that vocabulary knowledge was assessed with standardized and researcher-designed measures that provided information on overall vocabulary performance rather than taking an item-level approach that allowed for differentiation of performance by word and reader characteristics. Because overall performance was used, the study did not show how learning was different for different types of words. Use of our approach would have deepened the understandings derived from this study.

Our study uses generalized linear mixed models, which have been applied to investigate characteristics of readers and items simultaneously in the psycholinguistics literature (e.g., Baayen, Davidson, & Bates, Reference Baayen, Davidson and Bates2008; Cho, Gilbert, & Goodwin, Reference Cho, Gilbert and Goodwin2013; González et al. , Reference González, De Boeck and Tuerlinckx2014). Baayen (Reference Baayen2008, p. 275) listed the advantages of simultaneous modeling of readers and items: first, the simultaneous approach provides insight into the full random-effects structure of readers and items; second, it has slightly greater power of detecting the effects of covariates compared to a separate regression approach by readers or items; third, it can be used to model longitudinal effects and more complex random-effects structures; fourth, it makes it possible to add covariates of readers and items to the model. To our knowledge, however, the generalized linear mixed model approach has not been applied to measure and explain word learning (i.e., posttest scores–pretest scores) when the data involve the complex data structures listed above.

We build on earlier work that shows that generalized linear mixed models for categorical responses are equivalent to item response models (De Boeck & Wilson, Reference De Boeck and Wilson2004; Rijmen, Tuerlinckx, De Boeck, & Kuppens, Reference Rijmen, Tuerlinckx, De Boeck and Kuppens2003; Skrondal & Rabe-Hesketh, Reference Skrondal and Rabe-Hesketh2004). This means that for binary longitudinal data, a generalized linear mixed model can be equivalent to a longitudinal item response model (Embretson, Reference Embretson1991).Footnote 2 Earlier work has extended longitudinal item response models to include a multilevel structure on the reader side (e.g., readers nested within teachers) (Muthén & Asparouhov, Reference Muthén, Asparouhov, van der Linden and Hambleton2013), which is needed to avoid biased parameter estimates and standard errors that occur when ignoring nestings within multilevel item response modeling (e.g., Fox, Reference Fox2010; Kamata, Reference Kamata2001). Prior studies have also modeled multilevel structures on the item side (e.g., items nested within words) to avoid the less accurate item parameters estimates that are obtained when ignoring nestings, but these models have been cross-sectional item response models (Cho et al. , Reference Cho, Gilbert and Goodwin2013; Cho, De Boeck, Embretson, & Rabe-Hesketh, Reference Cho, De Boeck, Embretson and Rabe-Hesketh2014; Geerlings, Glas, & van der Linden, Reference Geerlings, Glas and van der Linden2011; Glas & van der Linden, Reference Glas and van der Linden2003; Sinharay, Johnson, & Williamson, Reference Sinharay, Johnson and Williamson2003) rather than longitudinal. Novel model specification in this paper involves a doubly multilevel structure on both the reader and item sides within a longitudinal item response model. All models are fit using the lme4 package version 0.999375-39 (Bates, Maechler, & Bolker, Reference Bates, Maechler and Bolker2011) in R-2.10.1 (R Development Core Team, 2009).

Hereafter, this paper is organized as follows. In Section 2, the data complexity in measuring and explaining word learning is described and empirical research questions are presented. In Section 3, generalized linear mixed models are detailed to answer the research questions, and intraclass correlations are derived based on the model. In Section 4, the data are analyzed using the specified models in Section 3. In Section 5, a simulation study is implemented to show the parameter recovery of the model similar to the model applied to empirical data and to show consequences of a doubly multilevel structure. We end with summary and discussion in Section 6.

2. Measuring and Explaining Word Learning

2.1. Multidimensional Word Knowledge

There are multiple aspects of word knowledge to consider when assessing how well a reader knows a word (Pearson, Hiebert, & Kamil, Reference Pearson, Hiebert and Kamil2007; Perfetti, Reference Perfetti2007). For example, knowledge of a word’s definition, use, connotations, levels of abstractness, multiple meanings, and links to other related words are a few aspects to consider. Our modeling takes into consideration the multidimensional nature of word knowledge by considering appropriate measures of these multiple aspects, specifically multiple-choice synonym knowledge, self-report of meaning knowledge, and consideration of words within a morphological families through production of related words.

2.2. Doubly Multilevel Data

Within a single classroom, there is great variability of student skills and needs. Students are often grouped into smaller groups where instruction of specific skills or words occurs. Classroom teachers or specialized individuals such as reading specialists or second language certified teachers teach these different groups. As such, a multilevel structure on the reader side arises from the fact that teaching words commonly takes place in the context of small groups that are nested within a classroom teacher that is nested within a school. A multilevel structure on the item side arises from the fact that word knowledge is multidimensional (see above). Therefore, multiple aspects of word knowledge need to be assessed for the same word: items assessing each aspect are nested within words. Because research indicates word learning is incremental (Nagy et al. , Reference Nagy, Anderson and Herman1987), different readers, words, and instructional characteristics could lead to the development of different aspects of word knowledge, making it so that the multilevel structure on the item side is particularly important to include in studies of word learning.

2.3. Longitudinal and Multigroup Data from a Pretest–Posttest Study Design

Word learning often occurs in the context of different types of instruction that is differentially effective (Graves, Reference Graves2007). For example, recent standards movements like the Common Core (CCSS, National Governors Association Center for Best Practices & Council of Chief State School, 2010) suggest different instructional strategies to support word learning and help adolescents better comprehend texts. Experimental studies are needed to determine whether a word learning intervention shows superiority over instruction-as-usual controls and/or other research-based comparison instruction. Therefore, when considering instructional characteristics that contribute to word learning, both longitudinal and multigroup structures must be accounted for within the data. Longitudinal data are needed because differences between pretest and posttest word knowledge must be explored. Multigroup structures are also needed because word learning in the two different instructional conditions must be compared to determine which instruction is most effective at supporting word learning.

2.4. Word-Specific Word Knowledge

When studying word learning, it is important to consider what an individual knows about a specific word rather than about words in general. For example, a reader may be familiar with the definition of effortlessly but not know how to use the word or not know a specific meaning that is being conveyed within the text. For example, the reader may be familiar with effortlessly completing homework, but may not realize that a dictator could effortlessly torture individuals, and that meaning may come up when reading history texts. Word knowledge is word-specific in that when a reader is applying word knowledge to oral or reading comprehension, the reader needs to be able to access information about that particular word, otherwise comprehension challenges ensue. Perfetti’s (Reference Perfetti2007) lexical quality hypothesis emphasizes the importance of considering word-specific knowledge when investigating reading. This is because while general literacy skills like word reading are important, reading comprehension is dependent on quick and easy accessing of word-specific information, like accessing the meaning of effortlessly as related to the context it is being used within the text.

2.5. Empirical Research Questions

The current study explores word learning between pretest and posttest for two instructional groups where readers are nested within teachers and items are nested within words. We are guided by the following research questions:

(a) What is the dimensionality structure of word knowledge and if there are different dimensions, does significant word learning occur for each aspect of word knowledge across the two hours of instruction? To explore this, we will look specifically at
1. (i) the relationship between different measures of aspects of word knowledge (multiple-choice, self-report, and production of morphological relatives) for a given word, and
2. (ii) the learning for each aspect of word knowledge across the two hours of instruction examining mean and variability in word learning for each aspect at posttest.
(b) How is word learning affected by word characteristics and instructional contexts? Here we will look specifically at
1. (i) how word characteristics like derived-word frequency, root word frequency, opaqueness, number of morphemes, number of morphological relatives, and affix frequency explain variance in word easiness across words, and
2. (ii) whether significant group differences exist between readers in the morphological instruction in combination with comprehension strategy instruction condition (i.e., intervention group) versus readers in the comprehension strategy instruction alone condition (i.e., comparison group), controlling for readers’ background. Our approach unites earlier research that has looked at how single aspects of word knowledge are learned and which has explored either instructional effects or word characteristics that affect word learning. As such, it provides a more comprehensive understanding of word learning.

3. Modeling

3.1. Model Specification

In this section, we describe the generalized mixed effect models with data representation, an equation, and a diagram. To frame this data structure within the multilevel literature (e.g., Bryk & Raudenbush, Reference Bryk and Raudenbush1992, Ch. 8), item responses at Level 1 are cross-classified with readers and items at Level 2 and are cross-classified with teachers and words at Level 3. Readers at Level 2 are nested within teachers at Level 3 on the reader side, whereas items at Level 2 are nested within words at Level 3 on the item side. In the model specification, it is assumed that the same three measures were administered at pretest and posttest and an item is loaded on one measure (as suggested by the empirical illustration).

Figure 1.

Data representation for the binary longitudinal item responses ( $y$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf {y}}$$\end{document} ) with doubly multilevel structure in the exemplar case when $K = 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K=2$$\end{document} , $J = 6$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J=6$$\end{document} , $G = 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$G=2$$\end{document} , $I = 6$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I=6$$\end{document} , $T = 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$T=2$$\end{document} , and $D = 3$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D=3$$\end{document} . k is an index for a teacher ( $k = 1, 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=1, 2$$\end{document} ); j is an index for a reader ( $j = 1, 2, 3, 4, 5, 6$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$j=1, 2, 3, 4, 5, 6$$\end{document} ); g is an index for a word ( $g = 1, 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$g=1, 2$$\end{document} ); i is an index for an item ( $i = 1, 2, 3, 4, 5, 6$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i=1, 2, 3, 4, 5, 6$$\end{document} ); t is an index for a time point ( $t = 1, 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t=1, 2$$\end{document} ); d is an index for a measure ( $d = 1, 2, 3$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d=1, 2, 3$$\end{document} ); k[j] indicates that a reader j is nested within a teacher k; and g[i] indicates that an item i is nested with a word g.

To show data complexity, data representation of binary longitudinal item responses ( $y$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf {y}}$$\end{document} ) with doubly multilevel structure is presented in Figure 1.Footnote 3 Denote a binary item response by $y_{k [j] g [i] t d}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_{k[j]g[i]td}$$\end{document} for a teacher k ( $k = 1, \dots, K$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=1, \ldots , K$$\end{document} ), a reader j ( $j = 1, \dots, J$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$j=1, \ldots , J$$\end{document} ), a word g ( $g = 1, \dots, G$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$g=1, \ldots , G$$\end{document} ), an item i ( $i = 1, \dots, I$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i=1, \ldots , I$$\end{document} ), a time point t ( $t = 1, \dots, T$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t=1, \ldots , T$$\end{document} ), and a measure (of an aspect of word knowledge) d ( $d = 1, \dots, D$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d=1, \ldots , D$$\end{document} ). The subscript k[j] indicates that a reader j is nested within a teacher k and the subscript g[i] indicates that an item i is nested with a word g. For simplicity, the representation in Figure 1 shows an exemplar case of $K = 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K=2$$\end{document} , $J = 6$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J=6$$\end{document} , $G = 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$G=2$$\end{document} , $I = 6$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I=6$$\end{document} , $T = 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$T=2$$\end{document} , and $D = 3$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D=3$$\end{document} . Furthermore, it is assumed that the same item set was given across time points. In this example, there are 72 (= 6 readers $\times$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} 6 items $\times$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} 2 time points) item responses, $y$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf {y}}$$\end{document} . Possible dependency in the item responses are due to the doubly multilevel structure (i.e., teacher clustering and word clustering), measure clustering, and time points (i.e., repeated measures).

In the specification of the generalized mixed effect model for binary longitudinal data with doubly multilevel structure, a binary item response ( $y_{k [j] g [i] t d}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_{k[j]g[i]td}$$\end{document} ) has an independent Bernoulli distribution with a mean $P (y_{k [j] g [i] t d} = 1)$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P(y_{k[j]g[i]td}=1)$$\end{document} conditional on the random effects we specify below. The linear predictor is

(1)

\begin{matrix} η_{k [j] g [i] t d} = X_{t d}^{'} β_{t d} + S_{t d}^{' (2)} s_{k [j] t d}^{(2)} + S_{t d}^{' (3)} s_{k t d}^{(3)} + W_{t d}^{' (2)} w_{g [i] t d}^{(2)} + W_{t d}^{' (3)} w_{g t}^{(3)}, \end{matrix}

where

• the superscript (2) refers to Level 2,
• the superscript (3) refers to Level 3,
• $X_{t d}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_{td}$$\end{document} is the design matrix for fixed effects,
• $S_{t d}^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S_{td}^{(2)}$$\end{document} is the design matrix for random reader effects,
• $S_{t d}^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S_{td}^{(3)}$$\end{document} is the design matrix for random teacher effects,
• $W_{t d}^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$W_{td}^{(2)}$$\end{document} is the design matrix for random item effects,
• $W_{t d}^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$W_{td}^{(3)}$$\end{document} is the design matrix for random word effects,
• $β_{t d}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta _{td}$$\end{document} is a fixed effect; $β_{([T D] \times 1)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{\beta }}_{([TD] \times 1)}$$\end{document} is the vector of fixed effects,
• $s_{k [j] t d}^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]td}^{(2)}$$\end{document} is a random reader effect for a time point t and a measure d at the reader level; $s_{k [j] ([T D] \times 1)}^{(2)} = {[s_{k [j] 11}^{(2)}, \dots, s_{k [j] t d}^{(2)}, \dots, s_{k [j] T D}^{(2)}]}^{'} \sim M N (0_{([T D] \times 1)}, Σ_{2 ([T D] \times [T D])})$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbf{s}_{k[j]([TD] \times 1)}^{(2)}=[s_{k[j]11}^{(2)}, \ldots , s_{k[j]td}^{(2)}, \ldots , s_{k[j]TD}^{(2)}]' \sim MN(\mathbf{0}_{([TD] \times 1)}, \Sigma _{2([TD] \times [TD])})$$\end{document} ,
• $s_{k t d}^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{ktd}^{(3)}$$\end{document} is a random teacher effect for a time point t and a measure d; $s_{k ([T D] \times 1)}^{(3)} = {[s_{k 11}^{(3)}, \dots, s_{k t d}^{(3)}, \dots, s_{k T D}^{(3)}]}^{'} \sim M N (0_{([T D] \times 1)}, Σ_{3 ([T D] \times [T D])})$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbf{s}_{k([TD] \times 1)}^{(3)}=[s_{k11}^{(3)}, \ldots , s_{ktd}^{(3)}, \ldots , s_{kTD}^{(3)}]' \sim MN(\mathbf{0}_{([TD] \times 1)}, \Sigma _{3([TD] \times [TD])})$$\end{document} ,
• $w_{g [i] t d}^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]td}^{(2)}$$\end{document} is a random item effect for a time point t and a measure d; $w_{g [i] t d}^{(2)} \sim N (0, σ_{3 t d}^{2})$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]td}^{(2)} \sim N(0,\sigma _{3td}^{2})$$\end{document} , and
• $w_{g t}^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{gt}^{(3)}$$\end{document} is a random word effect for a time point t; and $w_{t g}^{(3)} \sim N (0, σ_{4 t}^{2})$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{tg}^{(3)} \sim N(0, \sigma _{4t}^{2})$$\end{document} .

Variance–covariance matrices of the random reader effects and the random teacher effects (i.e., $Σ_{2 ([T D] \times [T D])}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Sigma _{2([TD] \times [TD])}$$\end{document} and $Σ_{3 ([T D] \times [T D])}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Sigma _{3([TD] \times [TD])}$$\end{document} , respectively) are unstructured and should be positive definite to ensure the inverse of the matrices (i.e., the computation formulas in lme4) exists (Bates, Reference Bates2010).

The logit link function is selected in this study. With the specification above, the generalized mixed models for doubly multilevel structure can be described as follows:

(2)

\begin{matrix} logit [P (y_{k [j] g [i] t d} = & 1 | s_{k [j] t d}^{(2)}, s_{k t d}^{(3)}, w_{g [i] t d}^{(2)}, w_{g t}^{(3)})] \\ = & X_{t d}^{'} β_{t d} + S_{t d}^{' (2)} s_{k [j] t d}^{(2)} + S_{t d}^{' (3)} s_{k t d}^{(3)} + W_{t d}^{' (2)} w_{g [i] t d}^{(2)} + W_{t d}^{' (3)} w_{g t}^{(3)} . \end{matrix}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathrm{logit}\Bigg [P\Big (y_{k[j]g[i]td}= & {} 1|s_{k[j]td}^{(2)}, s_{ktd}^{(3)},w_{g[i]td}^{(2)},w_{gt}^{(3)}\Big )\Bigg ]\nonumber \\= & {} X_{td}^{\prime }\beta _{td} + S_{td}^{\prime (2)}s_{k[j]td}^{(2)} + S_{td}^{\prime (3)}s_{ktd}^{(3)} + W_{td}^{\prime (2)}w_{g[i]td}^{(2)} + W_{td}^{\prime (3)}w_{gt}^{(3)}. \end{aligned}$$\end{document}

Initial status at pretest and learning at posttest were modeled in our specification (Embretson, Reference Embretson1991). To measure the initial status and learning, a design matrix, as presented below, is specified for each measure. In addition, the same design matrix is imposed on $s_{k [j]}^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbf{s}_{k[j]}^{(2)}$$\end{document} and $s_{k}^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbf{s}_{k}^{(3)}$$\end{document} , respectively, which indicates that the initial status and learning are measured at each level on the person side. An element of the matrix specified below is 1 if a random effect is modeled for a measure in a time point and 0 otherwise. For three measures ( $D = 3$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D=3$$\end{document} , multiple-choice response [MC], self-report [SR], and production of morphological relatives [MR]) and two time points ( $T = 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$T=2$$\end{document} , pretest and posttest), $S^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{(2)}$$\end{document} and $S^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{(3)}$$\end{document} can be specified as follows:

\begin{matrix} (\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 \\ 1 & 0 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 & 1 & 0 \\ 0 & 0 & 1 & 0 & 0 & 1 \end{matrix}), \end{matrix}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \left( \begin{array}{cccccc} 1&{}\quad 0&{}\quad 0&{}\quad 0&{}\quad 0&{}\quad 0\\ 0&{}\quad 1&{}\quad 0&{}\quad 0&{}\quad 0&{}\quad 0\\ 0&{}\quad 0&{}\quad 1&{}\quad 0&{}\quad 0&{}\quad 0\\ 1&{}\quad 0&{}\quad 0&{}\quad 1&{}\quad 0&{}\quad 0\\ 0&{}\quad 1&{}\quad 0&{}\quad 0&{}\quad 1&{}\quad 0\\ 0&{}\quad 0&{}\quad 1&{}\quad 0&{}\quad 0&{}\quad 1\\ \end{array} \right) , \end{aligned}$$\end{document}

where the first column is for initial status on MC, the second column is for initial status on SR, the third column is for initial status on MR, the fourth column is for learning on MC, the fifth column is for learning on SR, and the sixth column is for learning on MR.

For unconditional models (without any covariates), X is a full copy of either $S^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{(2)}$$\end{document} or $S^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{(3)}$$\end{document} . In addition, $β_{([T D] \times 1)} = {[β_{11}, \dots, β_{t d}, \dots, β_{T D}]}^{'}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{\beta }}_{([TD] \times 1)}=[\beta _{11}, \ldots , \beta _{td}, \ldots , \beta _{TD}]'$$\end{document} is the vector of fixed intercepts, a logit for the probability of a correct response of an ‘average’ reader on an ‘average’ item with a logit link function. For conditional models (with readers and word covariates), additional columns and rows are added to X.

To measure learning, the pretest and posttest must have the same scale, which is called the measurement invariance assumption (e.g., Meade, Lautenschlager, & Hecht, Reference Meade, Lautenschlager and Hecht2005). The measurement invariance assumption can be met by having the same item effects (e.g., item easiness or item difficulty) across time points. Specifically, the item parameters are redefined with the measurement invariance assumption as follows:

(3)

\begin{matrix} w_{g [i] d}^{(2)} \sim N (0, σ_{3 d}^{2}) \end{matrix}

for $d = 1, \dots, D$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d=1, \ldots , D$$\end{document} and

(4)

\begin{matrix} w_{g}^{(3)} \sim N (0, σ_{4}^{2}) . \end{matrix}

The design matrix for the random item effects, $W^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$W^{(2)}$$\end{document} , can be specified as follows:

\begin{matrix} (\begin{matrix} 1 & 0 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 & 1 & 0 \\ 0 & 0 & 1 & 0 & 0 & 1 \\ 1 & 0 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 & 1 & 0 \\ 0 & 0 & 1 & 0 & 0 & 1 \end{matrix}), \end{matrix}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \left( \begin{array}{cccccc} 1&{}\quad 0&{}\quad 0&{}\quad 1&{}\quad 0&{}\quad 0\\ 0&{}\quad 1&{}\quad 0&{}\quad 0&{}\quad 1&{}\quad 0\\ 0&{}\quad 0&{}\quad 1&{}\quad 0&{}\quad 0&{}\quad 1\\ 1&{}\quad 0&{}\quad 0&{}\quad 1&{}\quad 0&{}\quad 0\\ 0&{}\quad 1&{}\quad 0&{}\quad 0&{}\quad 1&{}\quad 0\\ 0&{}\quad 0&{}\quad 1&{}\quad 0&{}\quad 0&{}\quad 1\\ \end{array} \right) , \end{aligned}$$\end{document}

where the first column is for MC at pretest, the second column is for SR at pretest, the third column is for MR at pretest, the fourth column is for MC at posttest, the fifth column is for SR at posttest, and the sixth column is for MR at posttest. In addition, the design matrix for the random word effect, $W^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$W^{(3)}$$\end{document} , is ( $[T D] \times [T D]$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[TD] \times [TD]$$\end{document} ) unit matrix with the measurement invariance assumption. In this study, the measurement invariance assumption is first tested and then assumed in parameter estimation in the illustration. In the lmer function, the time-invariant random item and word effects can be specified by having the same item and word indicator variables across time points involved in estimating $σ_{3 d}^{2}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{3d}^{2}$$\end{document} and $σ_{4}^{2}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{4}^{2}$$\end{document} .

Figure 2.

A diagram of a generalized linear mixed model for doubly multilevel binary longitudinal data. TRT treatment variable, word word characteristic variables, MC item responses for multiple-choice, SR item responses for self-report, MR item responses for production of morphological relatives, $s_{k [j] 11}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]11}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random reader effect for MC at pretest, $s_{k [j] 12}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]12}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random reader effect for SR at pretest, $s_{k [j] 13}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]13}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random reader effect for MR at pretest, $s_{k [j] 21}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]21}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random reader effect for MC at posttest, $s_{k [j] 22}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]22}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random reader effect for SR at posttest, $s_{k [j] 23}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]23}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random reader effect for MR at posttest, $s_{k 11}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k11}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random teacher effect for MC at pretest, $s_{k 12}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k12}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random teacher effect for SR at pretest, $s_{k 13}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k13}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random teacher effect for MR at pretest, $s_{k 21}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k21}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random teacher effect for MC at posttest, $s_{k 22}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k22}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random teacher effect for SR at posttest, $s_{k 23}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k23}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random teacher effect for MR at posttest, $w_{g [i] 1}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]1}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random item effect for MC, $w_{g [i] 2}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]2}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random item effect for SR, $w_{g [i] 3}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]3}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random item effect for MR, and $w_{g}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g}$$\end{document} $a$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {a}$$\end{document} random item-group effect. All latent variables at the reader level and at the teacher level are correlated at each level.

The doubly multilevel item response data structure is presented in Figure 2 related to the example of measuring and explaining word learning. Rectangles in the figure represent (binary) item responses for the three measures (i.e., MC, SR, and MR) and circles represent random effects. Dependency in item responses from J ( $j = 1, \dots, J$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$j=1, \ldots , J$$\end{document} ) readers nested within K ( $k = 1, \dots, K$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=1, \ldots , K$$\end{document} ) teachers, and I items ( $i = 1, \dots, I$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i=1, \ldots , I$$\end{document} ) nested within G words ( $g = 1, \dots, G$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$g=1, \ldots , G$$\end{document} ) for D measures ( $d = 1, 2, 3$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d=1, 2, 3$$\end{document} ) at T ( $t = 1, 2$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t=1, 2$$\end{document} ) time points (i.e., $J \times I \times T = J I T$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J \times I \times T = JIT$$\end{document} item responses) is explained by 16 random effects: 6 reader-level random effects ( $s_{k [j] 11}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]11}$$\end{document} , $s_{k [j] 12}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]12}$$\end{document} , $s_{k [j] 13}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]13}$$\end{document} , $s_{k [j] 21}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]21}$$\end{document} , $s_{k [j] 22}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]22}$$\end{document} , $s_{k [j] 23}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]23}$$\end{document} ), 6 teacher-level random effects ( $s_{k 11}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k11}$$\end{document} , $s_{k 12}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k12}$$\end{document} , $s_{k 13}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k13}$$\end{document} , $s_{k 21}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k21}$$\end{document} , $s_{k 22}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k22}$$\end{document} , $s_{k 23}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k23}$$\end{document} ), 3 item-level random effects ( $w_{g [i] 1}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]1}$$\end{document} , $w_{g [i] 2}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]2}$$\end{document} , $w_{g [i] 3}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]3}$$\end{document} ), and 1 item group-level random effect ( $w_{g}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g}$$\end{document} ). On the reader side, the total random effect for each measure is decomposed into a reader-level random effect and a teacher-level random effect at each time point (at pretest, $s_{k [j] 11} + s_{k 11}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]11}+s_{k11}$$\end{document} for MC, $s_{k [j] 12} + s_{k 12}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]12}+s_{k12}$$\end{document} for SR, and $s_{k [j] 13} + s_{k 13}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]13}+s_{k13}$$\end{document} for MR; at posttest, $s_{k [j] 21} + s_{k 21}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]21}+s_{k21}$$\end{document} for MC, $s_{k [j] 22} + s_{k 22}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]22}+s_{k22}$$\end{document} for SR, and $s_{k [j] 23} + s_{k 23}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]23}+s_{k23}$$\end{document} for MR). At both the reader and teacher level, the six random effects are correlated (shown in the same rectangle) because the same readers responded to the three measures repeatedly (i.e., at pretest and posttest). On the item side, there are three item random effects (denoted by $w_{g [i] 1}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]1}$$\end{document} , $w_{g [i] 2}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]2}$$\end{document} , and $w_{g [i] 3}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]3}$$\end{document} ) nested within a word random effect (denoted by $w_{g}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g}$$\end{document} ) modeled to explain dependency in item responses from items nested within words at both time points with measurement invariance assumed over time points. Random effects of readers and items are crossed random effects at Level 2 and random effects of teachers and words are crossed at Level 3. A reader characteristic such as an instructional treatment variable (denoted by TRT in the figure) can be used to explain a reader-level random effect for each measure at each time point (possibly controlling for other reader characteristics). Word characteristics (denoted by word in the figure) can be used to explain variability in item easiness across words.

3.2. Intraclass Correlations

Intraclass correlations (ICC) (denoted by $ρ$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho $$\end{document} ) can be calculated to show dependency in latent item responses among readers and items due to teachers and words, respectively. Let there be a latent item response $y_{k [j] g [i] t d}^{*}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_{k[j]g[i]td}^{*}$$\end{document} such that the observed item response is $y_{k [j] g [i] t d} = 1$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_{k[j]g[i]td}=1$$\end{document} if $y_{k [j] g [i] t d}^{*} \geq 0$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_{k[j]g[i]td}^{*} \ge 0$$\end{document} , and $y_{k [j] g [i] t d} = 0$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_{k[j]g[i]td}=0$$\end{document} otherwise (e.g., Milanzi, Molenberghs, Alonso, Verbeke, & De Boeck, Reference Milanzi, Molenberghs, Alonso, Verbeke and De Boeck2015). With the measurement invariance assumption for item parameters (i.e., $w_{g [i] d}^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g[i]d}^{(2)}$$\end{document} and $w_{g}^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{g}^{(3)}$$\end{document} ), it is assumed that

(5)

\begin{matrix} y_{k [j] g [i] t d}^{*} = X_{t d}^{'} β_{t d} + S_{t d}^{' (2)} s_{k [j] t d}^{(2)} + S_{t d}^{' (3)} s_{k t d}^{(3)} + W_{t d}^{' (2)} w_{g [i] d}^{(2)} + W_{t d}^{' (3)} w_{g}^{(3)} + ϵ_{k [j] g [i] t d} . \end{matrix}

In addition, the error, $ϵ_{k [j] g [i] t d}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\epsilon _{k[j]g[i]td}$$\end{document} , is assumed to follow a logistic distribution (mean $= 0$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=0$$\end{document} , variance $= \frac{π^{2}}{3}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=\frac{\pi ^{2}}{3}$$\end{document} ) with a logit link.

The ICC due to teachers for each measure d at a time point t ( $ρ {(R)}_{t d}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (R)_{td}$$\end{document} ) can be defined as the correlation among latent item responses for the same teacher k, but for different readers j and $j^{'}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$j'$$\end{document} . It is conditional on random effects at the item and word levels, and random effects for the other measures and other time points than a time point t and a measure d (denoted by $s_{k [j] t^{'} d^{'}}^{(2)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k[j]t'd'}^{(2)}$$\end{document} and $s_{k t^{'} d^{'}}^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{kt'd'}^{(3)}$$\end{document} ):

(6)

\begin{matrix} ρ {(R)}_{t d} = & C o r r (y_{k [j] g [i] t d}^{*}, y_{k [j^{'}] g [i] t d}^{*}) \\ = & \frac{C o v (y_{k [j] g [i] t d}^{*}, y_{k [j^{'}] g [i] t d}^{*} | s_{k [j] t^{'} d^{'}}^{(2)}, s_{k t^{'} d^{'}}^{(3)}, w_{g [i]}^{(2)}, w_{g}^{(3)})}{\sqrt{V a r (y_{k [j] g [i] t d}^{*} | s_{k [j] t^{'} d^{'}}^{(2)}, s_{k t^{'} d^{'}}^{(3)}, w_{g [i]}^{(2)}, w_{g}^{(3)})} \cdot \sqrt{V a r (y_{k [j^{'}] g [i] t d}^{*} | s_{k [j] t^{'} d^{'}}^{(2)}, s_{k t^{'} d^{'}}^{(3)}, w_{g [i]}^{(2)}, w_{g}^{(3)})}} \\ = & \frac{σ_{3 t d}^{2}}{σ_{2 t d}^{2} + σ_{3 t d}^{2} + \frac{π^{2}}{3}}, \end{matrix}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \rho (R)_{td}= & {} Corr\big (y_{k[j]g[i]td}^{*},y_{k[j']g[i]td}^{*}\big ) \nonumber \\= & {} \frac{Cov\Big (y_{k[j]g[i]td}^{*},y_{k[j']g[i]td}^{*}|s_{k[j]t'd'}^{(2)},s_{kt'd'}^{(3)},\mathbf{w}_{g[i]}^{(2)}, w_{g}^{(3)}\Big )}{\sqrt{Var\Big (y_{k[j]g[i]td}^{*}|s_{k[j]t'd'}^{(2)},s_{kt'd'}^{(3)},\mathbf{w}_{g[i]}^{(2)},w_{g}^{(3)}\Big )} \cdot \sqrt{Var\Big (y_{k[j']g[i]td}^{*}|s_{k[j]t'd'}^{(2)},s_{kt'd'}^{(3)},\mathbf{w}_{g[i]}^{(2)},w_{g}^{(3)}\Big )}} \nonumber \\= & {} \frac{\sigma _{3td}^{2}}{\sigma _{2td}^{2}+\sigma _{3td}^{2}+\frac{\pi ^{2}}{3}}, \end{aligned}$$\end{document}

where $σ_{2 t d}^{2}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{2td}^{2}$$\end{document} and $σ_{3 t d}^{2}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{3td}^{2}$$\end{document} are the variances of $Σ_{2}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Sigma _{2}$$\end{document} and $Σ_{3}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Sigma _{3}$$\end{document} , respectively. The $ρ {(R)}_{t d}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (R)_{td}$$\end{document} can also be interpreted as the proportion of the variance of the latent item response $y^{*}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y^{*}$$\end{document} for a measure d at a time point t that is accounted for by the teacher clustering.

As in our empirical data, each item is assumed to be related to only one measure d in the calculation the ICC due to words. ICC due to words ( $ρ (I)$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (I)$$\end{document} ) can be defined as the correlation among latent responses for the same word g, but for different items i and $i^{'}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i'$$\end{document} , conditional on random effects at the reader and teacher levels:

(7)

\begin{matrix} ρ (I) = & C o r r (y_{k [j] g [i] t d}^{*}, y_{k [j] g [i^{'}] t d}^{*}) \\ = & \frac{C o v (y_{k [j] g [i] t d}^{*}, y_{k [j] g [i^{'}] t d}^{*} | s_{k [j]}^{(2)}, s_{k}^{(3)})}{\sqrt{V a r (y_{k [j] g [i] t d}^{*} | s_{k [j]}^{(2)}, s_{k}^{(3)})} \cdot \sqrt{V a r (y_{k [j] g [i^{'}] t d}^{*} | s_{k [j]}^{(2)}, s_{k}^{(3)})}} \\ = & \frac{σ_{4}^{2}}{\sum_{d = 1}^{D} σ_{3 d}^{2} + σ_{4}^{2} + \frac{π^{2}}{3}} . \end{matrix}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \rho (I)= & {} Corr\big (y_{k[j]g[i]td}^{*},y_{k[j]g[i']td}^{*}\big ) \nonumber \\= & {} \frac{Cov\Big (y_{k[j]g[i]td}^{*},y_{k[j]g[i']td}^{*}|\mathbf{s}_{k[j]}^{(2)},\mathbf{s}_{k}^{(3)}\Big )}{\sqrt{Var \Big (y_{k[j]g[i]td}^{*}|\mathbf{s}_{k[j]}^{(2)},\mathbf{s}_{k}^{(3)}\Big )} \cdot \sqrt{Var \Big (y_{k[j]g[i']td}^{*}|\mathbf{s}_{k[j]}^{(2)},\mathbf{s}_{k}^{(3)}\Big )}} \nonumber \\= & {} \frac{\sigma _{4}^{2}}{\sum _{d=1}^{D}\sigma _{3d}^{2} +\sigma _{4}^{2}+\frac{\pi ^{2}}{3}}. \end{aligned}$$\end{document}

Also, the $ρ (I)$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (I)$$\end{document} can be seen as the explained variance of the latent item response $y^{*}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y^{*}$$\end{document} that is explained by the word clustering.

4. Illustration

Our illustration applies the generalized mixed effect model for binary longitudinal data with doubly multilevel structure to the word learning of 202 adolescents. The lme4 syntax used in the application is available from the first author upon request.

4.1. Data Description

Table 1 shows descriptive information for all variables described in detail below. This study is an extension of Goodwin (in press) and shares some overlapping data.

Table 1.

Descriptive statistics.

4.1.1. Sample

Our sample consisted of 202 students (118 fifth-grade; 84 sixth-grade) that were ethnically diverse (113 Black, 47 Hispanic, 37 Caucasian, 5 Asian), spoke a range of languages at home (128 native English speakers, 28 English language learners [ELL], 46 language minority youth [LMY]), and mostly lived in poverty (173 receiving free and reduced lunch services). Almost half (86) struggled in reading as shown by performing below basic on the state standardized reading test or performing below the 25th percentile on the standardized reading comprehension pretest with 14 receiving special education supports. All demographics were similar between the intervention and comparison group except that the intervention group had significantly more language minority youth than the comparison group.

The study took place within four schools (School A $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 13; B $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 35; C $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 98; D $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 56) in the southeastern United States. Schools A and D were traditional middle schools and Schools B and C were a STEM magnet and charter school, respectively. Throughout the larger school day, students were learning from 21 teachers who ranged in experience levels. For the purposes of this study, 10 tutors that were research team members (i.e., not the students’ typical teachers) delivered intervention and comparison instruction (i.e., four 30 min sessions). The three tutors who taught 80 % of students were certified teachers with a combined 30 years of teaching experience. The other research team members were education students with experience practice teaching. The tutors led the intervention using scripts and intervention-specific materials that they had been trained to use via two 1.5-hour sessions. Teacher cluster size (i.e., the number of students for each teacher) ranged from 1 to 35 with a median cluster size of 7. Tutor cluster size (i.e., the number of students for each tutor) with the median cluster size being 7.5. High fidelity to the intervention was indicated via a survey that awarded one point for completion of each main intervention component (mean score 5.32 out of 6 points, SD $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.75). Cronbach’s alpha, .85, was shown for tutor report and the report of a second coder regarding fidelity.

4.1.2. Instructional Intervention

Instruction occurred in four thirty-min small group sessions where students read two grade-level, content-specific texts and were taught comprehension strategies. Students were assessed before and after instruction, and groups had the same students and instructor throughout the study. In both the intervention and comparison groups, word learning was facilitated by underlining challenging words and discussing those words’ definitions and meanings. The main difference was that intervention students were taught to use units of meaning like root words or affixes to figure out the meaning of unknown words. Intervention students were taught to box and define the parts of the word that were known, then to sum the meanings together to determine the meaning of the larger word. They then placed that meaning in the context of the story. For more information, see Goodwin and Perkins (Reference Goodwin and Perkins2015), which details the intervention for educators.

4.1.3. Measures

Word knowledge was assessed by three researcher-created measures (i.e., multiple-choice, self-report, and depth shown by producing related words) at each time point. Different aspects of the same 16 words (listed in Table 5) were assessed by each measure and the same 48 items (= 16 words $\times$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} 3 measures) were used at pretest and posttest. The words were identified from the two texts read during instruction such that those words represented the range of frequencies of all morphologically complex words in the two texts. All measures were presented in writing and read aloud to minimize confounds with decoding. Each word was assessed as described below.

Breadth/Multiple-Choice (MC) Measure Readers were presented with an underlined word within a short phrase without context clues. Readers then circled the word amongst five choices (one of which presented the option to choose ‘I don’t know’) that had the closest meaning to the target word. Answer choices accounted for context (i.e., meaning in the story), structure (i.e., words with the same suffix), and familiarity. This format was similar to standardized reading vocabulary measures. Items were scored as correct (score of 1) or incorrect (score of 0). Reliability in the form of Cronbach’s alpha was 0.71 at pretest and 0.68 at posttest.

Breadth/Self-report (SR) Measure Readers were asked to rate their knowledge of each word (no knowledge, some knowledge, full knowledge) and an example of the word big and classifications of knowledge were discussed. Items were scored as incorrect (value of 0, no knowledge or some knowledge) or correct (value of 1, full knowledge). Reliability in the form of Cronbach’s alpha for all items was calculated as 0.85 at pretest and at posttest.

Depth/Morphological Related Words Production (MR) Measure Readers were presented with the target word and asked to write related words in the space provided. It was explained to readers that related words were words with the same root or prefix or suffix as in the example of big where related words were bigger and biggest but not large, huge, or bug (which shared either meaning or overlap in spelling, but not morphological overlap). Responses were scored as incorrect (value of 0) if no accurate related words were provided or correct (value of 1) if one or more accurate related words (including derivations, inflections, and compounds) were provided. This measure represents a reader’s understanding of the target word within its larger morphological family. To minimize the role of spelling, all responses that were phonologically possible but spelled incorrectly were counted as correct. Reliability in the form of Cronbach’s alpha was calculated as 0.94 at pretest and 0.96 at posttest.

4.2. Word Covariates

Word characteristics served as time-invariant word covariates. Relevant word characteristics were identified from the literature (Carlisle & Katz, Reference Carlisle and Katz2006; Goodwin, Gilbert, Cho, & Kearns, Reference Goodwin, Gilbert, Cho and Kearns2014; Nagy et al. , Reference Nagy, Anderson and Herman1987) and data were secured for the 16 target words described in Table 5.

Morphologically Complex Word Frequency (MWF) The Standard Frequency Index (SFI) from the Educators Word Frequency Guide (Zeno, Ivens, Millard, & Duvvuri, Reference Zeno, Ivens, Millard and Duvvuri1995) was used to represent how often a word is used within written academic texts (in this case, within a corpus of 60,527 academic texts). SFI values can be interpreted where 50 % of the words in sixth-grade texts have SFI values of 32 or greater. These values are based on D, which is a measure of dispersion of a word across subject areas. SFI values are logarithmic transformations of U, which is the frequency of the type per million tokens weighted by D.

Root Word Frequency (RWF) Root words for each target word were identified from Becker, Dixon, and Anderson-Inman (Reference Becker, Dixon and Anderson-Inman1980). Then SFI values were obtained for each root word as described above.

Morphological Family Size (NUMREL) Morphological family members for each target word were identified using Becker et al. ’s (Reference Becker, Dixon and Anderson-Inman1980) database. The database was digitalized and then searched for words that had the same root word as the target word including inflections, derivations, and compounds. An example is for effortlessly, the words effortless, effortful, and effortfully all had the same root word effort.

Number of Morphemes (NUM) The number of morphemes within each target word was identified from Becker et al. ’s (Reference Becker, Dixon and Anderson-Inman1980) database. The morphographic breakdown was used with the example effortlessly shown to contain four morphemes as in ef+(fort)+less+ly.

Average Affix Frequency (FAFFIX) The Becker et al. ’s (Reference Becker, Dixon and Anderson-Inman1980) database was also used to determine the average frequency of the affixes contained within the target word. The frequency of each affix was identified within the database and then the average frequency was calculated. An example is effortlessly, which has the suffixes less (frequency of 219) and ly (frequency of 1101), making the average affix frequency for effortlessly 660.

Most Frequent Affix (HIGHAFFIX) To determine the most frequent affix, the frequencies of each affix were determined using Becker et al. (Reference Becker, Dixon and Anderson-Inman1980). The frequency of the affix with the highest frequency value was recorded, with for example, the frequency of 1101 for ly recorded for effortlessly.

Opaqueness (OPAQUE) Transparency of each target word was determined as opaque (value $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1) if there were spelling and phonological changes between the target word and its root word like circulate (whose root word is circle) or as transparent (value $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0) if there were no spelling or phonological changes between the target word and its root word like effortlessly (whose root word is effort).

4.3. Analysis

To answer research question (a)i, the dimensionality structure of word knowledge was first investigated at each time point. Three hypothesized structures stemming from the vocabulary literature were explored (i.e., a 1-dimensional [1D] model, a 2-dimensional [2D] model [multiple-choice , self-report $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} breadth, related word production $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} depth], and a 3-dimensional [3D] model). The unidimensional perspective represents the view taken by many researchers that word knowledge can be assessed in many ways, yet each measure is assessing a general construct of word knowledge. The many studies that assess word knowledge via a single assessment represent this view (see Pearson et al. , Reference Pearson, Hiebert and Kamil2007 for further details). The hypothesized two dimensional model represents findings that there are two specific dimensions of word knowledge that must be considered. These are breadth, which conveys how many words students have basic knowledge of, and depth, which refers to how much knowledge a person has about words (Ouellette, Reference Ouellette2006; Tannenbaum, Torgesen, & Wagner, Reference Tannenbaum, Torgesen and Wagner2006). Here, a person may recognize command but not be able to define or interpret or use the word across multiple contexts. Depth of word knowledge appears to be more important than breadth in supporting reading comprehension (Ouellette, Reference Ouellette2006). The three dimensional model takes into account findings like those from Kieffer and Lesaux (Reference Kieffer and Lesaux2012) that indicate word knowledge is also composed of additional dimensions including a dimension for morphological awareness (i.e., knowledge of units of meaning like root words [e.g., command] and affixes [e.g., er in commander] and links to morphological relatives [e.g., command, commander, commanding]).

The three comparison models have the intercept, random effects at the reader level, and random effects at the item level, respectively. To determine which multilevel structure is needed to be modeled (i.e., cross-classified tutors effects, nested teacher effects, and nested word effects), the models with and without nested random effects and crossed random effects were compared at each time point based on which hypothesized structure was shown to fit the data best (i.e., dimensionality). Akaike information criterion (AIC; Akaike, Reference Akaike1974) and Bayesian information criterion (BIC; Schwarz, Reference Schwarz1978) were used to identify the best- fitting model for the dataset. In the calculation of BIC, the number of readers ( $J = 202$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J=202$$\end{document} ) was used to adjust the sample size for model complexity.

For the finalized unconditional model based on results of research question (a)(i), the measurement invariance was investigated by comparing word easiness at the word level between the pretest and the posttest. When the correlation coefficient between pretest and posttest word easiness is high, we conclude that there is evidence of measurement invariance because the relative ordering of the word easiness did not change between the two time points.

Based on model selection and measurement invariance results, the generalized mixed effect model for doubly multilevel structure that was specified earlier was fit to answer research question (a)ii, which explored whether significant word learning (i.e., intercept coefficients at posttest) occurred for each measure of word knowledge. Also, variability in initial knowledge and learning for each measure of word knowledge was interpreted. Using estimates of the model, word easiness values (technically, the conditional modes of the random effects) were calculated using the extractor function ranef of arm package version 1.3-06 (Gelman et al. , Reference Gelman, Su, Yajima, Hill, Pittau, Kerman and Zheng2010) in R to show variability in item easiness across words. To evaluate the model fit, we calculated item fit (the mean of the standard residuals over persons) and person fit (the mean of the standard residuals over items) statistics to confirm our measurement model represented each test item and each person well. Standardized residuals smaller than $-$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document} 2 or larger than 2 would indicate possible misfit at 5 % level. Harrell’s c calculated using somers2 function in Hmisc package (Harrell, Reference Harrell2015) was also calculated as a measure of the ordinal predictive power of the model. In addition, ICCs were calculated to show the degree of dependency due to teachers or words. The finalized measurement model (i.e., the unconditional model) was labeled as Model 1.

To answer research question (b) (i.e., the effects of word and instructional characteristics on word easiness and word learning, respectively), we added reader covariates like instructional treatment and word covariates to the unconditional model from research question (a)ii (i.e., Model 1). We focused on instructional contexts and word characteristics because preliminary analyses showed that the effects of all reader covariates except language background were not significant covariates of readers’ differences in learning. Thus, only the language background covariate will be used for further analyses among covariates of reader characteristics. Two dummy variables were created for language background with native English speakers as the reference group: ELL variable (English speaker $= 0$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=0$$\end{document} , ELL $= 1$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=1$$\end{document} ) and LMY variable (English speaker $= 0$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ = 0$$\end{document} , LMY $= 1$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=1$$\end{document} ). A dummy variable was also created for intervention status with the comparison group being the reference group (comparison group $= 0$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=0$$\end{document} , intervention group $= 1$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=1$$\end{document} ). All continuous word covariates were standardized. The model for research question (b) was labeled as Model 2.

4.4. Results

4.4.1. Results of Research Question (a)

Table 2 presents the number of parameters (Num.), the log-likelihood (LL), AIC, and BIC of the three candidate models regarding the number of dimensions and the four candidate models regarding the different random effects at each time point. At both time points, the 3D model having random effects over readers, teachers, items, and words was the best-fitting model. The correlation coefficient between word easiness values at pretest and posttest was 0.90, which provides evidence that there was the measurement invariance necessary to measure learning. Related to research question (a)i, results indicate word knowledge is best conceptualized as multidimensional (i.e., the 3D model) where multiple choice, self-report, and related words production are related but assess different aspects of word knowledge.

Table 2.

Results of model selection for pretest (top) and posttest (bottom) (Results of research question (a)i).

Num. number of parameters to be estimated, LL log-likelihood, 1D 1-dimensional model, 2D 2-dimensional model with multiple-choice , self-report representing meaning and related word production representing interrelatedness, 3D 3-dimensional model, Word random word (item group) effects, Tutor (cross-classified) random tutor effects, and Tch random teacher (cluster) effects.

Results for the generalized mixed effect model for doubly multilevel structure are displayed in Table 3 (fixed effects) and in Table 4 (population parameters of random effects).Footnote 4 Based on results reported in Model 1 of Table 4, ICCs were calculated:

$ρ {(R)}_{11} = 0.21 / (0.61 + 0.21 + 3.29) = 0.051$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (R)_{11}=0.21/(0.61+0.21+3.29)=0.051$$\end{document} , $ρ {(R)}_{12} = 0.29 / (2.12 + 0.29 + 3.29) = 0.051$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (R)_{12}=0.29/(2.12+0.29+3.29)=0.051$$\end{document} , $ρ {(R)}_{13} = 3.00 / (5.74 + 3.00 + 3.29) = 0.249$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (R)_{13}=3.00/(5.74+3.00+3.29)=0.249$$\end{document} , $ρ {(R)}_{21} = 0.02 / (0.15 + 0.02 + 3.29) = 0.006$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (R)_{21}=0.02/(0.15+0.02+3.29)=0.006$$\end{document} , $ρ {(R)}_{22} = 0.11 / (1.47 + 0.11 + 3.29) = 0.023$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (R)_{22}=0.11/(1.47+0.11+3.29)=0.023$$\end{document} , $ρ {(R)}_{23} = 0.55 / (9.17 + 0.55 + 3.29) = 0.042$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (R)_{23}=0.55/(9.17+0.55+3.29)=0.042$$\end{document} , and $ρ (I) = (1.51 / [0.15 + 0.06 + 0.59] + 1.51 + 3.29) = 0.270$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (I)=(1.51/[0.15+0.06+0.59]+1.51+3.29)=0.270$$\end{document} . The ICC values of $ρ (R)$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (R)$$\end{document} indicate that 5.1 and 6.0 % of the latent item response for the multiple choice (MC) measure were explained by teacher clustering at pretest and posttest, respectively; 5.1 and 2.3 % of the latent item response for the self-report (SR) measure were explained by teacher clustering at pretest and posttest, respectively; and 24.9 and 4.2 % of the latent item response for the related word production (MR) measure were explained by teacher clustering at pretest and posttest, respectively. Also, the ICC value of $ρ (I)$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (I)$$\end{document} suggest that 27 % of the latent item response was accounted for word clustering. A common rule of thumb in educational research is that ICCs over 0.05 indicate that there is nonignorable dependency due to clusters (e.g., Jak, Oort, & Dolan, Reference Jak, Oort and Dolan2013). According to this rule of thumb, dependency due to teachers for initial status and due to words is of concern. It is interesting that lower ICCs were found in learning due to teachers. Our interpretation of the lower ICCs are that the main word learning that was occurring was related to the intervention or comparison instruction, which was delivered by tutors.

Table 3.

Results for fixed effects of unconditional and conditional models.

Significance in bold at the 5 % level. INT intercepts, TRT treatment variable (comparison group $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0, intervention group $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1), ELL $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} ELL variable (English speaker $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0, ELL $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1), LMY LMY variable (English speaker $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0, LMY $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1), MWF morphologically complex word frequency variable, RWF root word frequency variable, NUMREL morphological family size variable, NUM number of morphemes variable, FAFFIX average affix frequency variable, HIGHAFFIX most frequent affix variable, OPAQUE opaqueness variable, MC multiple-choice, SR self-report, and MR production of morphological relatives

In Table 3, significance for the fixed effects is presented in bold. Significance at the 5 % level two-tailed test was determined by whether the absolute value of the t-statistic exceeded 2. Word learning results are shown in Model 1 of Table 3 for fixed effects. Related to research question (a)ii, results suggest there was significant learning of words as shown by the significant learning intercept coefficient for each aspect of word knowledge at posttest (EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.704, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.078 for multiple choice; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1.308, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.141 for self-report; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 2.124, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.310 for related words production). These effects on the probability scale (transformed from the logit scale) are 0.169 ( $= [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=[1/(1+$$\end{document} exp $(- 0.704)] - [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(-0.704)]-[1/(1+$$\end{document} exp(0))]), 0.287 ( $= [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=[1/(1+$$\end{document} exp $(- 1.308)] - [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(-1.308)]-[1/(1+$$\end{document} exp(0))]), and 0.393 ( $= [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=[1/(1+$$\end{document} exp $(- 2.124)] - [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(-2.124)]-[1/(1+$$\end{document} exp(0))]) for multiple choice, self-report, and related words production, respectively.

Table 4.

Results for random effects of unconditional and conditional models.

MC multiple-choice, SR self-report, and MR production of morphological relatives.

As shown in Table 4, we found that students differed in their knowledge and learning of different aspects of word knowledge. For example, we found small variability in initial knowledge and learning for multiple choice responses compared to large variability for generation of morphologically related words. This likely reflects differences between breadth and depth measures as one would expect less variability in a receptive recognition measure compared to a production measure (Anderson & Freebody, Reference Anderson and Freebody1981). It also likely reflects the incremental nature of word learning (Nagy et al. , Reference Nagy, Anderson and Herman1987), as words were taught with different intensities and with different aspects of knowledge emphasized. A single exposure to a word may build initial meaning knowledge (Carey & Bartlett, Reference Carey and Bartlett1978; Markson & Bloom, Reference Markson and Bloom1997), whereas multiple experiences may be required to build other aspects of word knowledge, like further depth of knowledge (Elleman, Lindo, Morphy, & Compton, Reference Elleman, Lindo, Morphy and Compton2009). These differences in initial knowledge and learning highlight how assessment, instruction, and experiences with words would need to be adapted depending on the type of word knowledge being examined and built.

We also explored word easiness as it relates to word learning. Table 5 displays the predicted word easiness as well as values for each word characteristic. In the table, all word characteristics except OPAQUE were standardized. Words varied in terms of easiness as shown by the value of 1.51 of the variance estimate of the random word easiness and by the easiness values shown in Table 5 where words ranged from easy (uncomfortable, 1.827) to hard (provision, $-$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document} 2.572). This variation is what will be explored using word covariates.

Table 5.

Word easiness variability over words from the unconditional generalized mixed effect model.

All word characteristics except OPAQUE were standardized. MWF morphologically complex word frequency, RWF root word frequency, NUMREL morphological family size, NUM number of morphemes, FAFFIX average affix frequency, HIGHAFFIX most frequent affix, and OPAQUE opaqueness.

For the unconditional model used to answer research question a(ii) (i.e., Model 1), standardized residuals for persons ranged from $- 0.135$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-0.135$$\end{document} to 0.117 and standardized residuals for items ranged from $- 0.036$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-0.036$$\end{document} to 0.072, suggesting good fit to the dataset. Harrell’s c was 0.91, which indicates the model fit is relatively satisfactory.

4.4.2. Results of Research Question (b)

Instructional effects on word learning controlling for language background as well as effects of word characteristics on word easiness were investigated in research question (b). Estimates of fixed effects are presented in the Model 2 column of Table 3 and estimates of population parameters of random effects are shown in the Model 2 column of Table 4. When controlling for instruction as well as the other covariates, there were significant differences in learning between the native speakers and ELLs for self-report and related word production (EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} $-$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document} 0.837, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.329 for the self-report; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} $-$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document} 1.583, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.688 for the related word production), however, there was no significant difference in learning between the native speakers and ELLs for multiple choice performance (EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} $-$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document} 0.104, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.182). In addition, there were no significant differences in learning between the native speakers and LMY for all three word knowledge measures (EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.205, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.149 for multiple choice; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.005, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.251 for self-report; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} $-$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document} 0.526, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.559 for related word production). These results indicate that morphological instruction was able to support ELL students in building multiple choice knowledge and to support LMY students in building for all three word knowledge measures.

Controlling for the other covariates, there was no significant instructional group difference in initial performance for all three word knowledge measures (EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.008, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.141 for multiple choice; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.245, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.227 for self-report; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.561, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.368 for related word production). However, controlling for the other covariates, there were significant instructional group differences in learning for all three word knowledge measures (EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.472, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.134 for multiple choice; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1.120, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.210 for self-report; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1.303, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.480 for related word production). On the probability scale, these effects are 0.116 ( $= [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=[1/(1+$$\end{document} exp $(- 0.472)] - [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(-0.472)]-[1/(1+$$\end{document} exp(0))]), 0.254 ( $= [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=[1/(1+$$\end{document} exp $(- 1.120)] - [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(-1.120)]-[1/(1+$$\end{document} exp(0))]), and 0.286 ( $= [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=[1/(1+$$\end{document} exp $(- 1.303)] - [1 / (1 +$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(-1.303)]-[1/(1+$$\end{document} exp(0))]) for multiple choice, self-report, and related word production, respectively. This indicates intervention students were much more likely to show more word knowledge at posttest versus comparison students. At the reader level, the explained variances in learning for each measure compared to Model 1 was 33.3, 27.4, and 7.3 % for multiple choice, self-report, and related word production, respectively.

Figure 3.

Visualization of the significant partial effects of NUMREL (top) and OPAQUE (bottom), adjusted to the 0 values of the other continuous covariates, the reference level of the other categorical covariates, and the 0 values of all random effects.

Controlling for the other covariates in the model, two word characteristics had significant effects on word easiness: morphological family size (NUMREL; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.798, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.310) and opaqueness (OPAQUE; EST $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} $-$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document} 2.666, SE $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0.800). The effect of 0.798 means that word easiness is 0.798 easier for every 1SD change in morphological family size. The effect of $-$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document} 2.666 indicates that words that are opaque are harder by 2.666 compared to words that are not opaque. Figure 3 presents the significant partial effects of NUMREL (top) and OPAQUE (bottom) on the probability scale with 95 % confidence interval (CI) (shown with dotted lines for NUMREL and with a vertical bar for OPAQUE), adjusted to the 0 values of the other continuous covariates, the reference level of the other categorical covariates, and the 0 for all random effects. Including word characteristics was particularly important in explaining word easiness. The explained variance for word easiness using word characteristics (compared to Model 1) is 0.621, which indicates that 62.1 % of variance in word easiness was explained by word characteristics. When word easiness values were predicted on the basis of the word characteristic fixed effects, the correlation with the estimated word easiness in Model 1 was 0.78, which means that there is moderately high explanatory power of the word characteristics.

5. Simulation Study

A simulation study was designed to show parameter accuracy of the specified model similar to Model 2 in the empirical study when the lmer function was used for parameter estimation. Parameter estimates obtained from Model 2 in the empirical study (shown in Tables 3 and 4) are used as true parameters with the same condition found in the empirical study. Specifically, item responses were generated using Model 2 for 202 persons (e.g., readers) nested within 21 clusters (e.g., teachers) and 48 items nested within 16 words at 2 time points. Five hundred replications for each model were considered.

Compared to previous generalized linear mixed effect models for binary longitudinal data (or longitudinal item response models), the novel specification in the current study is to have a doubly multilevel structure on reader and item sides. Thus, in addition to showing parameter recovery, consequences of ignoring multilevel random effects ( $s_{k t d}^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{ktd}^{(3)}$$\end{document} and $w_{g t}^{(3)}$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{gt}^{(3)}$$\end{document} in Equation 2) were investigated by fitting Model 2 without the multilevel random effects to the same 500 generated datasets. BiasFootnote 5 and root mean square error (RMSE) were calculated for accuracy measures in both Model 2 and Model 2 without the multilevel random effects.

No convergence problems occurred in either Model 2 or Model 2-1 (Model 2 without the multilevel random effects). Bias and RMSE are reported in Table 6 for each parameter estimate and for means of correlation estimates. Bias and RMSE of Model 2 appeared comparable to those seen in generalized mixed effect modeling (e.g., Cho, Partchev, & De Boeck, Reference Cho, Partchev and De Boeck2012). Overall, bias and RMSE were larger for Model 2-1 than for Model 2. Larger differences in bias and RMSE between the two models were found for the fixed effects of items than for those of readers. This is an expected result because the ICC due to words were larger than the ICC due to teachers in the simulation condition (as in the empirical study). The variances of random effects were overestimated in Model 2-1 as a consequences of ignoring the doubly multilevel structure. These results indicate that ignoring doubly multilevel structure would lead to less accurate parameter estimates.

Table 6.

Results of simulation study based on Model 2 and Model 2 without multilevel random effects (Model 2-1).

- not modeled, INT intercepts, TRT treatment variable (comparison group $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0, intervention group $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1), ELL ELL variable (English speaker $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0, ELL $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1), LMY LMY variable (English speaker $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 0, LMY $=$ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=$$\end{document} 1), MWF morphologically complex word frequency variable, RWF root word frequency variable, NUMREL morphological family size variable, NUM number of morphemes variable, FAFFIX average affix frequency variable, HIGHAFFIX most frequent affix variable, OPAQUE opaqueness variable, MC multiple-choice, SR self-report, and MR production of morphological relatives.

6. Discussion

In this study, we illustrate how the use of generalized linear mixed modeling for doubly multilevel binary longitudinal data advances the ability of reading researchers to explore word learning in a way that accurately accounts for the complexities involved. We were able to confirm the multidimensional nature of word knowledge and extend understanding related to how both word and instructional characteristics contributed to word learning. Methodologically, we were able to extend cross-sectional models that accounted for doubly multilevel structures to longitudinal settings. Such extensions would be helpful in various content areas from education to clinical psychology. A simulation study was also implemented based on the empirical study. Simulation results showed that parameters were recovered well using the lmer function and in a similar condition to the empirical study. Further, the simulation study reported that accounting for a doubly multilevel structure is necessary for accurate parameter estimates.

Because we focused on the illustration, there were methodological limitations in this paper. First, in the application, we investigated random-effect structures in the multidimensional and doubly multilevel data. AIC and BIC were used to select the best-fitting model among the models having different multidimensional and multilevel structures. Yet BIC, unlike AIC, includes a sample size as a penalty term in its calculation. In this study, the number of readers was used for calculating BIC, which is similar to that used in other applications for multilevel item response modeling (e.g., Bartolucci, Pennoni, & Vittadini, Reference Bartolucci, Pennoni and Vittadini2011). It is not clear, though, whether the sample size should be the number of readers or the number of clusters or both in multilevel data (Skrondal & Rabe-Hesketh, Reference Skrondal and Rabe-Hesketh2004). Also, the sample size count is often complicated in cross-classified data, like in our case study. In our application, both AIC and BIC suggested the same best-fitting models, providing evidence of the trustworthiness of our findings, but further research on the use of BIC would be beneficial for future multilevel and cross-classified modeling.

Another methodological challenge is related to the crossed random effects in our model specification (i.e., reader random effect and item random effect; teacher random effect and word random effect). In such a model with the crossed random effects, different estimation methods for degrees of freedom may result in differences in p-values for fixed effect inference (Molenberghs & Verbeke, Reference Molenberghs, Verbeke, De Boeck and Wilson2004, p. 135). In our application, the inference for the effects of word characteristics, based on the t-distribution, can be of particular concern because of the small number of items (i.e., 48 items) and high ICC due to words. More methodological work is required to investigate the patterns in power and Type 1 error rate in testing the fixed effects of the crossed random effect models when the different significance tests are used. In order to see whether the different significance testing leads to the same results, significance results for the effects of word characteristics were compared between the t test with a p value (at the 5 % level in a two-tailed test as used in the application) and the 95 % highest posterior density (HPD) interval test based on Markov chain Monte Carlo (MCMC) samples. To obtain HPD for the fixed effects, the same model we used to answer research question (b) was fit to the same data in the illustration using WinBUGS 1.4 (Spiegelhalter, Thomas, & Best, Reference Spiegelhalter, Thomas and Best2003). The posterior density of the fixed-effects parameter was close to symmetric with 10,000 MCMC samples after burn-in. The same significance results were found between the two significance tests, suggesting trustworthiness of our findings. We also suggest that further simulation studies would be required to generalize our simulation results to other conditions which vary in terms of the number of items, the number of persons, the number of clusters (e.g., teachers and words), cluster sizes, and the different degrees of ICCs.

In spite of these methodological limitations, this paper showed the promise of this model as applied to our case study that measured and explained word learning in a pretest and posttest study where there were multiple aspects of word knowledge and a doubly multilevel structure. From this work, we deepened understanding of word learning by showing that word knowledge is multidimensional and depends on word characteristics and instructional contexts. Our results suggest that different aspects of word knowledge are related but also provide unique information and develop differently. A student might know a synonym of a word, but only have some knowledge of that word’s morphological family. Instruction can build these different aspects of word knowledge, and results suggest that instruction develops different amounts of certain aspects of word knowledge more than others. Additionally, readers of different backgrounds seem to learn different aspects of word knowledge differently. In our study, English language learners showed less growth on self-report and related word production than native English speakers, controlling for the other covariates, whereas no significant differences were noted for multiple choice knowledge. This suggests that instruction was successful in developing certain aspects of word knowledge, whereas additional instruction might be necessary to develop these other facets of word knowledge. Importantly, our work also suggests that certain words are harder to learn. These tended to be words with smaller morphological families and words where the root word and affixes are combined in an opaque manner involving sound and spelling changes.

Overall, this work shows the importance of taking a comprehensive approach to considering word learning. By considering word knowledge as multidimensional and item-specific, we were able to identify nuances about word learning that would not have been possible without such an approach. Our study confirms that not all aspects of all words are learned equally by all learners and therefore, responses must be treated as item-specific to partition variance between readers and words to explore covariates of each. By looking beyond overall performance, our work indicates that word learning depends on word features, readers’ language backgrounds, and also instruction.

Footnotes

¹ We use the term “item” to indicate performance on an aspect of word knowledge specific to a measure and the term “word” to indicate a set of items that are focused on the same word.

² In this paper, we chose the term ‘generalized linear mixed model’ rather than ‘item response model’ because it is more commonly used in psycholinguistic literature (e.g., Baayen, Reference Baayen2008; Baayen et al. , Reference Baayen, Davidson and Bates2008; González et al. , Reference González, De Boeck and Tuerlinckx2014).

³ The authors thank the reviewer of a previous version of this paper for suggesting the representation.

⁴ Due to space consideration in Table 4, estimates were reported with two decimal points. Correlation matrices of reader-level and teacher-level random effects were positive definite based on estimates with three decimal points.

⁵ Relative bias can be more appealing than bias to indicate unacceptable bias with an empirical cutoff (e.g., 10 %). However, the relative bias can be an extremely large number when ‘true’ parameters are close to 0. Thus, bias was chosen for the accuracy measure instead of relative bias.

References

Akaike, M.. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19, 415–438.CrossRef Google Scholar

Anderson, R. C., Freebody, P.Vocabulary knowledge 1981 Newark: International Reading Association.Google Scholar

Baayen, R. H., (2008). Analyzing linguistic data: A practical introduction to statistics using R. New York: Cambridge University Pressdoi:10.1017/CBO9780511801686.CrossRef Google Scholar

Baayen, R., Davidson, D., Bates, D.. (2008). Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language, 59, 390–412. doi:10.1016/j.jml.2007.12.005.CrossRef Google Scholar

Bartolucci, F., Pennoni, F., Vittadini, G.. (2011). Assessment of school performance through a multilevel latent Markov Rasch model. Journal of Educational and Behavioral Statistics, 36, 491–522. doi:10.3102/1076998610381396.CrossRef Google Scholar

Bates, D. M., (2010). lme4: Mixed-effects modeling with R. New York: Springer.Google Scholar

Bates, D., Maechler, M., & Bolker, B. (2011). lme4: linear mixed-effects models using S4 classes. R package version 0.999375-39. http://cran.rproject.org/package=lme4.Google Scholar

Becker, W. C., Dixon, R., Anderson-Inman, L.Morphographic and root word analysis of 26,000 high frequency words 1980 Eugene: College of Education: University of Oregon Follow Through Project.Google Scholar

Bryk, A. S., Raudenbush, S. W., (1992). Hierarchical linear models in social and behavioral research: Applications and data analysis methods. 1Newbury Park, CA: Sage Publications.Google Scholar

Carey, S., Bartlett, E.. (1978). Acquiring a single new word. Papers and Reports on Child Language Development, 15, 17–29.Google Scholar

Carlisle, J. F., Katz, L. A.. (2006). Effects of word and morpheme familiarity on reading of derived words. Reading and Writing, 19, 669–693. doi:10.1007/s11145-005-5766-2.CrossRef Google Scholar

Cho, S-J, De Boeck, P., Embretson, S., Rabe-Hesketh, S.. (2014). Additive multilevel item structure models with random residuals: Item modeling for explanation and item generation. Psychometrika, 79, 84–104. doi:10.1007/s11336-013-9360-2.CrossRef Google Scholar PubMed

Cho, S-J, Gilbert, J. K., Goodwin, A. P.. (2013). Explanatory multidimensional multilevel random item response model: An application to simultaneous investigation of word and person contributions to multidimensional lexical quality. Psychometrika, 78, 830–855. doi:10.1007/s11336-013-9333-5.CrossRef Google Scholar

Cho, S-J, Partchev, I., De Boeck, P.. (2012). Parameter estimation of multiple item profiles models. British Journal of Mathematical and Statistical Psychology, 65, 438–466. doi:10.1111/j.2044-8317.2011.02036.x.CrossRef Google Scholar

De Boeck, P., Wilson, M.Explanatory item response models: A generalized linear and nonlinear approach 2004 New York: Springerdoi:10.1007/978-1-4757-3990-9.CrossRef Google Scholar

Elleman, A. M., Lindo, E. J., Morphy, P., Compton, D. L.. (2009). The impact of vocabulary instruction on passage-level comprehension of school-age children: A meta-analysis. Journal of Research on Educational Effectiveness, 2, 1–44. doi:10.1080/19345740802539200.CrossRef Google Scholar

Embretson, S. E.. (1991). A multidimensional latent trait model for measuring learning and change. Psychometrika, 56, 495–515. doi:10.1007/BF02294487.CrossRef Google Scholar

Fox, J-PBayesian item response modeling 2010 New York: Springerdoi:10.1007/978-1-4419-0742-4.CrossRef Google Scholar

Geerlings, H., Glas, C. A. W., van der Linden, W. J.. (2011). Modeling rule-based item generation. Psychometrika, 76, 337–359. doi:10.1007/s11336-011-9204-x.CrossRef Google Scholar

Gelman, A., Su, Y. S., Yajima, M., Hill, J., Pittau, M. G., Kerman, J., & Zheng, T. (2010). Data analysis using regression and multilevel/hierarchical models. R package version 1.3-06..Google Scholar

Glas, C. A. W., van der Linden, W. J.. (2003). Computerized adaptive testing with item cloning. Applied Psychological Measurement, 27, 247–261. doi:10.1177/0146621603027004001.CrossRef Google Scholar

González, J., De Boeck, P., Tuerlinckx, F.. (2014). Linear mixed modelling for data from a double mixed factorial design with covariates: A case-study on semantic categorization response times. Journal of Royal Statistical Soceity C, 63, 289–230. doi:10.1111/rssc.12031.CrossRef Google Scholar

Goodwin, A. P.. (2016). Effectiveness of word solving: Integrating morphological problem solving within comprehension instruction for middle school students. Reading and Writing: An International Journal, 29(1), 91–116. doi:10.1007/s11145-015-9581-0.CrossRef Google Scholar

Goodwin, A. P., Ahn, S.. (2010). A meta-analysis of morphological interventions: Effects on literacy achievement of children with literacy difficulties. Annals of Dyslexia, 60, 183–208. doi:10.1007/s11881-010-0041-x.CrossRef Google Scholar PubMed

Goodwin, A. P., Ahn, S.. (2013). A meta-analysis of morphological interventions in English: Effects on literacy outcomes for school-age children. Scientific Studies of Reading, 17, 257–285. doi:10.1080/10888438.2012.689791.CrossRef Google Scholar

Goodwin, A. P., Perkins, J.. (2015). Word detectives: Morphological Instruction that supports academic language. The Reading Teacher Journal, 68, 504–517.Google Scholar

Goodwin, A. P., Gilbert, J. K., Cho, S-J, Kearns, D. M.. (2014). Probing lexical representations: Simultaneous modeling of word and reader contributions to multidimensional lexical representations. Journal of Educational Psychology, 106, 448–468. doi:10.1037/a0034754.CrossRef Google Scholar

Graves, M. F.. (2007). Vocabulary instruction in the middle grades. Voices from the Middle, 15, 13–19.CrossRef Google Scholar

Harrell, F. (2015). Harrell miscellaneous. R package version 3.15-0..Google Scholar

Jak, S., Oort, F. J., Dolan, C. V.. (2013). A test for cluster bias: Detecting violations of measurement invariance across clusters in multilevel data. Structural Equation Modeling: A Multidisciplinary Journal, 20, 265–282. doi:10.1080/10705511.2013.769392.CrossRef Google Scholar

Kamata, A.. (2001). Item analysis by the hierarchical generalized linear model. Journal of Educational Measurement, 38, 79–93. doi:10.1111/j.1745-3984.2001.tb01117.x.CrossRef Google Scholar

Kieffer, M. J., Lesaux, N. K.. (2012). Knowledge of words, knowledge about words: Dimensions of vocabulary in first and second language learners in sixth grade. Reading and Writing, 25, 347–373. doi:10.1007/s11145-010-9272-9.CrossRef Google Scholar

Lesaux, N. K., Kieffer, M. J., Kelley, J. G., Russ Harris, J.. (2014). Effects of academic vocabulary instruction for linguistically diverse adolescents: Evidence from a randomized field trial. American Educational Research Journal, 51, 1159–1194. doi:10.3102/0002831214532165.CrossRef Google Scholar

Markson, L., Bloom, P.. (1997). Evidence against a dedicated system for vocabulary learning in Children. Nature, 385, 813–815. doi:10.1038/385813a0.CrossRef Google Scholar PubMed

Meade, A. W., Lautenschlager, G. J., Hecht, J. E.. (2005). Establishing measurement equivalence and invariance in longitudinal data with item response theory. International Journal of Testing, 5, 279–300. doi:10.1207/s15327574ijt0503_6.CrossRef Google Scholar

Milanzi, E., Molenberghs, G., Alonso, A., Verbeke, G., De Boeck, P.. (2015). Reliability measures in item response theory: Manifest versus latent correlation functions. British Journal of Mathematical and Statistical Psychology, 68, 43–64. doi:10.1111/bmsp.12033.CrossRef Google Scholar PubMed

Molenberghs, G., Verbeke, G.De Boeck, P., Wilson, M.. (2004). An introduction to generalized nonlinear mixed models. Explanatory item response models: A generalized linear and nonlinear approach. New York: Springer 111–153. doi:10.1007/978-1-4757-3990-9_4.CrossRef Google Scholar

Muthén, B. O., Asparouhov, T., van der Linden, W. J., & Hambleton, R. K.. (2013). Item response modeling in Mplus: A multi-dimensional, multi-level, and multi-time point example. Handbook of item response theory, models, statistical tools, and applications. Boca Raton, FL: Chapman & Hall.Google Scholar

Nagy, W. E., Anderson, R. C., & Herman, P. A.. (1987). Learning word meanings from context during normal reading. American Educational Research Journal, 24, 237–270. doi:10.3102/00028312024002237.CrossRef Google Scholar

National Governors Association Center for Best Practices and Council of Chief State School. (2010). The common core standards: English language arts. Washington, D.C.: National Governors Association Center for Best Practices, Council of Chief State School Officers..Google Scholar

Ouellette, G. P.. (2006). What’s meaning got to do with it: The role of vocabulary in word reading and reading comprehension. Journal of Educational Psychology, 98, 554–566. doi:10.1037/0022-0663.98.3.554.CrossRef Google Scholar

Pearson, P. D., Hiebert, E. H., Kamil, M. L.. (2007). Vocabulary assessment: What we know and what we need to learn. Reading Research Quarterly, 42, 282–296. doi:10.1598/RRQ.42.2.4.CrossRef Google Scholar

Perfetti, C. A.. (2007). Reading ability: Lexical quality to comprehension. Scientific Studies of Reading, 11, 357–383. doi:10.1080/10888430701530730.CrossRef Google Scholar

R Development Core Team (2009). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL http://www.R-project.org.Google Scholar

Rijmen, F., Tuerlinckx, F., De Boeck, P., Kuppens, P.. (2003). A nonlinear mixed model framework for item response theory. Psychological Methods, 8, 185–205. doi:10.1037/1082-989X.8.2.185.CrossRef Google Scholar PubMed

Schreuder, R., Baayan, R. H.,Feldman, L.. (1995). Modeling morphological processing. Morphological aspects of language processing. Hillsdale, NJ: Lawrence Erlbaum 131–154.Google Scholar

Schwarz, G.. (1978). Estimating the dimension of a model. Annals of Statistics, 6, 461–464. doi:10.1214/aos/1176344136.CrossRef Google Scholar

Sinharay, S., Johnson, M. S., Williamson, D. M.. (2003). Calibrating item families and summarizing the results using family expected response functions. Journal of Educational and Behavioral Statistics, 28, 295–313. doi:10.3102/10769986028004295.CrossRef Google Scholar

Skrondal, A., Rabe-Hesketh, S.Generalized latent variable modeling: Multilevel, longitudinal, and structural equation models 2004 Boca Raton, FL: Chapman & Halldoi:10.1201/9780203489437.CrossRef Google Scholar

Spiegelhalter, D. J., Thomas, A., Best, N. G., (2003). WinBUGS (Version, 1.4.)[Computer Program]. Cambridge, UK: MRC Biostatistics Unit, Institude of Public Health.Google Scholar

Stanovich, K. E.. (1986). Matthew effects in reading: Some consequences of individual differences in the acquisition of literacy. Reading Research Quarterly, 21, 360–107. doi:10.1598/RRQ.21.4.1.CrossRef Google Scholar

Tannenbaum, K. R., Torgesen, J. K., Wagner, R. K.. (2006). Relationships between word knowledge and reading comprehension in third-grade children. Scientific Studies of Reading, 10, 381–398. doi:10.1207/s1532799xssr1004_3.CrossRef Google Scholar

Zeno, S. M., Ivens, S. H., Millard, R. T., Duvvuri, R.The educator’s word frequency guide 1995 New York: Touchstone Applied Science Associates.Google Scholar

Figure 1. Data representation for the binary longitudinal item responses (y\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {y}}$$\end{document}) with doubly multilevel structure in the exemplar case when K=2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$K=2$$\end{document}, J=6\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=6$$\end{document}, G=2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$G=2$$\end{document}, I=6\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$I=6$$\end{document}, T=2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$T=2$$\end{document}, and D=3\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$D=3$$\end{document}. k is an index for a teacher (k=1,2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$k=1, 2$$\end{document}); j is an index for a reader (j=1,2,3,4,5,6\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$j=1, 2, 3, 4, 5, 6$$\end{document}); g is an index for a word (g=1,2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$g=1, 2$$\end{document}); i is an index for an item (i=1,2,3,4,5,6\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$i=1, 2, 3, 4, 5, 6$$\end{document}); t is an index for a time point (t=1,2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$t=1, 2$$\end{document}); d is an index for a measure (d=1,2,3\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$d=1, 2, 3$$\end{document}); k[j] indicates that a reader j is nested within a teacher k; and g[i] indicates that an item i is nested with a word g.

Figure 2. A diagram of a generalized linear mixed model for doubly multilevel binary longitudinal data. TRT treatment variable, word word characteristic variables, MC item responses for multiple-choice, SR item responses for self-report, MR item responses for production of morphological relatives, sk[j]11\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k[j]11}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random reader effect for MC at pretest, sk[j]12\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k[j]12}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random reader effect for SR at pretest, sk[j]13\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k[j]13}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random reader effect for MR at pretest, sk[j]21\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k[j]21}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random reader effect for MC at posttest, sk[j]22\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k[j]22}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random reader effect for SR at posttest, sk[j]23\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k[j]23}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random reader effect for MR at posttest, sk11\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k11}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random teacher effect for MC at pretest, sk12\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k12}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random teacher effect for SR at pretest, sk13\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k13}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random teacher effect for MR at pretest, sk21\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k21}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random teacher effect for MC at posttest, sk22\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k22}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random teacher effect for SR at posttest, sk23\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$s_{k23}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random teacher effect for MR at posttest, wg[i]1\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$w_{g[i]1}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random item effect for MC, wg[i]2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$w_{g[i]2}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random item effect for SR, wg[i]3\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$w_{g[i]3}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random item effect for MR, and wg\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$w_{g}$$\end{document}a\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {a}$$\end{document} random item-group effect. All latent variables at the reader level and at the teacher level are correlated at each level.

Table 1. Descriptive statistics.

Table 2. Results of model selection for pretest (top) and posttest (bottom) (Results of research question (a)i).

Table 3. Results for fixed effects of unconditional and conditional models.

Table 4. Results for random effects of unconditional and conditional models.

Table 5. Word easiness variability over words from the unconditional generalized mixed effect model.

Figure 3. Visualization of the significant partial effects of NUMREL (top) and OPAQUE (bottom), adjusted to the 0 values of the other continuous covariates, the reference level of the other categorical covariates, and the 0 values of all random effects.

Table 6. Results of simulation study based on Model 2 and Model 2 without multilevel random effects (Model 2-1).

Article contents

Modeling Learning in Doubly Multilevel Binary Longitudinal Data Using Generalized Linear Mixed Models: An Application to Measuring and Explaining Word Learning

Abstract

Keywords

Information

1. Introduction

2. Measuring and Explaining Word Learning

2.1. Multidimensional Word Knowledge

2.2. Doubly Multilevel Data

2.3. Longitudinal and Multigroup Data from a Pretest–Posttest Study Design

2.4. Word-Specific Word Knowledge

2.5. Empirical Research Questions

3. Modeling

3.1. Model Specification

3.2. Intraclass Correlations

4. Illustration

4.1. Data Description

4.1.1. Sample

4.1.2. Instructional Intervention

4.1.3. Measures

4.2. Word Covariates

4.3. Analysis

4.4. Results

4.4.1. Results of Research Question (a)

4.4.2. Results of Research Question (b)

5. Simulation Study

6. Discussion

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests