References
American Council on the Teaching of Foreign Languages. 1989. ACTFL Proficiency Guidelines. Yonkers: Author.
Bachman, L. F. 2005. Building and supporting a case for test use. Language Assessment Quarterly 2: 1–34.
Bachman, L. F., and A. S. Palmer. 1996. Language testing in practice. Oxford: Oxford University Press.
Bormuth, J. R. 1970. On the theory of achievement test items. Chicago: University of Chicago Press.
Chi, M. T. H., R. Glaser, and M. Farr, eds. 1988. The nature of expertise. Mahwah, N.J.: Erlbaum.
Collins, A., J. S. Brown, and S. E. Newman. 1989. Cognitive apprenticeship: Teaching the crafts of reading, writing, and mathematics. In Knowing, learning, and instruction: Essays in honor of Robert Glaser, edited by L. B. Resnick, 453–94. Hillsdale, N.J.: Lawrence Erlbaum Associates.
Cronbach, L. J., G. C. Gleser, H. Nanda, and N. Rajaratnam. 1972. The dependability of behavioral measurements: Theory of generalizability for scores and profiles. New York: Wiley.
Douglas, D. 2000. Assessing language for specific purposes. Cambridge: Cambridge University Press.
Enright, M. K., W. Grabe, K. Koda, P. Mosenthal, P. Mulcahy, and M. Schedl. 2000. TOEFL 2000 reading framework: A working paper (TOEFL Monograph Series MS-17). Princeton: Educational Testing Service.
Ericsson, K. A. 1996. The acquisition of expert performance: An introduction to some of the issues. In The road to excellence: The acquisition of expert performance in the arts and sciences, sports, and games, edited by K. A. Ericsson. Mahwah, N.J.: Lawrence Erlbaum Associates.
Gasser, H. 1955. How to draw and paint. New York: Dell.
Gitomer, D. H., L. S. Steinberg, and R. J. Mislevy. 1995. Diagnostic assessment of trouble-shooting skill in an intelligent tutoring system. In Cognitively diagnostic assessment, edited by P. Nichols, S. Chipman, and R. Brennan, 73–101. Hillsdale, N.J.: Erlbaum.
Greeno, J. G. 1983. Conceptual entities. In Mental models, edited by D. Gentner and A. L. Stevens. Hillsdale, N.J.: Lawrence Erlbaum Associates.
Greeno, J. G., A. M. Collins, and L. B. Resnick. 1997. Cognition and learning. In Handbook of educational psychology, edited by D. Berliner and R. Calfee, 15–47. New York: Simon and Schuster Macmillan.
Greeno, J. G., P. D. Pearson, and A. H. Schoenfeld. 1997. Implications for the National Assessment of Educational Progress of research on learning and cognition. In Assessment in transition: Monitoring the nation's educational progress, background studies, edited by R. Linn, R. Glaser, and G. Bohrnstedt, 151–215. Stanford: The National Academy of Education.
Holland, P. W., and H. Wainer. 1993. Differential item functioning. Hillsdale, N.J.: Erlbaum.
Kadane, J. B., and D. A. Schum. 1996. A probabilistic analysis of the Sacco and Vanzetti evidence. New York: Wiley.
Lave, J. 1988. Cognition in practice. New York: Cambridge University Press.
Linn, R. L. 1993. Linking results of distinct assessments. Applied Measurement in Education 6: 83–102.
Messick, S. 1989. Validity. In Educational measurement, 3rd ed., edited by R. L. Linn, 13–103. New York: American Council on Education/Macmillan.
Messick, S. 1994. The interplay of evidence and consequences in the validation of performance assessments. Educational Researcher 23: 13–23.
Mislevy, R. J. 1994. Evidence and inference in educational assessment. Psychometrika 59: 439–83.
Mislevy, R. J. 2003. Substance and structure in assessment arguments. Law, Probability, and Risk 2: 237–58.
Mislevy, R. J., and D. H. Gitomer. 1996. The role of probability-based inference in an intelligent tutoring system. User-Modeling and User-Adapted Interaction 5: 253–82.
Mislevy, R. J., L. Steinberg, and R. Almond. 2003. On the structure of educational assessments. Measurement: Interdisciplinary Research and Perspectives 1: 3–62.
Mitchell, R. 1992. Testing for learning: How new approaches to evaluation can improve American schools. New York: The Free Press.
Myford, C. M., and R. J. Mislevy. 1996. Monitoring and improving a portfolio assessment system. CSE Technical Report 402. Los Angeles: National Center for Research on Evaluation, Standards, and Student Testing (CRESST).
Newell, A., and H. A. Simon. 1972. Human problem solving. Englewood Cliffs, N.J.: Prentice-Hall.
Resnick, L. B. 1997. Student performance portfolios. In Psychology and educational practice, edited by H. J. Walberg and G. D. Haertel, 158–75. Berkeley: McCutchan.
Riconscente, M., R. J. Mislevy, and L. Hamel. 2005. An introduction to PADI task templates. PADI Technical Report #3. Menlo Park, Calif.: SRI International.
Salthouse, T. A. 1991. Expertise as the circumvention of human processing limitations. In Toward a general theory of expertise, edited by K. A. Ericsson and J. Smith, 286–300. Cambridge: Cambridge University Press.
Schum, D. A. 1994. The evidential foundations of probabilistic reasoning. New York: Wiley.
Schutz, A., and P. A. Moss. 2004. Reasonable decisions in portfolio assessment: Evaluating complex evidence of teaching. Education Policy Analysis Archives 12. http://epaa.asu.edu/epaa/v12n33/.
Shafer, G. 1976. A mathematical theory of evidence. Princeton: Princeton University Press.
Steinberg, L. S., and D. H. Gitomer. 1996. Intelligent tutoring and assessment built on an understanding of a technical problem-solving task. Instructional Science 24: 223–58.
Stewart, J., and R. Hafner. 1994. Research on problem solving: Genetics. In Handbook of research on science teaching and learning, edited by D. Gabel, 284–300. New York: Macmillan.
Toulmin, S. E. 1958. The uses of argument. Cambridge: Cambridge University Press.
Wolf, D., J. Bixby, J. Glenn, and H. Gardner. 1991. To use their minds well: Investigating new forms of student assessment. In Review of research in education, vol. 17, edited by G. Grant, 31–74. Washington, D.C.: American Educational Research Association.