DAELMANS, H. E., VAN DER HEMSTOKROOS, H. H., HOOGENBOOM, R. J., et al (2005) Global clinical performance rating, reliability and validity in an undergraduate clerkship. Netherlands Journal of Medicine, 63, 279–284.
DEVITT, J. H., KURREK, M. M., COHEN, M. M., et al (1997) Testing the raters: inter-rater reliability of standardised anaesthesia simulator performance. Canadian Journal of Anaesthesia, 44, 924–928.
JACKSON, C. & TINKLER, P. (2001) Back to basics: a consideration of the purposes of the PhD viva. Assessment and Evaluation in Higher Education, 26, 355–366.
JONES, R., HIGGS, R., DE ANGELIS, C., et al (2001) Changing face of medical curricula. Lancet, 357, 699–703.
LANDIS, J. R. & KOCH, G. G. (1977) The measurement of observer agreement for categorical data. Biometrics, 33, 159–174.
MCDANIEL, M. A., WHETZELL, D. L., SCHMIDT, F. L., et al (1994) The validity of employment interviews: a comprehensive review and meta-analysis. Journal of Applied Psychology, 79, 599–616.
MCKINLEY, R. K., HASTINGS, A. M. & PETERSEN, S. (2005) The long case revisited. Medical Education, 39, 442–447.
MORGAN, P. J., CLEAVE-HOGG, D. & GUEST, C. B. (2001) A comparison of global ratings and checklist scores from an undergraduate assessment using an anaesthesia simulator. Academic Medicine, 76, 1053–1055.
MORLEY, L., LEONARD, D. & DAVID, M. (2002) Variations in vivas: quality and equality in British PhD assessments. Studies in Higher Education, 27, 263–273.
NORCINI, J. J. (2002) The death of the long case?
BMJ, 324, 408–409.
Norcini, J. J., Blank, L. L., Arnold, G. K., et al (1995) The mini-CEX (clinical evaluation exercise): a preliminary investigation. Annals of Internal Medicine, 123, 795–799.
PETRUSA, E. R. (2002) Clinical performance assessments. In International Handbook for Research in Medical Education (eds NORMAN, G. R., VAN DER VLEUTEN, C. P. M. & NEWBLE, D. I.) pp. 673–709. Kluwer Academic Publishers.
SCHWIEBERT, P. & DAVIS, A. (1993) Increasing inter-rater agreement on a family medicine clerkship oral examination – a pilot study. Family Medicine, 25, 182–185.
TUTTON, P. J. M. & GLASGOW, E. F. (2005) Reliability and predictive capacity of examinations in anatomy and improvement in the reliability of viva voce (oral) examinations by the use of a structured rating system. Clinical Anatomy, 2, 29–34.
VANDER VLEUTEN, C.P.M. & SCHUWIRTH, L.W.T. (2005) Assessing professional competence: from methods to programmes. Medical Education, 39, 309–317.
WASS, V. & JOLLY, B. (2001) Does observation add to the validity of the long case?
Medical Education, 35, 729–734.
WASS, V. & VAN DER VLEUTEN, C. P. M. (2004) The long case. Medical Education, 38, 1176–1180.
WASS, V., JONES, R. & Van Der Vleuten, (2001) Standardised or real patients to test clinical competence? The long case revisited. Medical Education, 35, 321–325.
WIESNER, W. H. & CRONSHAW, S. F. (1988) A meta-analytic investigation of the impact of the interview format and degree of structure on the validity of the employment interview. Journal of Occupational Psychology, 61, 275–290.