The long case : the case against its revival

In their editorial, Michael et al focus on what they see as shortcomings in one important area of clinical assessment of psychiatrists in training and they suggest that methods of workplace-based assessment have failed. However, current thinking in assessment of doctors is to consider assessment systems as a whole and we contend that both forms of assessment - clinical examinations and workplace-based assessments - are needed because they fulfil different needs. This is not to argue that more should not be done to ensure that all who are involved in the assessment of doctors are better prepared for the task or that the current portfolio of assessment methods is complete.

Many involved in education will be familiar with Miller's pyramid, which conceptualises medical expertise in four hierarchical levels, from propositional knowledge at the base ('what the doctor knows') to performance at the pinnacle ('what the doctor does'). 1 Miller's model emphasises the importance of assessment systems examining each of these levels.Clinical examinations, whether of the Observed Structured Clinical Examination (OSCE) type or based on variations of the long case, are assessments that focus solely on the competence level of Miller's pyramid ('what the doctor can do').
As the present statutory regulator of postgraduate medical education in the UK, the General Medical Council (GMC) points out in its Standards for Curricula and Assessment Systems 2 that an assessment system is an integrated set of assessments which supports and assesses the whole curriculum and that 'competence (can do) is necessary but not sufficient for performance (does)' (p.3).It follows that an assessment system should consist of both competence and performance elements.
Van der Vleuten's approach to considering utility of assessment 3 is a very helpful one when selecting tools that should be used at each level of assessment.He pointed out that in mathematical terms the utility of an assessment system might be considered as the product of its reliability, validity, feasibility and educational impact.If the value of any of these qualities approaches zero, no matter how positive the remaining values are, the utility of the assessment system will also approach zero.
Although the long case examination is one of the most venerable forms of assessment in medical education, 4 and it has a good deal of face validity to psychiatrists, there are serious concerns about its reliability, 4 which ultimately reduces its utility as a tool in assessing the competence level.These concerns arise because the assessment is based on an encounter with one patient and unstructured questioning by examiners. 5Norcini 6 has reported reliability estimates for a single long case of 0.24.Having more assessments performed by more assessors and observing the whole encounter between candidate and patient increase the reliability of the long case.Six such long-case assessments are needed to bring a reliability coefficient of 0.8.Unfortunately, however, the large amount of assessment time needed and the lack of willing and suitable patients severely limit the feasibility of the long case examination.

The new way -OSCE
First introduced by Harden et al in Dundee, 7 the OSCE was a response to the long-standing problems with oral examinations. 8It was the first assessment of its kind to combine standardised live performances with multiple scenarios, each assessed by multiple independent examiners, and was hailed as a triumph in medical education. 8Evidence began to emerge that OSCEs resulted in increased reliability and validity when compared with traditional and less structured oral examinations. 9,10By the late 20th century, the OSCE was adopted worldwide and became known as the methods of workplace-based assessment have failed.However, current thinking in assessment of doctors is to consider assessment systems as a whole and we contend that both forms of assessment -clinical examinations and workplace-based assessments -are needed because they fulfil different needs.This is not to argue that more should not be done to ensure that all who are involved in the assessment of doctors are better prepared for the task or that the current portfolio of assessment methods is complete.
Declaration of interest W.B. is the Dean of the Royal College of Psychiatrists and in this role has overall responsibility for the College's assessment system.A.B. has delivered training events for the College and for other bodies in the administration and interpretation of workplace-based assessment tools.

The long case: the case against its revival
Commentary on . . .The long case { Wendy Burn, 1 Andrew Brittlebank 1 { See editorial, pp.377-381, this issue gold standard in undergraduate and postgraduate competence assessment in medical training and for other health professions. 11,12n 2008, the Royal College of Psychiatrists had to review its examination system to meet the standards of the Postgraduate Medical Education and Training Board (PMETB), which was then the statutory regulator of postgraduate medical education.It was decided to adopt the OSCE format as being the best for a high-stakes clinical examination and to modify the OSCE approach to deal with more complex clinical situations.And so the Clinical Assessment of Skills and Competencies (CASC) came into being.The College monitors its examinations carefully.Performance data from the most recent sitting of the CASC are normally distributed and pass rates were not significantly different over the 4 days of the examination.Reliability has increased slightly from the previous CASC diet, reaching the desired a level of 0.80 or greater on two examination days.None of the stations performed poorly.This suggests the exam is robust.
Because the OSCE combines reliability with the opportunity to test in a greater number of domains than the long case while offering a degree of feasibility, it looks likely that this format will remain an important component of the testing of the competence level of medical expertise.

Assessing the performance level
As far as assessing the performance level of medical expertise is concerned, Michael et al 13 are correct in saying that workplace-based assessment methods were little used in psychiatry and in UK medical education before their introduction as a requirement of the PMETB in 2007, as a part of the new curriculum.Since their introduction, a large study involving more than 600 doctors in training evaluated the performance of workplace-based assessment in psychiatry in the UK. 14 It has shown that an integrated package of workplace-based assessments can deliver acceptable levels of reliability and validity for feasible amounts of assessment time.Critically, the participants in this study all received a package of face-toface training in the methods of assessment.The study showed that workplace-based assessments of a psychiatrist's clinical encounter skills could achieve levels of reliability that could support high-stakes assessment decisions after approximately 3 hours of observation.Feedback from both the doctors being assessed and from assessors in this study indicated high levels of acceptability of this form of assessment.
Since the work that underpinned the Brittlebank 14 study was undertaken, it has become a curriculum requirement for all who conduct the assessment of psychiatrists in training to undergo training in assessment methods.There is evidence that such training improves the rigour and reliability of workplace-based assessments. 15This may go some way in addressing the cynicism found by Menon et al 16 shortly after the introduction of workplace-based assessment.
The portfolio of assessment tools is not complete.As Michael et al suggest, there is a need for a tool that assesses a trainee's integration of the clinical skills that are required in the primary assessment of a patient's problem.This integrative approach was used by the Royal College of Physicians of the UK in the design of the Acute Care Assessment Tool (ACAT), which was developed to give feedback on a doctor's performance on an acute medical take, where the doctor is expected to bring a range of skills to bear in a complex situation. 17The ACAT has been shown to be a useful and popular assessment tool. 17We suggest that rather than developing new clinical examinations, psychiatrists should adopt a similar approach to our physician colleagues and develop a tool that incorporates aspects of the Assessment of Clinical Expertise and the casebased discussion tools to assess the complex assessment, formulation and decision-making skills that are brought to bear when a psychiatrist assesses a new patient.

About the authors
Wendy Burn is Dean and Andrew Brittlebank is Specialist Advisor for Curriculum, Royal College of Psychiatrists.
Summary In their editorial, Michael et al focus on what they see as shortcomings in one important area of clinical assessment of psychiatrists in training and they suggest that