Virtual patients design and its effect on clinical reasoning and student experience: a protocol for a randomised factorial multi-centre study
© Bateman et al.; licensee BioMed Central Ltd. 2012
Received: 28 May 2012
Accepted: 4 July 2012
Published: 1 August 2012
Virtual Patients (VPs) are web-based representations of realistic clinical cases. They are proposed as being an optimal method for teaching clinical reasoning skills. International standards exist which define precisely what constitutes a VP. There are multiple design possibilities for VPs, however there is little formal evidence to support individual design features. The purpose of this trial is to explore the effect of two different potentially important design features on clinical reasoning skills and the student experience. These are the branching case pathways (present or absent) and structured clinical reasoning feedback (present or absent).
This is a multi-centre randomised 2x2 factorial design study evaluating two independent variables of VP design, branching (present or absent), and structured clinical reasoning feedback (present or absent).The study will be carried out in medical student volunteers in one year group from three university medical schools in the United Kingdom, Warwick, Keele and Birmingham. There are four core musculoskeletal topics. Each case can be designed in four different ways, equating to 16 VPs required for the research. Students will be randomised to four groups, completing the four VP topics in the same order, but with each group exposed to a different VP design sequentially. All students will be exposed to the four designs. Primary outcomes are performance for each case design in a standardized fifteen item clinical reasoning assessment, integrated into each VP, which is identical for each topic. Additionally a 15-item self-reported evaluation is completed for each VP, based on a widely used EViP tool. Student patterns of use of the VPs will be recorded.
In one centre, formative clinical and examination performance will be recorded, along with a self reported pre and post-intervention reasoning score, the DTI. Our power calculations indicate a sample size of 112 is required for both primary outcomes.
This trial will provide robust evidence to support the effectiveness of different designs of virtual patients, based on student performance and evaluation. The cases and all learning materials will be open access and available on a Creative Commons Attribution-Share-Alike license.
KeywordsVirtual patients Clinical reasoning Elearning Education Undergraduate Musculoskeletal Rheumatology
Virtual patients(VPs) can be defined as electronic representations of realistic clinical cases . They have been proposed as being an ideal tool to teach clinical reasoning skills . A recent literature review, and systematic review of the literature has highlighted a lack of evidence supporting individual design properties for virtual patients as in other elearning areas . VPs are widely used in up to one third of US and Canadian medical schools, however until 2007 development costs were high . Multiple tools now exist to author virtual patients Much of the focus until now was been on their utility as educational tools in comparison to traditional teaching, in keeping with other elearning research .
A range of software packages exist for case authoring including ‘CAMPUS’, University of Heidelberg ; ‘Labyrinth’ from the University of Edinburgh ; ‘Web-SP’ from the Karolinska Institute, Sweden ; and ‘vpSim’ from the University of Pittsburgh . Other researchers used bespoke software solutions to author cases [10, 11].
An international technology interoperability standard for VPs was adopted in 2009 by the Medbiquitous Consortium . This benchmark allows the interoperability of VP cases between compatible software systems allowing authoring, editing and playing of cases . This facilitates collaboration, research, open access and the upkeep of these electronic resources [14, 15]. This has potentially changed the working definition of what a VP is, by re-defining properties and dimensions of VPs[16, 17]. A European Commission funded study has produced self-reported evaluation scores to help evaluation, the EViP project .
There are numerous VP design properties identified in the literature  and their impact on the learning experience are poorly understood. Of particular interest are, firstly, the use of branching case pathways , and secondly, the role of structured feedback to promote clinical reasoning. Branching cases are more difficult to construct, more expensive when compared with linear cases , and may have unpredictable effects on individual students . Clinical reasoning teaching support provides structured approaches to clinical reasoning [20, 21], which can be deployed in VPs. In the ‘SNAPPS’ approach , students summarise case findings, narrow a differential diagnosis, analyse the differential diagnosis by comparing and contrasting possibilities, and plan management.
A number of validated tools exist to evaluate clinical reasoning and student experiences with a VP. To measure clinical reasoning, tools include the Key Feature Problem [22, 23], and the Diagnostic Thinking Inventory[24, 25]. Other appropriate assessments include multiple-choice questions, Bayesian reasoning questions [25, 26] and diagnostic proficiency .
In musculoskeletal medicine there is a challenge in meeting the needs of increasing student populations . This is confounded by a lack of exposure to clinical cases  which can potentially be mitigated by the use of virtual patients.
Problem statement and hypothesis
The influence of different design features in a VP is under-researched, although they can significantly affect the time and cost of VP production. A research study in this area would be able to answer this research question.
We hypothesise that the two important independent VP design variables, branching (present or absent), and structured clinical reasoning feedback (present or absent) are likely to influence students clinical reasoning in cases, and their user experience in terms of realism, engagement and learning value.
The aim of this study is to evaluate how two independent VP design variables influence their effectiveness as an educational tool in musculoskeletal medicine. The specific objectives in the study are firstly to evaluate the performance of students exposed to different virtual patient designs in identical assessments of clinical reasoning skills. Secondly we aim to determine how different VP designs influence the student experience when using a VP. Finally we are attempting to explore the relationships between student performance in VP assessment metrics and other measurements of clinical skills, including written and clinical examinations.
This is a randomised 2x2 factorial design study evaluating two independent variables of VP design, branching (present or absent), and structured clinical reasoning feedback (present or absent).
Setting and participants
The setting is three university medical schools in the United Kingdom. These are the Warwick Medical School (WMS), the University of Birmingham Medical School (UBMS), and Keele Medical School (KMS). WMS runs a four year MBChB degree open only to graduate entry medical students, UBMS and KMS have a five year MBChB degree course, open to undergraduate entry medicine (UEM) graduate entry medical (GEM) students. The research project will run from 2011 to 2013.
Virtual Patient software information technology
Virtual patient cases in the study are created to the Medbiquitous standard  using the XML programming language . The software used to create and host the cases is DecisionSim® v2.0, developed by the University of Pittsburgh. The cases are compatible with open source VP systems such as Open Labyrinth . Access to cases, participation, electronic consent, and post case evaluations will be controlled by the VP software, and content hosted on the University of Warwick virtual learning environment Internet pages. Students will be registered with and logged in to the software, allowing tracking of decisions and performance.
Recruitment and baseline data collection
All eligible students will be invited to participate in the study. Inclusion criteria are students in the year group studying MSK medicine. The only exclusion criteria are students who do not volunteer or consent. Eligible students will be invited to attend an oral presentation and demonstration of the study, and given an approved study participant information sheet. Students who do not electronically record their informed consent will not be able to complete any cases, and are not considered to be participants. Students who consent will be considered to be study participants from this point onwards. At this point the baseline data collected from students will be gender, email address, student type (UEM/ GEM), year of study, and institution.
Additional data and information on other aspects of student performance will be collected from the examinations officer at WMS only. This includes student performance on formative clinical and written assessments at both at the end of the musculoskeletal block, and the end of year assessments.
Intervention and Independent design variables
The intervention consists of students completing four VP cases sequentially. Each case takes approximately 30 minutes to complete. The cases focus on four core clinical musculoskeletal areas. These are large joint arthritis, back pain, polyarthritis, and connective tissue disease. The 2x2 factorial study design means that any cases can be designed in four different ways. The four case designs are: A) not branched+ no-feedback; B) branched+ no feedback; C) not branched+ feedback; D) branched+ feedback. Students will use all four of the case designs during the research (see Figure 1).
The first variable is branching pathways through the VP, present or absent. There are four branching points with three choices through the thirty-minute case. This gives a possible 81 core pathways (3^4) through the case in a branched form. The linear case has a single core pathway, with participants being redirected back to the core pathway irrespective of the decision made, for example by feedback from a supervising clinician in the case. The second variable is the use of structured feedback to promote clinical reasoning skills, present or absent. This will be in a predetermined approach through the cases at five key points through the case, based on the ‘SNAPPS’ approach , systematic approaches to help Bayesian reasoning , and symptom categorisation .
Cases will be piloted and tested by healthcare professionals and a cohort of students in one centre prior to the study commencing. For the study, students will complete cases at WMS and KMS is in the form of sequential teaching sessions to students, taking place in a computer cluster. Students at UBMS will complete cases during allocated self-study time during their musculoskeletal block.
Other than the described independent variables we will control for other design variables highlighted in a critical literature review .
Inclusion and exclusion criteria
Inclusion criteria are students enrolled on the medical degree course and in the musculoskeletal teaching block in one of the medical schools in the study. Students must electronically sign consent to be included. Exclusion criteria are students from other year-groups. Students registered for a medical degree are required and assumed to have appropriate language and information technology skills.
Students will be blind to their group allocation. Investigator blinding for the purposes of the data analysis and allocation is not used. In the institution where clinical examination performance is recorded, none of the investigators examine within the clinical specialty (musculoskeletal medicine).
Outcomes Measured during the study
Outcomes for individual cases
All institions (WMS, KMS, UBMS)
Primary Outcome Measures collected for each VP
Validated clinical reasoning assessments.
Key Feature Problems score (x/8)
Student completes during the case
Clinical Decision (x/4)
Bayesian Statistical Question (x/1)
Working diagnosis (x/2)
(Total score x/15)
Self reported evaluation (EVIP)
EViP Questionnaire (multiple domains)
After each case
Case Preference: reasoning (n from 4) learning (n from 4)
On completion of all cases
Preference of case (learning)
Preference of case (realism)
Secondary Outcome measures
Other metrics collected in electronic environment
Time spent per case (seconds)
During the case, recorded automatically
Time spent per node (seconds)
Number of nodes visited
Case completion percentage.
Time spent per question (seconds)
Validated self reported reasoning assessment (DTI)
Baseline and Mean improvement
Immediately Pre-and post- intervention
Written and clinical
1 week post intervention
Written and clinical
End of the year
Student evaluation of each case will be collected using an electronic version of the EViP questionnaire, a fifteen item self reported evaluation. This explores exploring authenticity, professionalism, learning, and coaching through the case, using Likert scales with additional free text responses. Secondary outcome measures for each case are student’s patterns of use of the case, such as time taken per case, case completion rates, and time taken to complete individual decisions.
Additional data will be collected from one centre, WMS, to support the validity of the VPs as educational and assessment tools. This includes a pre- and post-test Diagnostic Thinking Inventory, a 41 item validated assessment of clinical reasoning ability . Performance in summative and formative written and clinical assessments, measured one week following the VP case, and several months later will also be collected.
Sample size determination
Sample size calculation for Key Feature Problems outcomes, and student self evaluation scores
Key Feature problems
Student self reported Evaluation scores
If the interaction between branching and feedback interventions is of same order of magnitude as the expected main effects then we would require a fourfold increase in the sample size to give a total of 352 students. 
For self-reported scores, where a previous study reported a standard deviation of 0.93,  a 10% difference in scores (with a maximum of 5) is considered significant, that is a difference in mean scores of 0.5, corresponding to a standardised effect size of approximately 0.5 (moderate). Based on these assumptions, we would require a total sample size of 112 students to detect this difference with 80% power at the (two-sided) 5% level (Table 2).
Therefore 112 students would be required to detect the main effects and large interaction effect of branching and feedback on self-reported scores. To detect an interaction effect of the same order of magnitude as the expected main effects would require a total of 448 students. The pool of students available for recruitment into this study is large at the three centres (WMS, n~160; UBMS, n~400; KMS, n~150). Given unforeseen recruitment problems and some loss to follow-up, a target of 112 students should be easily achievable to quantify the main intervention effects (branching and feedback) which are the primary focus of the study, with increasing recruitment above this target providing increasing power to detect potential interactions between the main effects.
We will present absolute numbers for enrolment, eligibility, and complete follow up. Descriptive statistics will be used to present student demographics, along with the mean, standard deviation, standard error of the mean, and 95% confidence intervals for primary and secondary outcome measures.
The primary analysis will be based on complete cases on a per-protocol analysis. It seems likely that some data may not be available due to voluntary withdrawal of participants, or drop-out through lack of completion of individual data items , unforeseen technical difficulties, and general loss to follow-up. Where possible the reasons for data ‘missingness’ will be ascertained and reported. The pattern of the missingness will be carefully considered and the reasons for non-compliance, withdrawal or other protocol violations will be stated and any patterns summarised. The primary analysis will investigate the fixed effects of the factorial combinations of branching and feedback on the primary outcome measures, performance in a standardised composite clinical reasoning assessment and a 15-item self reported evaluation. Analysis of covariance (ANCOVA) will be used to identify main effects, effect sizes, and interactions between the two independent design variables (feedback and branching). Blocking factors in the ANCOVA will adjust for the effects randomisation group, case ordering and recruiting centre, with student GEM status and gender as covariates. Tests from the ANCOVA will be two-sided and considered to provide evidence for a significant difference if p-values are less than 0.05 (5% significance level). Estimates of treatment effects will be presented with 95% confidence intervals. Students case preferences for learning and realism, and EViP will be evaluated using chi-squared tests for grouping factors case design and number.
We will determine the predictive validity of performance in the VP composite assessment, using one institution’s summative examination results, WMS. We will use the correlation coefficient (Pearson’s product–moment, r) to determine the effect size of any linear correlation between the VP scores and institution examinations.
A detailed statistical analysis plan (SAP) will be agreed with the trial management group at the start of the study, with any subsequent amendments to this initial SAP being clearly stated and justified. The routine statistical analysis will mainly be carried out using R (http://www.r-project.org/) and S-PLUS (http://www.insightful.com/). Results from this study will also be compared with results from other studies.
The National Health Service Local Research Ethics Committee approved the protocol as an educational research study. Warwick Medical School Biomedical Research Ethics Committee gave written ethics and institution approval in 2010. The study has institutional approval from KMS and UBMS.
The main purpose of this randomised-factorial study design is to identify the most effective design principles for VPs across a range of musculoskeletal cases, addressing a research question identified recently in the literature , which have not been answered by a recent meta-analysis .
Our use of validated assessments of clinical reasoning where possible helps to validate the research findings, however there are limitations in these existing tools to measure clinical reasoning. The use of assessment data from one institution from both summative and written examinations will assist with the interpretation of the predictive and criterion validity of the VP cases when considered as formative assessments.
The blinding of students to group allocations will hopefully minimise bias and preconceptions about virtual patients. The students in the study do not have VP education formally integrated into any curriculum, however previous exposure and familiarity with existing open access cases cannot be excluded.
Interpretation of the research findings will be facilitated by open access publication of VPs generated. This has not been used in recent published peer reviewed research on VPs [2, 3, 34–36]. The publication of these cases with the research will allow appropriate integration of the materials as a learning resource of the design process, and allow other researchers to evaluate the research methods .
JB is an Education Research Fellow with Arthritis Research UK, undertaking a PhD at Warwick Medical School. MA is a consultant rheumatologist and Director of Medical Education at University Hospitals Coventry and Warwickshire NHS Trust, where NP works as a statistician with Warwick Medical School. JK is head of the Education Development and Research Team (EDRT), Warwick Medical School. DD is Associate Professor of Medical Education at the EDRT.
This study is funded from an Educational Research Fellowship Grant from Arthritis Research UK, UK Registered charity number 207711. We would like to thank Arthritis Research UK Education and Training Committee, and faculty from musculoskeletal departments at the host and collaborating institutions. We wish to also thank the publisher Wiley for the use of the Diagnostic Thinking Inventory. We thank Dr JB McGee from the University of Pittsburgh, and DecisionSim for assistance and collaboration with the VP production.
- Ellaway RH, Poulton T, Smothers V, Greene P: Virtual patients come of age. Med Teach. 2009, 31 (8): 683-684. 10.1080/01421590903124765.View ArticleGoogle Scholar
- Cook DA, Triola MM: Virtual patients: a critical literature review and proposed next steps. Medical Education. 2009, 43 (4): 303-311. 10.1111/j.1365-2923.2008.03286.x.View ArticleGoogle Scholar
- Cook DA, Erwin PJ, Triola MM: Computerized virtual patients in health professions education: a systematic review and meta-analysis. Acad Med. 2010, 85 (10): 1589-1602. 10.1097/ACM.0b013e3181edfe13.View ArticleGoogle Scholar
- Huang G, Reynolds R, Candler C: Virtual patient simulation at US and Canadian medical schools. Acad Med. 2007, 82 (5): 446-451. 10.1097/ACM.0b013e31803e8a0a.View ArticleGoogle Scholar
- Cook DA, Levinson AJ, Garside S, Dupras DM, Erwin PJ, Montori VM: Internet-based learning in the health professions: a meta-analysis. Jama. 2008, 300 (10): 1181-1196. 10.1001/jama.300.10.1181.View ArticleGoogle Scholar
- Ruderich F, Bauch M, Haag M, Heid J, Leven FJ, Singer R, Geiss HK, Junger J, Tonshoff B: CAMPUS–a flexible, interactive system for web-based, problem-based learning in health care. Studies in health technology and informatics. 2004, 107 (Pt 2): 921-925.Google Scholar
- Begg M: Virtual patients: practical advice for clinical authors using Labyrinth. Clin Teach. 2010, 7 (3): 202-205. 10.1111/j.1743-498X.2010.00382.x.View ArticleGoogle Scholar
- Zary N, Johnson G, Boberg J, Fors UG: Development, implementation and pilot evaluation of a Web-based Virtual Patient Case Simulation environment–Web-SP. BMC medical education. 2006, 6: 10-10.1186/1472-6920-6-10.View ArticleGoogle Scholar
- McGee JB, Neill J, Goldman L, Casey E: Using multimedia virtual patients to enhance the clinical curriculum for medical students. Studies in health technology and informatics. 1998, 52 (Pt 2): 732-735.Google Scholar
- Wilson AS, Goodall JE, Ambrosini G, Carruthers DM, Chan H, Ong SG, Gordon C, Young SP: Development of an interactive learning tool for teaching rheumatology–a simulated clinical case studies program. Rheumatology (Oxford). 2006, 45 (9): 1158-1161. 10.1093/rheumatology/kel077.View ArticleGoogle Scholar
- Friedman CP, France CL, Drossman DD: A randomized comparison of alternative formats for clinical simulations. Med Decis Making. 1991, 11 (4): 265-272. 10.1177/0272989X9101100404.View ArticleGoogle Scholar
- MedBiquitous Virtual Patient Player Specifications and Description Document V.1.0. http://www.medbiq.org/working_groups/virtual_patient/VirtualPatientPlayerSpecification.pdf.
- Ellaway R, Poulton T, Fors U, McGee JB, Albright S: Building a virtual patient commons. Med Teach. 2008, 30 (2): 170-174. 10.1080/01421590701874074.View ArticleGoogle Scholar
- Ellaway RH, Davies D: Design for learning: deconstructing virtual patient activities. Med Teach. 2011, 33 (4): 303-310. 10.3109/0142159X.2011.550969.View ArticleGoogle Scholar
- Smothers V, Ellaway R, Balasubramaniam C: eViP: sharing virtual patients across Europe. AMIA Annual Symposium proceedings / AMIA Symposium AMIA Symposium. 2008, 1140: .
- Bateman J, Davies D: Virtual patients: are we in a new era?. Acad Med. 2011, 86 (2): 151-author reply 151View ArticleGoogle Scholar
- Triola MM, Campion N, McGee JB, Albright S, Greene P, Smothers V, Ellaway R: An XML standard for virtual patients: exchanging case-based simulations in medical education. AMIA Annual Symposium proceedings / AMIA Symposium AMIA Symposium. 2007, 2007: 741-745.Google Scholar
- Huwendiek S, De leng BA, Zary N, Fischer MR, Ruiz JG, Ellaway R: Towards a typology of virtual patients. Med Teach. 2009, 31 (8): 743-748. 10.1080/01421590903124708.View ArticleGoogle Scholar
- Bateman J, Davies D, Allen M: Mind wandering has an impact in electronic teaching cases. Medical Education. 2012, 46 (2): 235-10.1111/j.1365-2923.2011.04187.x.View ArticleGoogle Scholar
- Wolpaw T, Papp KK, Bordage G: Using SNAPPS to facilitate the expression of clinical reasoning and uncertainties: a randomized comparison group trial. Acad Med. 2009, 84 (4): 517-524. 10.1097/ACM.0b013e31819a8cbf.View ArticleGoogle Scholar
- Sedlmeier P, Gigerenzer G: Teaching Bayesian reasoning in less than two hours. J Exp Psychol Gen. 2001, 130 (3): 380-400.View ArticleGoogle Scholar
- Page G, Bordage G, Allen T: Developing key-feature problems and examinations to assess clinical decision-making skills. Acad Med. 1995, 70 (3): 194-201. 10.1097/00001888-199503000-00009.View ArticleGoogle Scholar
- Bordage G, Brailovsky C, Carretier H, Page G: Content validation of key features on a national examination of clinical decision-making skills. Acad Med. 1995, 70 (4): 276-281. 10.1097/00001888-199504000-00010.View ArticleGoogle Scholar
- Bordage G, Grant J, Marsden P: Quantitative assessment of diagnostic ability. Medical Education. 1990, 24 (5): 413-425. 10.1111/j.1365-2923.1990.tb02650.x.View ArticleGoogle Scholar
- Round AP: Teaching clinical reasoning–a preliminary controlled study. Medical Education. 1999, 33 (7): 480-483. 10.1046/j.1365-2923.1999.00352.x.View ArticleGoogle Scholar
- Sumner W, Hagen MD: Value of information in virtual patient performance evaluations. AMIA Annual Symposium proceedings / AMIA Symposium AMIA Symposium. 2008, 1149: .
- Badcock LJ, Raj N, Gadsby K, Deighton CM: Meeting the needs of increasing numbers of medical students–a best practise approach. Rheumatology (Oxford). 2006, 45 (7): 799-803. 10.1093/rheumatology/kel070.View ArticleGoogle Scholar
- Adebajo A, Windsor K, Hassell A, Dacre J: Undergraduate education in rheumatology. Rheumatology (Oxford). 2005, 44 (9): 1202-1203. 10.1093/rheumatology/keh677. author reply 1203View ArticleGoogle Scholar
- Extensible Markup Language (XML) 1.0. http://www.w3.org/TR/REC-xml/-sec-origin-goals.
- Moher D, Schulz KF, Altman DG: The CONSORT statement: revised recommendations for improving the quality of reports of parallel group randomized trials. BMC Med Res Methodol. 2001, 1: 2-10.1186/1471-2288-1-2.View ArticleGoogle Scholar
- Norman GR, Eva KW: Diagnostic error and clinical reasoning. Medical Education. 2010, 44 (1): 94-100. 10.1111/j.1365-2923.2009.03507.x.View ArticleGoogle Scholar
- Fischer MR, Kopp V, Holzer M, Ruderich F, Junger J: A modified electronic key feature examination for undergraduate medical students: validation threats and opportunities. Med Teach. 2005, 27 (5): 450-455. 10.1080/01421590500078471.View ArticleGoogle Scholar
- Montgomery AA, Peters TJ, Little P: Design, analysis and presentation of factorial randomised controlled trials. BMC Med Res Methodol. 2003, 3: 26-10.1186/1471-2288-3-26.View ArticleGoogle Scholar
- Berman N, Fall LH, Smith S, Levine DA, Maloney CG, Potts M, Siegel B, Foster-Johnson L: Integration strategies for using virtual patients in clinical clerkships. Acad Med. 2009, 84 (7): 942-949. 10.1097/ACM.0b013e3181a8c668.View ArticleGoogle Scholar
- Botezatu M, Hult H, Fors UG: Virtual patient simulation: what do students make of it? A focus group study. BMC medical education. 2010, 10: 91-10.1186/1472-6920-10-91.View ArticleGoogle Scholar
- Zary N, Johnson G, Fors U: Web-based virtual patients in dentistry: factors influencing the use of cases in the Web-SP system. Eur J Dent Educ. 2009, 13 (1): 2-9. 10.1111/j.1600-0579.2007.00470.x.View ArticleGoogle Scholar
- Wilbanks J: Another reason for opening access to research. Bmj. 2006, 333 (7582): 1306-1308. 10.1136/sbmj.39063.730660.F7.View ArticleGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1472-6920/12/62/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.