- Research article
- Open Access
- Open Peer Review
Pattern recognition as a concept for multiple-choice questions in a national licensing exam
© Freiwald et al.; licensee BioMed Central Ltd. 2014
- Received: 8 February 2014
- Accepted: 17 October 2014
- Published: 14 November 2014
Multiple-choice questions (MCQ) are still widely used in high stakes medical exams. We wanted to examine whether and to what extent a national licensing exam uses the concept of pattern recognition to test applied clinical knowledge.
We categorized all 4,134 German National medical licensing exam questions between October 2006 and October 2012 by discipline, year, and type. We analyzed questions from the four largest disciplines: internal medicine (n = 931), neurology (n = 305), pediatrics (n = 281), and surgery (n = 233), with respect to the following question types: knowledge questions (KQ), pattern recognition questions (PRQ), inverse PRQ (IPRQ), and pseudo PRQ (PPRQ).
A total 51.1% of all questions were of a higher taxonomical order (PRQ and IPRQ) with a significant decrease in the percentage of these questions (p <0.001) from 2006 (61.5%) to 2012 (41.6%). The proportion of PRQs and IPRQs was significantly lower (p <0.001) in internal medicine and surgery, compared to neurology and pediatrics. PRQs were mostly used in questions about diagnoses (71.7%). A significantly higher (p <0.05) percentage of PR/therapy questions was found for internal medicine compared with neurology and pediatrics.
The concept of pattern recognition is used with different priorities and to various extents by the different disciplines in a high stakes exam to test applied clinical knowledge. Being aware of this concept may aid in the design and balance of MCQs in an exam with respect to testing clinical reasoning as a desired skill at the threshold of postgraduate medical education.
- Multiple-choice questions
- Pattern recognition
- Clinical reasoning
Multiple-choice questions (MCQs) are still used in high stakes exams worldwide to assess the knowledge of medical students. Even though alternative assessment formats are available and increasingly applied, such as modified essay questions (MEQs) or objective structured clinical exams (OSCEs), the ease of use and testing efficiency of these formats are tempting features for the continued and widespread application of MCQs. In the USA and Germany, for example, MCQs constitute a major part of the National Medical Licensing Exam. While MCQs were originally designed to assess factual knowledge, well-constructed MCQs can also assess the application of knowledge, resembling a taxonomically higher order than the simple recall of isolated facts . Answering ‘higher order’ MCQs still requires cognitive knowledge, yet their realism receives greater acceptance by students and teachers [2, 3]. Cognitive knowledge alone does not guarantee competence, which integrates knowledge, skills, and attitudes . However, Glaser has already demonstrated in a developmental study in 1984, that knowledge is the single best determinant of expertise . This raises the question of whether ‘higher order’ MCQs might provide an opportunity to test the clinical reasoning skills of medical students.
Clinical reasoning used by physicians in daily practice presents itself as a combination of two different approaches: diagnostic pattern recognition (PR) and analytical hypothesis-based thinking . The ability of students to succeed in PR and clinical data interpretation shows a steady growth curve over increasing years at medical school [7, 8]. In a study to determine the relationship between problem-solving strategies and the likelihood of diagnostic success, the latter was significantly greater when study participants used PR rather than hypothetico-deductive, i.e. analytical, reasoning . While PR appears to happen unconsciously and almost automatically, the process of making an instant diagnosis is still based on the recognition of distinctive features of a certain disease and is a reasoning strategy widely used by medical experts with many years of experience . Even though PR is an important diagnostic tool and should be taught as a clinical reasoning strategy at medical school, clinicians must be aware that patterns can become rigid and the excessive focus on favorite patterns can lead to diagnostic errors when key features are prematurely assumed to represent a particular disease . Novices or unreflective physicians might focus too much on looking for the presence of specific patterns and may overlook other potentially important information . Nonetheless, medical students need to be familiar with PR and clinical data interpretation as diagnostic reasoning strategies and need to familiarize themselves with both of these principles during their undergraduate studies. However, analytical thinking requires feedback  and, therefore, cannot be applied in MCQ exams in a similar way.
Even though PR is, by definition, a personal and idiosyncratic process and might not be explicitly taught as clinical reasoning in every medical school, we hypothesize that it is an ideal concept to test applied medical knowledge in high stakes exams. To test whether and to what extent PR is used in MCQs, we defined a framework for the detection of disease patterns clinically used for PR. Based on this framework we analyzed all MCQs from the German National Licensing Exams, Part 2, between October 2006 and October 2012 in the disciplines of internal medicine, surgery, neurology, and pediatrics.
Since October 2006, every German National Medical Licensing Exam, Part 2, has consisted of 320 MCQs and an additional oral-practical exam lasting two hours per student. The exam takes place at the end of the final year of a six-year medical undergraduate curriculum and is held twice a year, in April and October. The actual final number of valid questions per exam is often below 320, because invalid MCQs are excluded after the exam has taken place. All MCQs have five possible answers and include questions with either a single correct or a single incorrect answer. In addition, long patient cases with six to 17 questions related to the same case are presented, with a single correct answer per question. Extended-matching MCQs are not included in this exam.
MCQs for the German National Medical Licensing Exam, Part 2, are developed by a national institute (IMPP, Institute for medical and pharmaceutical national exam questions). Panelists, recruited from the different specialist medical societies develop and revise the questions with respect to their scientific and clinical content. In a second step, IMPP employees check the questions with respect to formal correctness, comprehensibility, and difficulty. In a third step, referees in different panels solve the anonymized questions and discuss and revise them afterwards, if necessary, with respect to content and structure. The vote to actually use a certain question in an exam has to be unanimous.
We screened a total of 4,134 questions from the German National Medical Licensing Exam, Part 2 (October 2006 until October 2012), and assigned each question to one of 23 medical disciplines based on its topic and the correct answer. Questions from the four largest disciplines (internal medicine, surgery, neurology, and pediatrics), which constituted more than 42% of all questions, were included in this study. Questions from other disciplines, such as ophthalmology, were excluded from our analysis, because their numbers per discipline were too small for statistical analysis. In certain years, some disciplines were not even included in the exam. Questions were assigned to the pediatric discipline when the age of the described patient was below 18 years. When an overlap between internal medicine and surgery was detected, questions were assigned to the surgery discipline when surgical procedures were the correct treatment. This resulted in 1,750 questions, which were included in our analysis (internal medicine: n = 931, neurology: n = 305, pediatrics: n = 281, surgery n = 233).
Examples for the different types of MCQs
PR question (PRQ)
A 26-year-old man presents with increased thirst, urinary frequency and nocturia over the past several months. Physical examination is unremarkable. Twenty-four-hour urine osmolarity is <300 mOsm/L. A fluid deprivation test does not result in an increased urine osmolarity. Administration of 0.03 μg/kg of desmopressin results in a urine osmolarity of 450 mOsm/L after 2 hours. Which of the following is the most likely diagnosis?
Pseudo PR question (PPRQ)
A 42-year-old female consults her general practitioner because of increasing frequency of diarrhea with voluminous stools. The symptoms started six months ago and she is moving her bowels up to five times a day. Furthermore, she complains of flatulence and loss of weight (3 kg). Further investigations result in the diagnosis of celiac disease. Which diagnostic finding confirms this diagnosis?
Inverse PR question (IPRQ)
Different symptoms can lead to the diagnosis of renal artery stenosis; which symptom does not belong in this list?
Knowledge question (KQ)
Which type of bleeding is typical for low platelets?
Each question was assessed and categorized by two of four physician panelists. When disagreement in categorizing occurred, the question was discussed with one of the other two panelists and categorized according to the best fitting category according to the descriptions mentioned above.
Data were analyzed using SPSS statistical software (version 21). We assessed differences between the question categories and between the different disciplines with the χ 2-test and significance levels of p <0.05.
Question types per discipline and year
The dual process theory of reasoning includes a fast and intuitive approach and a slow and analytical approach . Experts tend to use the intuitive approach more often than novices do; however, when they cannot refer to the pattern of an illness script (a collection of signs and symptoms) , they use hypothetico-deductive reasoning as an analytical approach . Since pattern recognition is the fast approach of clinical reasoning applied every day by physicians, we hypothesized that it would occur in high stakes MCQ exams as a relevant concept. We identified 51.1% of all questions from internal medicine, surgery, neurology, and pediatrics from the German National Licensing Exam between 2006 and 2012 to be taxonomically higher order questions involving pattern recognition. However, their proportion dropped continuously from 61.5% in 2006 to 41.6% in 2012, which was way below the suggested level of at least 50% taxonomically higher order questions in MCQ exams . We also detected almost 5% of PPRQs as being ill defined PRQs, which resemble KQs, a pitfall in question design that could be added to a suggested list of common MCQ pitfalls . This can easily be avoided when panelist involved in designing questions are aware of it and will improve the quality of PRQs, raising the overall number of taxonomically higher order MCQs in an exam. However, item writing flaws are still a problem in high stakes exams, as has been demonstrated for MEQs that failed in over 50% to test more than mere recall of knowledge [15, 20]. Intensive and repeated training of panelists might be necessary.
Our study revealed that PR/diagnosis questions occurred in more than 70% of all identified PRQs that used only the first step of the clinical reasoning process  as their basic concept. In surgery and internal medicine, we detected the largest numbers of PR/therapy and PR/diagnostic procedures questions. These provide an additional step upwards in the taxonomy, because they include the interpretation of a pattern’s meaning and the application of additional knowledge . Thereby, this type of PRQ includes not only typical signs or symptoms of a disease, but also additional information, such as certain laboratory results, as part of a pattern. According to the dual-process theory, additional information is usually obtained in the clinical reasoning process by active collection , which cannot be simulated in MCQs. A cognitive model resembling pattern recognition, including additional information, has recently been developed, in order to generate multiple-choice test items . This could be very helpful in designing PRQs at a taxonomically higher order and, therefore, we suggest that pattern recognition should be added as a specific medical concept to MCQ item writing guidelines .
Using PRQs more frequently in MCQ exams to increase the cognitive level of the questions cannot be concluded from our study without additional considerations. A possible reason why PR/diagnosis questions occur mostly in neurology and pediatrics could be the high availability of disease patterns in these disciplines [25, 26]. This is especially true for the core neurological diagnostic approach of logically localizing a neural lesion, which translates well into the concept of pattern recognition. However, it must be noted that patterns in these disciplines often define rare diseases with greater relevance in postgraduate medical education within these specific disciplines. Therefore, the use of PRQs in exams for undergraduate medical students should preferably be in alignment with the content and specificity of the respective medical curriculum . Furthermore, it has been demonstrated that the skill of pattern recognition as a non-analytical model of clinical reasoning increases with experience . To train pattern recognition skills longitudinally and to provide an alignment of medical undergraduate training with PRQs in high stakes exams, students should have sufficient opportunities to practice the skill of pattern recognition and to receive supervision and feedback for their learning process [13, 22]. Another opportunity to teach diagnostic patterns could be the use of virtual patients  or electronically available PRQs and also IPRQs, where patterns can be highlighted in the learning process.
A limitation of our study is that it only included the four largest disciplines and excluded 19 smaller disciplines from the original analysis, albeit for statistical reasons. As another limitation, we only studied high stakes exams from one country. However, the framework we suggest for categorizing MCQs can be applied easily in all disciplines and countries and provides an additional concept for quality analysis of MCQ based exams. Furthermore, this framework provides a tool to design MCQs for applied knowledge using pattern recognition as the basis to test diagnostic and therapeutic strategies. It could also be helpful to find the desired balance between MCQs testing factual knowledge and applied knowledge while the amount of MCQs for applied knowledge might be higher in exams at the threshold of postgraduate medical education.
Pattern recognition is a prominent concept in MCQs from a National Medical Licensing Exam to test the application of clinical knowledge. Panelists involved in designing questions for high stakes exams should be aware of the PR concept in MCQs, in order to create PRQs with different emphases, depending on the requirements of the individual discipline. Undergraduate medical students should be provided with longitudinal learning opportunities for clinical reasoning, including feedback on their pattern recognition skills in the application of their knowledge. The quality of questions for applied knowledge in MCQ-based exams can be increased by using questions with unambiguous medical patterns to assess the clinical reasoning processes.
We would like to thank MIAMED GmbH for providing the software used for filtering and categorizing the original exam questions.
- Miller GE: The assessment of clinical skills/competence/performance. Acad Med. 1990, 65 (9 Suppl): S63-S67.View ArticleGoogle Scholar
- Peitzman SJ, Nieman LZ, Gracely EJ: Comparison of “fact-recall” with “higher-order” questions in multiple-choice examinations as predictors of clinical performance of medical students. Acad Med. 1990, 65 (9 Suppl): S59-S60.View ArticleGoogle Scholar
- Case SM, Swanson DB, Becker DF: Verbosity, window dressing, and red herrings: do they make a better test item?. Acad Med. 1996, 71 (10 Suppl): S28-S30.View ArticleGoogle Scholar
- Fernandez N, Dory V, Ste-Marie L-G, Chaput M, Charlin B, Boucher A: Varying conceptions of competence: an analysis of how health sciences educators define competence. Med Educ. 2012, 46: 357-365. 10.1111/j.1365-2923.2011.04183.x.View ArticleGoogle Scholar
- Glaser R: Education and thinking: the role of knowledge. Am Psychol. 1984, 39: 193-202.View ArticleGoogle Scholar
- Eva KW: What every teacher needs to know about clinical reasoning. Med Educ. 2005, 39: 98-106. 10.1111/j.1365-2929.2004.01972.x.View ArticleGoogle Scholar
- Williams RG, Klamen DL, Hoffman RM: Medical student acquisition of clinical working knowledge. Teach Learn Med. 2008, 20: 5-10. 10.1080/10401330701542552.View ArticleGoogle Scholar
- Williams RG, Klamen DL, White CB, Petrusa E, Fincher R-ME, Whitfield CF, Shatzer JH, McCarty T, Miller BM: Tracking development of clinical reasoning ability across five medical schools using a progress test. Acad Med. 2011, 86: 1148-1154. 10.1097/ACM.0b013e31822631b3.View ArticleGoogle Scholar
- Coderre S, Mandin H, Harasym PH, Fick GH: Diagnostic reasoning strategies and diagnostic success. Med Educ. 2003, 37: 695-703. 10.1046/j.1365-2923.2003.01577.x.View ArticleGoogle Scholar
- Regehr G, Norman GR: Issues in cognitive psychology: implications for professional education. Acad Med. 1996, 71: 988-1001. 10.1097/00001888-199609000-00015.View ArticleGoogle Scholar
- Norman GR, Eva KW: Diagnostic error and clinical reasoning. Med Educ. 2010, 44: 94-100. 10.1111/j.1365-2923.2009.03507.x.View ArticleGoogle Scholar
- Mattingly C, Fleming M: Clinical Reasoning: Forms of Inquiry in a Therapeutic Practice. 1994, Philadelphia: FA DavisGoogle Scholar
- Croskerry P: Context is everything or how could I have been that stupid?. Healthc Q. 2009, 12: e171-e176. 10.12927/hcq.2009.20945.View ArticleGoogle Scholar
- Ware J, Vik T: Quality assurance of item writing: during the introduction of multiple choice questions in medicine for high stakes examinations. Med Teach. 2009, 31: 238-243. 10.1080/01421590802155597.View ArticleGoogle Scholar
- Palmer EJ, Devitt PG: Assessment of higher order cognitive skills in undergraduate education: modified essay or multiple choice questions? Research Paper. BMC Med Educ. 2007, 7: 49-10.1186/1472-6920-7-49.View ArticleGoogle Scholar
- Kahneman D: Thinking, Fast and Slow. 2012, London, England: PenguinGoogle Scholar
- Schmidt HG, Norman GR, Boshuizen HP: A cognitive perspective on medical expertise: theory and implication. Acad Med. 1990, 65: 611-621. 10.1097/00001888-199010000-00001.View ArticleGoogle Scholar
- Norman G: Building on experience–the development of clinical reasoning. N Engl J Med. 2006, 355: 2251-2252. 10.1056/NEJMe068134.View ArticleGoogle Scholar
- Al-Faris EA, Alorainy IA, Abdel-Hameed AA, Al-Rukban MO: A practical discussion to avoid common pitfalls when constructing multiple choice questions items. J Family Community Med. 2010, 17: 96-102. 10.4103/1319-1683.71992.View ArticleGoogle Scholar
- Palmer EJ, Duggan P, Devitt PG, Russell R: The modified essay question: its exit from the exit examination?. Med Teach. 2010, 32: e300-e307. 10.3109/0142159X.2010.488705.View ArticleGoogle Scholar
- Groves M, O’Rourke P, Alexander H: Clinical reasoning: the relative contribution of identification, interpretation and hypothesis errors to misdiagnosis. Med Teach. 2003, 25: 621-625. 10.1080/01421590310001605688.View ArticleGoogle Scholar
- Pelaccia T, Tardif J, Triby E, Charlin B: An analysis of clinical reasoning through a recent and comprehensive approach: the dual-process theory. Med Educ Online. 2011, 16: 5890.View ArticleGoogle Scholar
- Gierl MJ, Lai H, Turner SR: Using automatic item generation to create multiple-choice test items. Med Educ. 2012, 46: 757-765. 10.1111/j.1365-2923.2012.04289.x.View ArticleGoogle Scholar
- Haladyna T, Downing S, Rodriguez M: A review of multiple-choice item-writing guidelines for classroom assessment. Appl Meas Educ. 2002, 15: 309-334. 10.1207/S15324818AME1503_5.View ArticleGoogle Scholar
- Rutkove SB: Pattern recognition. Neurology. 2003, 61: 585-586. 10.1212/01.WNL.0000078930.98769.11.View ArticleGoogle Scholar
- Muram D, Simmons KJ: Pattern recognition in pediatric and adolescent gynecology–a case for formal education. J Pediatr Adolesc Gynecol. 2008, 21: 103-108. 10.1016/j.jpag.2007.10.009.View ArticleGoogle Scholar
- Cookson J: Twelve tips on setting up a new medical school. Med Teach. 2013, 35: 715-719. 10.3109/0142159X.2013.799638.View ArticleGoogle Scholar
- Norman G, Young M, Brooks L: Non-analytical models of clinical reasoning: the role of experience. Med Educ. 2007, 41: 1140-1145.Google Scholar
- Adams EC, Rodgers CJ, Harrington R, Young MDB, Sieber VK: How we created virtual patient cases for primary care-based learning. Med Teach. 2011, 33: 273-278. 10.3109/0142159X.2011.544796.View ArticleGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1472-6920/14/232/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.