- Open Access
Otoskills training during covid-19 pandemic: a before-after study
BMC Medical Education volume 21, Article number: 284 (2021)
The ongoing COVID-19 pandemic has disrupted the surgical training of residents. There is a real concern that trainees will not be able to meet their training requirements. Low-fidelity surgical simulation appears to be an alternative for surgical training. The educational benefits of repeating ossiculoplasty simulations under a microscope have never been evaluated. With this study we aimed to evaluate the differences in performance scores and on a global rating scale before and after training on an ossiculoplasty simulator.
In this quasi-experimental, prospective, single-centre, before-after study with blinded rater evaluation, residents performed five microscopic ossiculoplasty tasks with a difficulty gradient (sliding beads onto rods, the insertion of a partial prosthesis, the insertion of a total prosthesis, and the insertion of a stapedotomy piston under microscopic or endoscopic surgery) before and after training on the same simulator. Performance scores were defined for each task, and total performance scores (score/min) were calculated. All data were collected prospectively.
Six out of seven intermediate residents and 8/9 novices strongly agreed that the simulator was an effective training device and should be included in the ENT residency program. The mean effect of training was a significant increase in the total performance score (+ 0.52 points/min, [95 % CI, 0.40–0.64], p < 0.001), without a significant difference between novice and intermediate residents.
This preliminary study shows that techniques for middle-ear surgery can be acquired using a simulator, avoiding any risk for patients, even under lockdown measures.
The ongoing COVID-19 pandemic has disrupted the surgical training of residents . Particularly in demanding surgical specialities that involve acquisition of procedural skills. There is a real concern that trainees will not be able to meet their training requirements and the long-term impact of suspending training indefinitely is a severe disruption of essential medical services. Teaching in the operating room can be supplemented by surgical simulation, which allows students to improve their skills in the ever-decreasing time devoted to their training . To be effective, surgical simulations must be used as part of a coherent overall strategy based on clear teaching objectives and up-to-date procedures . Careful alignment of education and practice design principles with the intended outcomes is required. Deliberate practice (DP) and mastery learning (ML) approaches to train for procedural skills can ensure expert-level performance in various procedures . Indeed, the DP method is based on 4 key components and refers to engagement in structured activities with the goal of improving performance in a domain through an iterative cycle of practice, feedback, and successive refinement [5, 6]. DP is often coupled with the ML model, where tasks are broken into a series of smaller and progressively more complex microskills . DP and ML both improve performance across a variety of disciplines, including sports and music, and there is growing evidence of their effectiveness within medical education and surgical skills [4, 8]. To master ossiculoplasty, students require regular practice in the operating room . The risks of permanent hearing loss and peripheral facial paralysis make it a delicate procedure for which increased training would be beneficial, particularly since risk-free alternatives exist (virtual reality simulators or three-dimensional printed simulators).
Low-fidelity surgical simulation appears to be an interesting alternative for practical residency training because residents can access the simulator directly in keeping with infection control practices, even during lockdowns . A number of simulators have been evaluated for basic microsurgical procedures carried out in consultations, such as the treatment of external ear canal disorders and tympanostomy tube insertion [11,12,13]. The simulator investigated in this study has previously been evaluated for endoscopic surgery of the middle ear without a microscope [14, 15]. However, the educational benefits of repeating ossiculoplasty simulations under a microscope have never been evaluated. The setting of the present study was microscope-assisted otologic surgery training in an Ear Nose Throat (ENT) surgical residency program. The aim was to prepare residents to perform increasingly demanding ossiculoplasty surgery and allow them to adapt to any intraoperative complication. The hypothesis was that using this low-fidelity middle ear surgery simulator under a microscope would be an excellent alternative to training in the operating room and that the benefits would differ depending on residents’ level of experience.
The main objective was to assess the improvement in ossiculoplasty skills after training on a simulator. The main outcome measures were the differences in performance scores before and after training; interobserver agreement was analysed to assess the internal validity of the results. Secondary objectives included a pilot validation study of the simulator, assessed in terms of its ability to distinguish amongst novice, intermediate, and expert surgeons based on their performance and global rating scale scores on video-recorded exercises.
Materials and methods
This was a quasi-experimental, prospective, single-centre before–after study with blinded rater evaluation carried out between April and May 2020 in our department. The participants were all ENT residents attending a practical workshop on microscope-assisted ossiculoplasty. They were all participating voluntarily and free of charge. The inclusion criteria were: ENT residents with no history of surgical simulation training (regardless of level of experience in middle ear surgery). There were no exclusion criteria. Experts were recruited based on their experience in middle ear surgery from two different hospitals. Participants were divided into three groups based on their levels of experience in middle-ear surgery: novices (ENT residents who had never performed middle ear surgery), intermediate-level surgeons (ENT residents with more than two semesters of experience in an otology department), and experts (senior ENT surgeons). The results from experts were used only for the pilot simulator validation study (results at T1). For sample size calculation, an improvement of 25 % of the total performance score, a standard deviation of 0.28 of the score, and a correlation of 0.5 between measurements at T1 and T2, 16 participants were needed to show a statistical improvement with 90 % power (two-sided alpha risk of 5 %). Details of the study design are given in Fig. 1.
Structure of the workshop
Participants were evaluated at baseline (first evaluation, T1) and then had three one-hour training sessions over three weeks before being assessed again one week after the last training session (second evaluation, T2). Baseline evaluations were used to investigate the simulator’s ability to distinguish among novice, intermediate, and expert surgeons. Indeed, if the exercises proposed by the simulator are well calibrated, participants’ ability to succeed in the exercises, reflected by their score, should change according to their level of experience. Measurements at T2 were not used for the discriminative power analysis of the simulator because experts only performed evaluations at baseline, and there would be less heterogeneity at T2. Baseline evaluations and measurements at T2 for novice and intermediate were used to assess the educational benefits of the course based on a total and task-specific performance score (PS) per minute and a global rating scale (GRS). The simulator chosen for the study was the Otoskills device (Grace Medical, Memphis, USA), with three different modules (Fig. 2). The first module was made of small holes and used for 2 exercises (Fig. 2, B and C). The second module represented the superstructure of the stapes and was used for the insertion of a partial ossicular reconstruction prosthesis (PORP), as shown in Fig. 2, D. The last module represented the long crus of the incus and a stapedotomy for insertion of a piston (Fig. 2, E).
Figure of the simulator tested (A) with three different modules and four different tasks performed by participants : B) insertion of a TORP (module 1); C) rods on beads (module 1); D) insertion of a PORP (module 2); E) insertion of a piston through a stapedotomy (module 3). TORP, Total Ossicular Replacement Prothesis; PORP, Partial Ossicular Replacement Prosthesis.
The exercises were devised to help participants practice procedures requiring fine motor skills and the handling of one or two instruments inside the speculum. The main objective was to develop a series of exercises to prepare participants to perform ossiculoplasty without harming patients. Exercises were devised with a slowly increasing cognitive load. They were chosen based on cognitive load and technical skills required from surgical expert opinion. The low-cognitive demand training tasks involved manipulating rods and beads under the microscope to exercise 3D vision and depth perception (module 1). The second task was the insertion of a total ossicular replacement prosthesis (TORP) using a module, mimicking an oval window without a stapes (module 1). The exercise requiring the manipulation of two instruments inside the speculum was the positioning of a PORP on a module with a stapes suprastructure (module 2). A module simulating the long crus of the incus above a stapedotomy was used for the insertion of a piston prosthesis (module 3). A final exercise of endoscopic (rather than microscope-guided) prosthesis insertion was included to test residents’ abilities in two-dimensional surgery (module 3).
Evaluation of the exercises
The participants performed each exercise four times, and the time required to complete the set was recorded to define total and task-specific PS per minute, as described by Veaudor et al. . The scores for each task depended on the number of attempts required, so 5/5 if the task was completed on the first attempt, 3/5 if two attempts were needed, 1/5 if three attempts were needed, and down to zero if five or more attempts were necessary. Participants were also evaluated using a GRS [17,18,19,20,21], defined as the sum of 6 domains rated on 5-point scales: fluency of movement, knowledge of the procedure, anticipation, choice of instrument(s), and overall technique (as developed by Vanblaricom et al. ), and an additional domain on the insertion of a stapedotomy piston (choice of forceps, positioning of the piston on the forceps and on the incus), leading to a GRS score out of 30. Each task was filmed, and two raters—expert otologists trained on previous videos—independently evaluated the anonymized video recordings, blinded to the participants’ level of experience. The trainers and raters were not from the same centre, and their faces were not recorded in the videos to assure the blinding of the evaluation.
Interobserver agreement for the total PS and the GRS score was assessed by intraclass correlation coefficients (the confidence interval was obtained by bootstrapping). The simulator’s ability to discriminate among expert, intermediate, and novice surgeons was evaluated by comparing, at T1, their results in total PS (not divided by the time taken to perform the tasks) and the GRS score (Kruskal-Wallis nonparametric test). This choice was made to quickly identify the gap between groups, regardless of the time needed by each participant. The improvement in residents’ total PS after training was evaluated using a linear mixed-effects model, including the level of experience (novice or intermediate), the rater, the assessment time, and an interaction between time and experience to investigate a possible variation in learning times with experience. Random effects considered variability between the participants at baseline and variability of improvement over time. The evaluations obtained by each rater at each time point were taken into account in the statistical analysis. The model for the GRS score was similar but used a t-distribution for random errors. The results are reported as medians and first and third quartiles for quantitative variables, and as frequencies and percentages for categorical variables. Statistical significance was set at p < 0.05, and 95 % confidence intervals [95 % CI] are provided. All analyses were performed using R software (Version 3.5.3, www.r-project.org).
The scores from the two raters agreed closely, with an intraclass correlation coefficient of 0.98 [95 % CI, 0.97–0.99] for the total PS and 0.99 [95 % CI, 0.99–1.00] for the GRS score.
Evaluation of the discriminative power of the simulator at T1
There were statistically significant differences in the median total PS, which differed by more than 10 % between novice, intermediate, and expert surgeons (16.8 [16.2–19.2], 21.3 [19.5–21.8], and 25.8 [25.1–27.9], respectively, p-value = 0.004), the trend following the level of experience (Table 1). The GRS scores followed the same trend for novices, intermediate surgeons, and expert surgeons (13.0 [8.0–17.0], 23.0 [17.5–23.5], and 30.0 [30.0–30.0], respectively, p-value = 0.003; Table 1).
Effect of training on novices and intermediate-level participants
Overall, there was a significant improvement in the total PS score after training, and the size of the effect was 0.52 points per minute ([95 % CI, 0.40–0.64], p < 0.001). This improvement was not significantly different between the novice group, 0.44 [95 % CI, 0.28–0.59], and the intermediate group, 0.60 [95 % CI, 0.43–0.78] (p-value for interaction = 0.139). There was a high variability in improvement between participants inside groups (deviations of ± 0.42). Averaging the scores in the novice and intermediate groups for the total PS score, the overall deficit of novices compared with intermediate surgeons was − 0.42 points per minute ([− 0.63 to − 0.21], p < 0.001). Training also led to a significant improvement in the GRS score, and the size of the effect was 7.1 points overall ([95 % CI, 0.96–13.2], p = 0.023), which was not significantly different between the novice group, 9.2 [0.36–17.98], and the intermediate group, 5.0 [-3.81–13.81] (p-value for interaction = 0.520). There was no evidence of systematic bias between the scores awarded by the two raters (-0.01, ([95 % CI, -0.02–0.01], p-value = 0.266). The scores at T1 and T2 are described in Fig. 3. The coefficients of the multivariate model are shown in Additional file 1 for PS score and for GRS score (Additional file 1).
Surgical simulation allows new skills in ossiculoplasty to be acquired progressively in a personalized manner, in a safe environment, with immediate feedback, all of which are major educational benefits. The simulator evaluated here for microscope-assisted middle-ear surgery successfully discriminated between the three differently experienced groups (producing a statistically significant difference in mean scores). Training was also associated with a significant improvement in overall performance scores (+ 0.52 per min, [95 % CI, 0.40–0.64], p < 0.001). Regardless of their level of experience in middle ear surgery, all participants benefited from the training. These results were expected and are concordant with numerous known published studies [22,23,24]. This simulator has spiked the interest of both senior and novice doctors.
In terms of design, one strength of the study is that the evaluations were blinded, thus increasing their external validity and generalizability. As required when assessing skills in complex tasks , the raters were experts in the field. Furthermore, the evaluation method based on the number of attempts, the time taken to perform the tasks, and an overall assessment of fluency has already been shown to be correlated with participants’ levels of experience . Another advantage of the before–after study design is that the initial evaluation serves to confirm that the abilities assessed after training were not pre-existing. Moreover, the experimental design of the study was based on previous highly robust investigations  and included an evaluation of the simulator’s discriminating power, with results confirming its ability to distinguish among novice, intermediate, and expert surgeons (Table 1). The GRS scores were extremely discriminating at baseline between novice and intermediate residents (Table 1), with a substantial improvement after training. GRS represents a promising tool to objectively assess technical skills in simulation training with high construct validity and interrater reliability as reported in other studies [24, 25]. Compared with the checklist sometimes used to evaluate simulation exercises, GRS are more robust to task-specific variations .
The study’s limitations include the following: (i) the expert group was a benchmark rather than a true control group; (ii) a single-centre study is not commonly desirable in this kind of investigation and limits the generalizability of the results; and (iii) the small sample size of the three groups limits the external validity. Nevertheless, the accessibility and ease of setup of the simulator, even during a public health crisis, should lead to widespread use in the ENT community for future multicentre studies. Finally, evaluation was based on recorded videos interpreted by raters. We did not use electromagnetic motion tracking analysis to objectively measure surgical skills in the laboratory, which again limits the external validity of the outcomes. However, the presence of expert raters previously trained, who independently evaluated the anonymized video recordings and were blinded to the participants’ level of experience, was a robust design. Since the hand–eye dissociation required to perform manipulations under indirect visual control is what makes microscope-assisted procedures particularly difficult, it would have been interesting, although ethically questionable , to verify that the skills acquired via the simulator were transferable to the operating room. In the field of surgical simulation, more work is required to define which skill standards are to be met for a given task; that is, the threshold levels that must be reached for residents to be allowed to perform the procedure in the operating room.
Subgroup analyses showed that although intermediate residents improved significantly after training, their performance increased less than that of the novices, suggesting that skill levels plateau after an initial rapid improvement. This effect has been described before and seems related to deficiencies in self-assessment [27, 28], with students thinking they are doing better than they truly are. The magnitude of increase could also be limited by a ceiling effect attributed to the adopted scale . This highlights the importance of high-quality performance evaluations. One solution is to assess levels of training separately by offering adapted exercises according to practice level. In the midst of a public health crisis, finding the right balance between productivity and safety is crucial .
It is important that training alternatives be found to compensate for ever decreasing operating room time so that residents can master procedures without putting patients at risk. While there is no replacement for actual experience in the operating room, surgical simulators seem to be promising tools for ear surgery. This preliminary study shows that techniques for middle-ear surgery can be acquired using a simulator, avoiding any risk for patients, even under lockdown measures. It is likely to be an important part of training programs for middle-ear surgery in the 21st century.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Corona virus disease 2019
Ear nose and throat
Global rating scale
Partial ossicular reconstruction prosthesis
Total ossicular reconstruction prosthesis
Crosby DL, Sharma A. Insights on Otolaryngology Residency Training during the COVID-19 Pandemic. Otolaryngol Head Neck Surg. 2020;163:38–41.
Kneebone RL, Nestel D, Vincent C, Darzi A. Complexity, risk and simulation in learning procedural skills. Med Educ. 2007;41:808–14.
Kneebone RL, Scott W, Darzi A, Horrocks M. Simulation and clinical practice: strengthening the relationship. Med Educ. 2004;38:1095–102.
Petrosoniak A, Lu M, Gray S, Hicks C, Sherbino J, McGowan M, et al. Perfecting practice: a protocol for assessing simulation-based mastery learning and deliberate practice versus self-guided practice for bougie-assisted cricothyroidotomy performance. BMC Med Educ. 2019;19:100.
Ericsson KA. Deliberate practice and acquisition of expert performance: a general overview. Acad Emerg Med Off J Soc Acad Emerg Med. 2008;15:988–94.
Ericsson KA, Lehmann AC. Expert and exceptional performance: evidence of maximal adaptation to task constraints. Annu Rev Psychol. 1996;47:273–305.
Cook DA, Hatala R, Brydges R, Zendejas B, Szostek JH, Wang AT, et al. Technology-enhanced simulation for health professions education: a systematic review and meta-analysis. JAMA. 2011;306:978–88.
McGaghie WC, Issenberg SB, Cohen ER, Barsuk JH, Wayne DB. Does simulation-based medical education with deliberate practice yield better results than traditional clinical education? A meta-analytic comparative review of the evidence. Acad Med J Assoc Am Med Coll. 2011;86:706–11.
O’Brien DC, Kellermeyer B, Chung J, Carr MM. Experience with key indicator cases among otolaryngology residents. Laryngoscope Investig Otolaryngol. 2019;4:387–92.
Zirkle M, Roberson DW, Leuwer R, Dubrowski A. Using a virtual reality temporal bone simulator to assess otolaryngology trainees. The Laryngoscope. 2007;117:258–63.
Luu K, Straatman L, Nakku D, Westerberg B, Carter N, Clark M. Evaluation of a low-fidelity ear surgery simulator in a low-resource setting. J Laryngol Otol. 2017;131:1010–6.
Clark MPA, Nakku D, Westerberg BD. An endoscopic Ear Trainer for the low-resource setting. J Laryngol Otol. 2019;133:571–4.
Clark MPA, Westerberg BD, Mitchell JE. Development and validation of a low-cost microsurgery Ear Trainer for low-resource settings. J Laryngol Otol. 2016;130:954–61.
Dedmon MM, O’Connell BP, Kozin ED, Remenschneider AK, Barber SR, Lee DJ, et al. Development and Validation of a Modular Endoscopic Ear Surgery Skills Trainer. Otol Neurotol Off Publ Am Otol Soc Am Neurotol Soc Eur Acad Otol Neurotol. 2017;38:1193–7.
Dedmon MM, Xie DX, O Connell BP, Dillon NP, Wellborn PS, Bennett ML, et al. Endoscopic Ear Surgery Skills Training Improves Medical Student Performance. J Surg Educ. 2018;75:1480–5.
Veaudor M, Gérinière L, Souquet P-J, Druette L, Martin X, Vergnon J-M, et al. High-fidelity simulation self-training enables novice bronchoscopists to acquire basic bronchoscopy skills comparable to their moderately and highly experienced counterparts. BMC Med Educ. 2018;18:191.
Brydges R, Hatala R, Zendejas B, Erwin PJ, Cook DA. Linking simulation-based educational assessments and patient-related outcomes: a systematic review and meta-analysis. Acad Med J Assoc Am Med Coll. 2015;90:246–56.
Martin JA, Regehr G, Reznick R, MacRae H, Murnaghan J, Hutchison C, et al. Objective structured assessment of technical skill (OSATS) for surgical residents. Br J Surg. 1997;84:273–8.
Weizman NF, Manoucheri E, Vitonis AF, Hicks GJ, Einarsson JI, Cohen SL. Design and validation of a novel assessment tool for laparoscopic suturing of the vaginal cuff during hysterectomy. J Surg Educ. 2015;72:212–9.
VanBlaricom AL, Goff BA, Chinn M, Icasiano MM, Nielsen P, Mandel L. A new curriculum for hysteroscopy training as demonstrated by an objective structured assessment of technical skills (OSATS). Am J Obstet Gynecol. 2005;193:1856–65.
Vassiliou MC, Feldman LS, Andrew CG, Bergman S, Leffondré K, Stanbridge D, et al. A global assessment tool for evaluation of intraoperative laparoscopic skills. Am J Surg. 2005;190:107–13.
Datta V, Mackay S, Mandalia M, Darzi A. The use of electromagnetic motion tracking analysis to objectively measure open surgical skill in the laboratory-based model. J Am Coll Surg. 2001;193:479–85.
Kneebone R. Simulation in surgical training: educational issues and practical implications. Med Educ. 2003;37:267–77.
Zoller A, Hölle T, Wepler M, Radermacher P, Nussbaum BL. Development of a novel global rating scale for objective structured assessment of technical skills in an emergency medical simulation training. BMC Med Educ. 2021;21:184.
Ilgen JS, Ma IWY, Hatala R, Cook DA. A systematic review of validity evidence for checklists versus global rating scales in simulation-based assessment. Med Educ. 2015;49:161–73.
Kirkpatrick JJ, Naylor IL. The qualities and conduct of an English surgeon in 1446: as described in a manuscript attributed to Thomas Morstede. Ann R Coll Surg Engl. 1997;79:225–8.
Andersen SAW, Konge L, Mikkelsen PT, Cayé-Thomasen P, Sørensen MS. Mapping the plateau of novices in virtual reality simulation training of mastoidectomy. The Laryngoscope. 2017;127:907–14.
Jowett N, LeBlanc V, Xeroulis G, MacRae H, Dubrowski A. Surgical skill acquisition with self-directed practice using computer-based video training. Am J Surg. 2007;193:237–42.
Curran VR, Fairbridge NA, Deacon D. Peer assessment of professionalism in undergraduate medical education. BMC Med Educ. 2020;20:504.
The translation of this article was made by Green Grow Science (P. Guéry) and was supported by the Bibliotheque Scientifique de l’Internat de Lyon, France.
Ethics approval and consent to participate
This study involved human participants and was performed in accordance with the Declaration of Helsinki. It has been approved by an appropriate ethics committee “Comité d’Ethique du CHU de Lyon” (n°20–96). All data were anonymized. The data collected are outside the remit of French data protection laws. Informed consent to participate was obtained from all participants.
Consent for publication
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Fieux, M., Gavoille, A., Subtil, F. et al. Otoskills training during covid-19 pandemic: a before-after study. BMC Med Educ 21, 284 (2021). https://doi.org/10.1186/s12909-021-02706-8
- Medical education