- Research article
- Open Access
Mixed reality for teaching catheter placement to medical students: a randomized single-blinded, prospective trial
BMC Medical Education volume 20, Article number: 510 (2020)
Cost-effective methods to facilitate practical medical education are in high demand and the “mixed-reality” (MR) technology seems suitable to provide students with instructions when learning a new practical task. To evaluate a step-by-step mixed reality (MR) guidance system for instructing a practical medical procedure, we conducted a randomized, single-blinded prospective trial on medical students learning bladder catheter placement.
We enrolled 164 medical students. Students were randomized into 2 groups and received instructions on how to perform bladder catheter placement on a male catheterization training model. One group (107 students) were given their instructions by an instructor, while the other group (57 students) were instructed via an MR guidance system using a Microsoft HoloLens. Both groups did hands on training. A standardized questionnaire covering previous knowledge, interest in modern technologies and a self-evaluation was filled out. In addition, students were asked to evaluate the system’s usability. We assessed both groups’s learning outcome via a standardized OSCE (objective structured clinical examination).
Our evaluation of the learning outcome revealed an average point value of 19.96 ± 2,42 for the control group and 21.49 ± 2.27 for the MR group - the MR group’s result was significantly better (p = 0.00). The self-evaluations revealed no difference between groups, however, the control group gave higher ratings when evaluating the quality of instructions. The MR system’s assessment showed less usability, with a cumulative SUS (system usability scale) score of 56.6 (lower half) as well as a cumulative score of 24.2 ± 7.3 (n = 52) out of 100 in the NASA task load index.
MR is a promising tool for instructing practical skills, and has the potential to enable superior learning outcomes. Advances in MR technology are necessary to improve the usability of current systems.
German Clinical Trial Register ID: DRKS00013186
Technological progress has changed the medical field dramatically in recent decades, transforming the training of practical medical and surgical tasks. There is now a variety of training models [1,2,3] available. However, training often still takes place on the patient  and simulators and well-equipped skills labs are not widely available, as they are costly and require additional resources. While most university hospitals in Germany are equipped with skills labs for their students, simulation-based training for residents to perform more specialized and demanding medical tasks, especially at non-university hospitals, remains very limited. Therefore, cost-effective methods to facilitate practical medical education – methods that can potentially reduce the number of human instructors necessary - are in high demand. “Mixed reality” (MR) is a new technology that expands our perception of our surroundings through virtual objects or information that are projected into the user’s field of vision. The term refers to a continuum encompassing Virtual Reality (VR) and Augmented Reality (AR). There is strong evidence of the practicality of MR technology in a clinical setting, and numerous successful trials have applied this technology [5,6,7]. The information is usually delivered via a portable head-mounted display that can be activated nearly everywhere. For the training of practical skills, an MR system seems most suitable to provide students with instructions when learning skills on a training model or performing a practical task for the first time in a clinical setting. While studies have reported that simulation-based medical education can be more effective than traditional clinical education , the number of studies addressing this issue and making a “one to one” comparison is still small, since implementing simulation training often requires many implementation steps and necessitates that the technology accommodate the local technology standards at hand  There have been no randomized studies benefiting from the application of MR technology for practical medical education til now, and while it is generally agreed that MR has strong potential as a useful teaching tool for medical trainees, the scientific evidence is lacking. Our trial aimed to deliver an objective and subjective evaluation of learning outcomes applying an MR system to train medical students to perform bladder catheter placement. In addition, we designed the trial to also assess the usability of the MR system and students’ learning experience.
Between August 2017 and August 2018 we recruited 164 medical students who were doing their rotation in urology in our Department of Urology at the University of Freiburg Medical Center. Students were enrolled between their 4 and 5th year in a 6-year M.D. (medical doctor) program. Study participation was open to all medical students qualified to participate in the urological rotation; there were no exclusion criteria. While participating in the urological rotation is mandatory for medical students who have passed the theoretical exam in urology after attending a lecture series, neither inclusion nor performing in this study was associated with their regular curriculum. All participants consented to participate prior to their inclusion in the study.
This study was designed adhering to the CONSORT guidelines, approved by our local ethics committee, and conducted in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments. It was registered prospectively in the German Clinical Trial Register (ID:DRKS00013186). Sample size and the ~ 2:1 ratio were calculated after concluding a pilot study using a two sided t-test (significance level p ≤ 0.05). Students were randomized into 2 study groups (numbers lottery) and given 30 min of instruction on how to perform bladder catheter placement. The instructions given to both groups were identical and standardized (supplementary file 5), and followed the recommendations on bladder catheter placement . Bladder catheter placement was carried out using a male catheterization-training model (male catheter model type LM29, Erler-Zimmer GmbH & Co. KG, Lauf, Germany) and a Tiemann catheter CH16 (UROMED Kurt Drews KG, Oststeinbek, Germany). The control group was given instructions by an instructor, while the other received instructions through a video-based AR guidance system. Both groups engaged in hands-on training during those 30 min. The MR group performed bladder catheter placement while receiving instructions from the MR system. The control group got their instructions from a human instructor and then performed the catheter placement under the instructor’s supervision. All the nursing instructors participating in this study were fully trained staff nurses in the urology department with 10 years of experience in teaching this class.
Mixed reality guidance system
Students randomized to the MR group received instructions exclusively through an MR system. Instructions were displayed through a head-mounted display (HMD) using the Microsoft Hololens (Microsoft Corporation, Redmond, USA). After an introduction (indications, how to best prepare the patient, appropriate setting), the system provided step-by-step instructions on how to prepare the materials in a sterile manner, followed by guidance through the placement process. To accommodate different performance speeds, instructions were provided on demand, and the system was programmed to stop automatically until further instructions were requested. All instructions were provided through combined video-based visual and audio guidance (Fig. 1). No student received any additional support when performing the task, however, technical support in applying the MR system was available from us study authors. This pedagogical concept was developed under the guidance of a certified expert with a master’s degree in medical education (A. F.) and relies on current recommendations for bladder catheter placement .
We conducted a standardized, non-timed OSCE (Objective structured clinical examination) after 3 days (primary outcome measure) to assess learning outcomes. The structure of this OSCE was derived from a previously evaluated, published format . As the instructor giving the OSCE was unaware about how any participant had been instructed, this was a single-blinded setting. This OSCE was not part of the urological rotation curriculum, and it was only implemented to evaluate learning outcomes related to this study.
A total score of 24 points was possible. The OSCE was drafted independently from the MR system to prevent confounding (supplementary file 4). Students also filled out a standard questionnaire (supplementary file 1) before and after the instructions. Recorded demographic data included gender, age and semester in medical school, as well as intended residency and any experience as a paramedic or nurse. For the self-evaluations (secondary outcome measure), students ranked their ability in the given task; this covered their teaching preference and an evaluation of their learning experience during the study (a 6-point Likert scale was used to avoid a neutral position). Furthermore, students using the MR system were asked to evaluate the system via the NASA Task load index  and the standard system-usability scale (SUS)  in the German language (supplementary files 2 and 3). For the SUS, students had to answer 10 questions regarding complexity, usability, functional integrity, consistency and applicability to everyday teaching on a scale from 1 to 5. The SUS’ total cumulative score was calculated according to Brooke et al. . All data was anonymized; there were no names of participants recorded.
Statistical analyses were carried out using the Statistical Package for Social Sciences software (SPSS®, Version 20.0, Chicago, IL, USA). Differences between groups were analyzed by Kruskal-Wallis test for non-parametric data and in case of significant differences confirmed by Mann-Whitney test. Numeric data differences were analyzed via ANOVA and in case of significance, confirmed by T-Test. P-values < 0.05 were considered to be significant. All data are represented as mean ± standard deviation.
Of the 164 students enrolled in our study, 95 were female and 69 male. Fifty-nine students were randomized to the MR group and 105 students were allocated to the control group. Prior experience with bladder catheter placement on a model was claimed by 84.1% of participants. Study semesters ranged from 8 to 12 with a majority of students (> 50%) having studied more than 4 years. Average age was 25.2 ± 2.8 with a range from 21 to 39 years. There was no significant difference in gender distribution (p = 0.7), age distribution (p = 0.15) or previous medical experience as a nurse or a paramedic (p = 0.57) between the MR group and controls. Prior bladder catheter-placement experience on a model (p = 0.50) or on a patient (p = 0.33), did not differ either. Nor were there any differences in the affinity with new technologies (possession of smartphone (p = 0.45), possession of a gaming console (p = 0.76), possession of a tablet (p = 0.45)) or previous experience with AR devices (p = 0.55) and IT-based technology (knowledge of a programming language (p = 0.44), usage of online learning platforms (p = 0.18) and social media (p = 0.93)). All results are depicted in Table 1.
Teaching preference and learning experience
All students filled out a standard questionnaire prior to and after the teaching session. When ranking teaching methods, training directly on a patient was given an average of 2.57 ± 1.3 points, while using a training model was given 2.92 ± 1.3. When asked to rank an instructor’s instructions versus those through digital media, 85.9% of students leaned towards the instructor (average of 2.24 ± 1.2). No significant difference appeared in student preference prior to or after the study, except for a slightly higher preference towards learning on the patient (p = 0.036) in the MR group prior to our study that disappeared afterwards. Results are depicted in Table 3.
Evaluation after the training course showed that both groups found the training concept good with a score > 3 in 84.7% in the MR group and 92.6% for the control group (p = 0.181). We identified significant differences in how participants rated confusing instructions (p = 0.002), fulfillment of expectations (p = 0.003) and level of enjoyment (p = 0.004), with the control group delivering better evaluations. However, with average point values of 2.53 ± 1.4, (MR group; 77.9% enjoyed or greatly enjoyed the class) vs. 1.9 ± 1.1 (control group; 93.3% enjoyed or greatly enjoyed the class) for class enjoyment, 2.9 ± 1.4 (MR group; 67.8% fulfilled expectations) vs. 2.25 ± 1.2 (control group; 87.6% fulfilled expectations) for fulfilled expectations and 4.5 ± 1.6 (MR group; 72.9% reported no confusing statements) vs. 5.1 ± 1.4 (control group, 81,2% reported no confusing statements) for receiving confusing instructions, students in both groups gave their learning experience a generally positive assessment. All results are depicted in Table 2.
The mean value before the training session was 2.43 ± 1.4 concerning bladder catheter placement on a patient. After training, this value increases significantly (p < 0.001) to a value of 3.4 ± 1.3, although there was no significant difference between groups. Similarly, students ranked their ability to perform bladder catheter placement on a training model (3.16 ± 1.3 before vs. 4.47 ± 1.3 after; p < 0.001), their theoretical knowledge of bladder catheter placement (3.59 ± 1.3 before vs. 4.77 ± 1.1 after; p = 0.00) and their ability to do the procedure in a sterile fashion (3.61 ± 1.3 before vs. 4.47 ± 1.0 after; p < 0.001). Our group comparison revealed that the MR group felt more confident performing bladder catheter placement independently before our study’s training session (p < 0.001). However, confidence levels after the training were equal in both groups. Nevertheless, the control group reported slightly more confidence in their theoretical knowledge of bladder catheter placement (2.12 ± 1.1) than the MR group (2.44 ± 1.0; p = 0.01) after the training, while those values before training did not differ (p = 0.58). We detected no significant difference in self-evaluations before and after the training in any of the other parameters nor in their change during the study. Detailed results are depicted in Table 3.
Our OCSE evaluation revealed an average 19.96 ± 2,42 points for the control and 21.49 ± 2.27 for the MR group - a significantly better score for the latter (p = 0.00). Cumulative learning outcome was high in both groups, with an average 20.51 ± 2.57 out of 24 points. Comparison of the performance between students with previous experience in paramedics (p = 0.442) or nursing (p = 0.888) and those with no prior medical education revealed no difference Moreover, we detected no difference within the MR group’s students with prior AR/VR technology experience (p = 0.115), ownership of a gaming console (p = 0.200), regular occupation with computer games (p = 0.232), or knowledge of a programming language (p = 0.224).
Evaluation of the MR system
Immediately after completing MR training, all students randomized to the MR group were asked to evaluate the proposed MR technology. In this context, 40.2% of participants reported finding the system difficult to use, and 28.9% believed they could only use the system with an instructor’s help, a finding reflecting poor applicability for everyday teaching. However, only 10.2% believed they would need extensive instructions. The total cumulative SUS score was 56.6 (n = 52). SUS results are depicted in Table 4. Analysis of the NASA Task load index answers estimating how strenuous it is to use the MR system revealed a cumulative score of 24.2 ± 7.3 (n = 52) out of a maximum of 100 (=highest perceived workload). While mental (3.58 ± 1.8) and physical demand (2.48 ± 1.9) as well as time demand (4.09 ± 1.8), perceived success (4.30 ± 2.26) and overall effort (4.37 ± 2.0) all attained an average below 5, students ranked their emotional stress (frustration, anger) higher with 5.39 ± 2.8. We found no significant difference among students with previous AR/VR technology experience (n = 7), programming capabilities (n = 7), ownership of a gaming console (n = 16) or regular occupation with computer games (n = 11) for any of the categories of our evaluation tools (SUS, NASA Task load index).
In this randomized, prospective, single-blinded study, we applied and evaluated an interactive, video-based MR system. To our knowledge, we are the first to have conducted a prospective, randomized study in the field of urology investigating MR’s value for teaching practical medical skills. We assessed its efficiency through self-evaluation as well as via an OSCE to enable both subjective and objective means of measuring learning outcome. In addition, students were asked to evaluate the system’s usability and their learning experience. Our results reveal similar outcomes in the self-evaluation and a slightly better learning outcome in the MR group in the OSCE exam. Students’ learning-experience evaluations were also similar between the MR and control groups, receiving a generally positive appraisal. In addition, the MR system’s usability, as evaluated through SUS and NASA Task load index revealed a mid range result for both scores. While several pilot studies have applied this technology as a supporting tool for various surgical applications  including liver surgery , dentistry , tele-surgery  and robotic surgery , randomized studies investigating this technology’s impact are rare. In this context, MR technology has been applied for teaching anatomical structures, and a prospective study demonstrated the non-inferiority of this technology compared to a tablet-based approach . The field of laparoscopic surgery frequently applies MR technologies in simulation training to teach practical skills , and various simulators are commercially available. A recent study also using the Microsoft Hololens demonstrated improved outcome parameters when used as a visual aid during ureteroscopy . An advantage of this technology is its functionality without requiring teaching personnel, making it an efficient means of training independent of the fixed time schedules normally required in a traditional lecture-instructional context. In our study, the AR system provided all the information needed to carry out the task at hand, as well as its theoretical basis. Technical supporting staff was, however, provided, since most students needed initial help utilizing the HMD (Microsoft HoloLens). Once the students were instructed, the system functioned autonomously. To apply this technology in a broader sense, a student can receive initial instructions and then learn a variety of practical skills without needing an instructor, making teaching more flexible and cost-efficient. However, the current shortcomings of the hardware (uncomfortable, heavy HMDs, short battery life, complicated handling) as well as the necessary infrastructure (fast, reliable internet connection, EDV specialists, etc.) still make it difficult to employ MR widely in everyday teaching. In our study, participants often needed help in handling the device during the procedures, and technical problems did occur. While these difficulties were easy to overcome due to the attending technical personnel, it might cause a problem for students learning autonomously. This fact is represented in the overall SUS score of ~ 57, which makes the usability of our system only marginally acceptable . We also believe that the good (but significantly lower) evaluation of the MR teaching experience can be attributed to our system’s usability. This is supported by the identical student self-evaluations. A further advantage of a standardized teaching approach using an MR system is its consistency. While an instructor might vary in emphasizing different aspects or alter the content, a standardized MR teaching tool guarantees the same content for every application. This point is reflected in the significantly better OSCE exam result. In our study, lower ratings were entirely caused by 3 different points on the OSCE checklist, namely that our instructor repeatedly failed to provide this information, or failed to emphasize those aspects. In summary, our results show, that the MR system provides a comparable learning outcome to conventional training, with a higher degree of consistency. However, our study has specific limitations. While providing an interactive MR system, we used a video-based approach without the capability of recognizing objects, hence it does not truly “augment” the environment by providing a system that actually interacts with real objects (=Augmented Reality). The technology applied in our study is therefore categorized as a “Mixed Reality” system  We believe, however, that stronger interaction with the environment will improve teaching outomes and support the our study’s message, namely that this technology is a practical solution for teaching medical procedures, and the hardware currently available enables an acceptable degree of usability. While continuous development is necessary, resources need to be allocated to implement this technology further and spur on technological progress. We did not include a control group employing an alternative technological solution to an HMD to display information (eg, tablet computer). Such solutions may also be feasible and yield acceptable learning outcomes. However, we know of no other technologies that have as much potential as HMDs for advancing the development of interactive teaching systems, with the capacity of both instruments enabling on-demand supervisor feedback and the direct implementation of automatic-assessment tools through object recognition. We also conducted no long-term follow up, since study participants we assured that a poor performance would not affect their grades in their urology rotation. As we therefore decided to use anonymized data, no long-term follow up was possible. Also, we are obliged to mention that our statistical calculations were performed without a method for p-value correction, which may have affected our results evaluating the learning experience. Since our data reveals a generally positive evaluation from both groups, it is safe to assume that the learning experience the MR system provided was evaluated by the MR group at least as positively as the control group had done.
MR is a efficient tool for instructing bladder catheter placement. It is a promising technology for teaching practical medical tasks in general, as it delivers learning outcomes resembling those of a human instructor. Still more developmental progress with MR technology is needed to improve the usability of current MR systems.
Availability of data and materials
The datasets generated and/or analyzed during the current study are not publicly available due to data protection laws but are available from the corresponding author on reasonable request.
Head Mounted Display
National Aerospace Agency
Objective structured clinical examination
Statistical Package for Social Sciences software
System usability scale
Landes CA, Hoefer S, Schuebel F, Ballon A, Teiler A, Tran A, Weber R, Walcher F, Sader R. Long-term prospective teaching effectivity of practical skills training and a first OSCE in cranio maxillofacial surgery for dental students. J Cranio-Maxillofac Surg. 2014;42(5):e97–e104.
Schoeb DS, Brennecke E, Andert A, Grommes J, von Trotha KT, Prescher A, Neumann UP, Binnebosel M. Assessment of a course of realistic surgical training during medical education as a tool for pre-residential surgical training. BMC Med Educ. 2016;16:45.
Motola I, Devine LA, Chung HS, Sullivan JE, Issenberg SB. Simulation in healthcare education: a best evidence practical guide. AMEE guide no. 82. Med Teach. 2013;35(10):e1511–30.
Lateef F. Simulation-based learning: just like the real thing. J Emerg Trauma Shock. 2010;3(4):348.
Davis MC, Can DD, Pindrik J, Rocque BG, Johnston JM. Virtual interactive presence in global surgical education: international collaboration through augmented reality. World Neurosurg. 2016;86:103–11.
Shenai MB, Dillavou M, Shum C, Ross D, Tubbs RS, Shih A, Guthrie BL. Virtual interactive presence and augmented reality (VIPAR) for remote surgical assistance. Neurosurgery. 2011;68(1 Suppl Operative):200–7 discussion 207.
Shenai MB, Tubbs RS, Guthrie BL, Cohen-Gadol AA. Virtual interactive presence for real-time, long-distance surgical collaboration during complex microsurgical procedures. J Neurosurg. 2014;121(2):277–84.
McGaghie WC, Issenberg SB, Cohen MER, Barsuk JH, Wayne DB. Does simulation-based medical education with deliberate practice yield better results than traditional clinical education? A meta-analytic comparative review of the evidence. Acad Med. 2011;86(6):706.
Pawson R, Greenhalgh T, Harvey G, Walshe K. Realist review-a new method of systematic review designed for complex policy interventions. J Health Serv Res Policy. 2005;10(1_suppl):21–34.
Lippincott W. Wilkins: best practices : evidence-based nursing procedures. Philadelphia: Lippincott Williams & Wilkins; 2007.
Kalbitz M, Liener U, Kornmann M, Gebhard F, Huber-Lang M. Studentische Evaluation einer objektiven, strukturierten klinischen Prüfungsmethode (OSCE) im Fach Chirurgie und Orthopädie. Unfallchirurg. 2010;113(9):726–33.
Hart SG, Staveland LE. Development of NASA-TLX (Task Load Index): results of empirical and theoretical research. In: Advances in psychology, vol. 52. North-Holland: Elsevier; 1988. p. 139–83.
Bangor A, Kortum PT, Miller JT. An empirical evaluation of the system usability scale. Int J Hum Comput Interact. 2008;24(6):574–94.
Brooke J. SUS-a quick and dirty usability scale. Usabil Eval Ind. 1996;189(194):4–7.
Veneziano D, Amparore D, Cacciamani G, Porpiglia F. Climbing over the barriers of current imaging technology in urology. Eur Urol. 2020;77(2):142.
Quero G, Lapergola A, Soler L, Shabaz M, Hostettler A, Collins T, Marescaux J, Mutter D, Diana M, Pessaux P. Virtual and augmented reality in oncologic liver surgery. Surg Oncol Clin. 2019;28(1):31–44.
Jiang W, Ma L, Zhang B, Fan Y, Qu X, Zhang X, Liao H. Evaluation of the 3D augmented reality–guided intraoperative positioning of dental implants in edentulous mandibular models. Int J Oral Maxillofac Implants. 2018;33(6):1219–28.
Shenai MB, Dillavou M, Shum C, Ross D, Tubbs RS, Shih A, Guthrie BL. Virtual interactive presence and augmented reality (VIPAR) for remote surgical assistance. Oper Neurosurg. 2011;68(suppl_1):ons200–7.
Qian L, Deguet A, Kazanzides P. ARssist: augmented reality on a head-mounted display for the first assistant in robotic surgery. Healthc Technol Lett. 2018;5(5):194–200.
Moro C, Štromberga Z, Raikos A, Stirling A. The effectiveness of virtual and augmented reality in health sciences and medical anatomy. Anat Sci Educ. 2017;10(6):549–59.
Kamphuis C, Barsom E, Schijven M, Christoph N. Augmented reality in medical education? Perspect Med Educ. 2014;3(4):300–11.
Al Janabi HF, Aydin A, Palaneer S, Macchione N, Al-Jabir A, Khan MS, Dasgupta P, Ahmed K. Effectiveness of the HoloLens mixed-reality headset in minimally invasive surgery: a simulation-based feasibility study. Surg Endosc. 2020;34(3):1143–9.
Bangor A, Kortum P, Miller J. Determining what individual SUS scores mean: adding an adjective rating scale. J Usability Stud. 2009;4(3):114–23.
The study was funded by internal resources of the University of Freiburg Medical Center. Open Access funding enabled and organized by Projekt DEAL.
Ethics approval and consent to participate
The study was approved by our local ethics committee (IRB approved protocol number 23/17; leading ethics committee: Ethik-Kommission der Albert-Ludwigs-Universität Freiburg, Germany) and performed in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments. All participants gave their written consent prior to their inclusion in the study.
Consent for publication
All other authors declare that there is no conflict of interest. Dominik S. Schoeb, Simon Hein, Philippe F. Pohlmann received Funding by the German Federal Ministry of Education and Research (BMBF), not related to the present work.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Standardized questionnaire applied in this study in German language (original version) and English translation.
NASA Task Load Index questionnaire applied in this study in German language (original version) and English version.
System usability scale questionnaire applied in this study in German language (original version) and English version.
OSCE checklist applied in this study in German language (original version) and English version.
General steps for bladder catheter placement as thaught in this study.
About this article
Cite this article
Schoeb, D.S., Schwarz, J., Hein, S. et al. Mixed reality for teaching catheter placement to medical students: a randomized single-blinded, prospective trial. BMC Med Educ 20, 510 (2020). https://doi.org/10.1186/s12909-020-02450-5
- Education, medical
- Simulation training
- Urinary catheters
- Urologic surgical procedures