Effect of remote online exam delivery on student experience and performance in applied knowledge tests

Jaap, Alan; Dewar, Avril; Duncan, Colin; Fairhurst, Karen; Hope, David; Kluth, David

doi:10.1186/s12909-021-02521-1

Research article
Open access
Published: 02 February 2021

Effect of remote online exam delivery on student experience and performance in applied knowledge tests

Alan Jaap¹,
Avril Dewar¹,
Colin Duncan¹,
Karen Fairhurst¹,
David Hope¹ &
…
David Kluth¹

BMC Medical Education volume 21, Article number: 86 (2021) Cite this article

11k Accesses
43 Citations
Metrics details

Abstract

Background

The use of remote online delivery of summative assessments has been underexplored in medical education. Due to the COVID-19 pandemic, all end of year applied knowledge multiple choice question (MCQ) tests at one UK medical school were switched from on campus to remote assessments.

Methods

We conducted an online survey of student experience with remote exam delivery and compared test performance in remote versus invigilated campus-based forms of similar assessments for Year 4 and 5 students across two academic years.

Results

Very few students experienced technical or practical problems in completing their exam remotely. Test anxiety was reduced for some students but increased for others. The majority of students preferred the traditional setting of invigilated exams in a computer lab, feeling this ensured an even playing field for all candidates. Mean score was higher for Year 4 students in the remotely-delivered versus campus-based form of the same exam (76.53% [SD 6.57] vs. 72.81% [6.64]; t_438.38 = 5.94, p = 0.001; d = 0.56), whereas candidate performance was equivalent across both forms for Year 5 students.

Conclusions

Remote online MCQ exam delivery is an effective and generally acceptable approach to summative assessment, and could be used again in future without detriment to students if onsite delivery is not possible.

Peer Review reports

Background

Applied knowledge tests form an important component of assessment in undergraduate medical education. While the use of online technology for multiple choice question (MCQ) exam delivery is well-established [1, 2], most medical schools continue to run summative assessments on campus under invigilated conditions. This is particularly the case for higher stakes assessments such as final exams where the need to ensure test security is a major consideration [1].

The COVID-19 pandemic has forced universities to reassess their strategies for assessment as well as teaching, with most considering adopting some form of remote exam delivery as a means of future proofing against further periods of disruption. Remote exams have traditionally been open, allowing candidates free access to check information using resources during the assessment [1,2,3]. The relative merits of open versus closed approaches to assessment have been underexplored in medical education, with a recent systematic review [3] identifying few contemporary studies. To our knowledge, there has been little research evaluating online exam platform capabilities useful to the remote delivery of summative assessments, such as the ability to present questions in a random order to individual candidates [4].

However, legitimate concerns have been raised about potential inequalities in student experience in relation to remote online assessment [2, 5]. Not all students have access to up-to-date devices, reliable Wi-Fi, or a suitable quiet space for taking a remote exam [4, 5]. Such factors are likely to have a disproportionate impact on students from less-advantaged backgrounds and so have the potential to contribute to differential attainment.

Further evaluation of remote online exams in this context is clearly needed. In Edinburgh, the COVID-19 pandemic resulted in all end of year applied knowledge tests being run remotely for the first time. This provided an opportunity to compare the effects of remote versus invigilated computer lab delivery of similar assessments on students’ exam experience and academic performance.

Methods

Context

The Edinburgh MBChB is a 6 year programme with an integrated curriculum. Students complete a research-focused BMedSci degree in Year 3, with Year 4 being the first predominantly clinical year. Around half of students are Scottish/EU funded, with approximately 30% coming from the rest of the UK, and 20% being non-EU international students. 1n 2016, 8.3% of UK entrants to the programme fulfilled widening participation criteria.

Applied knowledge tests are used throughout the programme. Questions are based on clinical vignettes which candidates have to interpret and then determine the correct answer - such as the most likely diagnosis, next investigation or most appropriate management. The complexity of vignettes increases progressively through the programme. We use a mixture of best-of-five MCQs and very short answer questions. The latter comprise around 10% of exam content and are designed to be automatically marked by computer with a check by two faculty members to ensure all correct answers are identified.

All summative assessments are preceded by formative progress tests. Knowledge assessments are delivered via a commercial online platform (Practique®, Fry IT, UK) and are normally held in a university computer lab under invigilated conditions. Each exam is constructed to a predefined two dimensional blueprint based on clinical specialties and skills. Items are standard set by expert group using a modified Angoff approach, with previous performance data including Rasch modelling being taken into consideration for established items. Exams are time-limited based on a nominal 90 s per item (with up to 25% additional time for candidates with a learning adjustment).

All students had previous experience of using the exam system remotely for open-book formative assessments completed without strict time limits. The exam software is not dependent on continuous Wi-Fi connectivity, with the exam being downloaded onto devices at the start and candidate responses synching to a cloud-based server when Wi-Fi is available.

Impact of COVID-19 on assessment

Assessments were completed as normal until the end of February 2020. At the onset of lockdown in March in the UK many of our students returned to their family homes. We decided to continue with all remaining applied knowledge tests as non-invigilated remote exams using the time limits originally planned in order for progression decisions to be made. Students were advised to try and complete the exam as normal but that they could make strategic use of resources if necessary. Candidates sat their exam synchronously. We had students in time zones from + 7 to − 8 h BST, so chose a start time of 2 pm BST for each exam. An academic conduct statement and own work declaration had to be read and agreed on the initial screen in order to enter the exam. Items were presented in a random order to each candidate to reduce opportunities for collaboration.

This study includes Year 4 and Year 5 students. All Year 4 students completed the applied knowledge test (160 items) at the end of the second semester, whereas in Year 5 the cohort is split in half, alternating placements and related assessments (total of 170 items) between semesters. All assessments had satisfactory internal consistency (Crohnbach’s alpha around 0.85).

Survey

We conducted an anonymous online survey to explore student experience of remote delivery in terms of exam accessibility and setting, test anxiety and preference. We chose to focus the survey on Year 5 students given their greater experience with online exams, and the fact they formed a natural crossover group in terms of remote versus invigilated on campus completion of the two components of their exam.

Following release of exam results (around 2 weeks after sitting the assessment), we sent all Year 5 students an invite to complete the survey. Participation was voluntary. Survey items are summarised in Table 1. We determined the frequency of categorical responses and identified emergent themes in free text comments.

Table 1 Remote exam experience survey items

Full size table

Performance

We compared candidate performance between remotely delivered assessments and the similar invigilated campus-based form of the exam from the previous academic year. Exam results are expressed using our institutional common marking scheme where raw scores are converted to a percentage and scaled, with the cut score equating to 60%.

Differences between groups were assessed using Welch’s modified two-sample t-test. The results were sufficiently normally distributed to make this a feasible choice. The analysis was adequately powered to detect a medium effect size (Cohen’s d = 0.3, with α = 0.05) [6]. All statistical analyses were performed in R [7].

Ethics approval

The study was approved by the institutional medical education research ethics committee and informed consent obtained from all survey participants.

Results

All candidates (n = 447) were able to complete their remote exam as planned, with 67 students (15.0%) accessing their assessment from outside the UK. Only 10 students (2.2%) contacted administrative staff about technical issues during their exam - all of which were minor and able to be resolved in real time - and these students were given extra time to compensate (maximum required was 10 min). One student made prior arrangements to sit the exam on campus using their own device in view of known issues with their home Wi-Fi. In terms of device choice, almost all students used a laptop computer.

Student experience

The survey was completed by 119 Year 5 students (56.4% response rate).

Setting and accessibility

All students indicated they had access to a suitable laptop or tablet for the remote exam, with this being a shared or borrowed device for 15 students (12.6%). Twenty-two students (18.5%) identified difficulties finding a quiet space for sitting the exam. Free text comments suggested the main issue was ambient noise from family, flatmates or neighbours during the exam. 10 students (8.4%) noted Wi-Fi issues during their exam. It is unclear whether these are the same students that contacted an administrator during the exam. Functionality of exam software on students’ own devices appeared to be at least as good as on the desktop pcs in the computer lab (Fig. 1). However, a few students felt images were more difficult to interpret with a smaller screen size.

Test anxiety

Opinion was split regarding anxiety in anticipation of sitting the remote exam with 61 students (51.3%) reporting feeling more anxious beforehand. There was a similar spread in anxiety levels experienced during the exam (Fig. 2). A number of students felt more relaxed sitting the exam remotely while a similar number felt more anxious, mainly due to concerns about technical issues like Wi-Fi connectivity (Table 2).

Table 2 Illustrative student comments relating to test anxiety

Full size table

Preference

Overall, the majority of students preferred sitting summative MCQ exams under invigilated conditions in the computer lab (Fig. 3), citing familiarity, positive associations, and standardisation as the main reasons for this (Table 2). Additionally, some students clearly felt uncomfortable with the blurring of boundaries caused by sitting a higher stakes exam in their own personal space. The issue of perceived fairness was mentioned frequently in free text comments, both in relation to accessibility and to the fact that those who had access to help from near-peers or medical friends and family during the exam might use this to gain an unfair advantage.

Exam performance

Results are summarised in Table 3. Year 4 students performed better in the remotely-delivered compared to campus-based form of the same exam, with a medium effect size (d = 0.56) [6], whereas candidate performance was equivalent for Year 5 students. Variance across the two forms of each exam was similar in both year groups. Scores in both Year 5 semester exams were highly correlated (r = 0.54 [CI 0.44–0.63], p < 0.0001]), similar to the previous academic year when both were invigilated exams (r = 0.58 [CI 0.49–0.66], p < 0.0001).

Table 3 Comparison of remote vs. computer lab delivery of online applied knowledge tests

Full size table

Discussion

Student experience of remote online summative MCQ exam delivery during the COVID-19 pandemic was generally positive, with few students experiencing technical or practical problems. We did not find any evidence of negative effects on candidate performance.

Test anxiety

Test anxiety was an important consideration given the context of these exams taking place during a global pandemic. Undoubtedly lockdown had unpredictable effects on individual students’ learning, revision and mental wellbeing as they adapted to a new normal [8], and this may have contributed to the heightened anxiety experienced by some students. It is interesting that the higher stakes nature of sitting a summative assessment seems to have resulted in many of our students worrying about access issues, particularly initial log in to the exam, despite their familiarity with using the system remotely. The literature suggests less test anxiety with remote delivery [1, 3], and we confirmed that this remained the case for a number of students where individual personality traits, learning styles or relief from negative associations with exam hall settings seem to have outweighed potential anxiety generated by the lockdown situation.

Test anxiety has been usefully conceptualised using distractional theories, in particular attentional control theory [9]. Addressing a task requires working memory to focus attention on several pieces of relevant information while inhibiting irrelevant information. According to attentional control theory, anxiety disrupts the balance between two competing attentional systems, with the one influenced by salient stimuli becoming stronger at the expense of the goal-directed system [9]. As working memory has a limited capacity, the addition of stress therefore reduces a candidate’s ability to use relevant information during a test resulting in underperformance. However, despite perceptions of increased or decreased test anxiety in our study, there was no obvious indication that any candidates actually did better or worse than predicted from previous performance.

Test security

Although concerns have been expressed about increased opportunities for cheating during remote assessments, there is lack of objective evidence that this is more widespread than in invigilated campus-based exams [10]. We did not use remote proctoring but, in accordance with the argument put forward by Fuller et al. [5], invested trust in our students to behave professionally during remote exams, reminding them of appropriate conduct immediately before the start of their test. The ability to present items in a random order to individual candidates and the time-limited nature of the tests were felt to be adequate countermeasures against attempted collaboration. We did not take any steps to formally verify candidate identity for these assessments, e.g. by webcam, due to cohort sizes, simply sending the specific exam entry pass code to students’ university email accounts immediately before the exam started. As previously discussed, there were no candidates whose actual performance was unexpectedly greater than predicted from their previous performance in formative progress tests.

Test performance

The remote online assessments were open-book by default and arguably the ability to quickly access and appraise relevant information to support clinical decision making is now a vital part of modern medical practice, underpinning professional values and patient safety. As such, testing candidates’ ability to do this by allowing access to resources during time-limited exams enhances their validity. This approach is already established in must-pass examinations such as the UK Prescribing Safety Assessment [11], where candidates have access to an online reference drug formulary.

We found that remote delivery aided candidate performance in the Year 4 exam, but not in Year 5. This was most likely due to Year 4 questions being less complex and covering more basic diagnosis, investigation and management of core and common medical conditions and therefore easier for candidates to check information to help them reach the correct answers. Our results contrast with the findings of Sam et al. who recently published a brief report of their experiences using an almost identical approach to exam delivery during the pandemic, but found no change in performance in any year group [12].

Item leakage

Finally, the resource implications of running summative exams remotely have to be acknowledged. All items used have effectively been released into the public domain precluding their inclusion in future summative assessments. This approach therefore has significant costs in terms of replenishing exam item banks.

Limitations

This was a naturalistic inquiry rather than pre-designed cohort study, comparing results from similar, but non-identical, assessments and across different time periods and therefore caution has to be taken in interpreting the results.

The generalisability of our findings is limited by this being a single institution study. In particular, we have a relatively low proportion of widening participation students who may be more likely to be disadvantaged by the remote approach.

While our results are encouraging it is difficult to clearly separate any effects arising specifically from lockdown.

Conclusions

In the context of the COVID-19 pandemic, remote delivery of summative online applied knowledge tests was an effective and generally acceptable option for all our students with no evidence of detriment to candidate performance. As such we would be happy to use this approach again in future if necessary.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

MCQ:: Multiple choice question

References

Appiah M, Van Tonder F. E-assessment in higher education: a review. Int J Bus Manag Econ Res. 2018;9(6):1454–60.
Google Scholar
Timmis S, Broadfoot P, Sutherland R, Oldfield A. Rethinking assessment in a digital age. Br Educ Res J. 2016;42(3):454–76.
Article Google Scholar
Durning SJ, Dong T, Ratcliffe T, Schuwirth L, Artino AR Jr, Boulet JR, Eva K. Comparing open-book and closed-book examinations: a systematic review. Acad Med. 2016;91(4):583–99.
Article Google Scholar
Alruwais N, Wills G, Wald M. Advantages and challenges of using E-assessment. Int J Inf Educ Technol. 2018;8(1):34–7.
Google Scholar
Fuller R, Joynes V, Cooper J, Boursicot K, Roberts T. Could COVID-19 be our ‘there is no alternative’ (TINA) opportunity to enhance assessment? Med Teach. 2020. https://doi.org/10.1080/0142159X.2020.1779206.
Cohen J. A power primer. Psychol Bull. 1992;112(1):155–9.
Article Google Scholar
Ihaka R, Gentleman R. R: a language for data analysis and graphics. J Comput Graph Stat. 1996;5:299–314.
Google Scholar
Jervis CG, Brown LR. The prospects of sitting ‘end of year’ open book exams in the light of COVID-19. A medical student’s perspective. Med Teach. 2020. https://doi.org/10.1080/0142159X.2020.1766668.
Derekshan N, Eysenck MW. Anxiety, processing efficiency, and cognitive performance. Eur Psychol. 2009;14(2):168–76.
Article Google Scholar
Beck V. Testing a model to predict online cheating – much ado about nothing. Act Learn High Educ. 2014;15(1):65–75.
Article Google Scholar
BPS and MSC Assessment. UK Prescribing Safety Assessment. https://prescribingsafetyassessment.ac.uk/. Accessed 18 Nov 2020.
Sam AH, Reid MD, Amin A. High-stakes, remote-access, open-book examinations. Med Educ. 2020;54(8):767–8.
Article Google Scholar

Download references

Acknowledgements

None.

Funding

None.

Author information

Authors and Affiliations

Medical Education, Edinburgh Medical School, University of Edinburgh, Edinburgh, UK
Alan Jaap, Avril Dewar, Colin Duncan, Karen Fairhurst, David Hope & David Kluth

Authors

Alan Jaap
View author publications
You can also search for this author in PubMed Google Scholar
Avril Dewar
View author publications
You can also search for this author in PubMed Google Scholar
Colin Duncan
View author publications
You can also search for this author in PubMed Google Scholar
Karen Fairhurst
View author publications
You can also search for this author in PubMed Google Scholar
David Hope
View author publications
You can also search for this author in PubMed Google Scholar
David Kluth
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

AJ, CD, KF and DK conceptualised the study. AJ, AD and DH gathered and analysed data. AJ, CD, KF, DH and DK interpreted the results. AJ drafted the manuscript with substantial contribution from DH. All authors substantially edited drafts. All authors read and approved the final manuscript.

Authors’ information

AJ is Deputy Director of Teaching, Assessment Lead, and an Honorary Reader in Medical Education at Edinburgh Medical School, and a Consultant Physician in Diabetes, NHS Lothian.

AD is a Fellow in Medical Education (Psychometrics and Assessment) at Edinburgh Medical School.

CD is Year 5 Director and a Professor of Reproductive Medicine and Science at Edinburgh Medical School, and an Honorary Consultant Gynaecologist, NHS Lothian.

KF is Year 4 Director and a Senior Lecturer in General Practice at Edinburgh Medical School, and a General Practitioner, NHS Lothian.

DH is a Senior Lecturer in Medical Education and Psychometrician at Edinburgh Medical School.

DK, is Dean of Undergraduate Medicine and a Professor of Medical Education at Edinburgh Medical School, and an Honorary Consultant Renal Physician, NHS Lothian.

Corresponding author

Correspondence to Alan Jaap.

Ethics declarations

Ethics approval and consent to participate

The study was approved by Edinburgh Medical School Medical Education Research Ethics Committee (reference: 2020/12) and performed in accordance with the Declaration of Helsinki. Informed consent was given by all aprticipants.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Jaap, A., Dewar, A., Duncan, C. et al. Effect of remote online exam delivery on student experience and performance in applied knowledge tests. BMC Med Educ 21, 86 (2021). https://doi.org/10.1186/s12909-021-02521-1

Download citation

Received: 18 November 2020
Accepted: 14 December 2020
Published: 02 February 2021
DOI: https://doi.org/10.1186/s12909-021-02521-1

Effect of remote online exam delivery on student experience and performance in applied knowledge tests

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Context

Impact of COVID-19 on assessment

Survey

Performance

Ethics approval

Results

Student experience

Setting and accessibility

Test anxiety

Preference

Exam performance

Discussion

Test anxiety

Test security

Test performance

Item leakage

Limitations

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Authors’ information

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Education

Contact us