Skip to main content

Undergraduates’ workplace learning in health sciences education: psychometric properties of single-item measures

Abstract

Background

Undergraduates’ workplace learning is an important part of health sciences education. Educational psychology research considers many different aspects of self-regulated learning at the workplace, including cognition, motivation, emotions, and context. Multivariate longitudinal and diary studies in this field require fewer items than alternatives or even a single item per construct and can reveal the sub-processes of workplace learning and contribute to a better understanding of students’ learning. Short instruments are necessary for application in workplace settings, especially stressful ones, to mitigate survey fatigue. The present study aimed to assess the psychometric properties of single items measuring various aspects of workplace learning.

Methods

Twenty-nine single items selected from the Workplace Learning Inventory in Health Sciences Education were analyzed for reliability, information reproduction, and relationships within the nomological network. The authors additionally analyzed four generally formulated single items’ relationships with the full Workplace Learning Inventory scales and external criteria within the nomological network. Participants were 214 ninth- or tenth-semester veterinary medicine students in Austria and Germany who were learning at varied workplaces during the winter semester of 2021/2022.

Results

Of the 29 single items selected from existing scales, 27 showed sufficient reliability, but mixed results were obtained regarding validity. Although the items’ relationships within the nomological network were similar to those of the full scales, information reproduction was insufficient for most items. The four general single items showed acceptable validity, but the reliability of these measures of states could not be assessed.

Conclusions

This paper reported findings on the psychometric properties of single items for undergraduates’ workplace learning in health science education. The findings are crucial for deciding whether to use scales versus single-item measures in future studies. By applying the findings, researchers can be more economical in their workplace learning data collection and can include more constructs.

Peer Review reports

Introduction

Health sciences education has long focused on the science of teaching, but in recent years we have seen a shift toward the science of learning [1, 2]. This study focused on undergraduates’ workplace learning in health sciences education [3,4,5] from an educational psychology perspective. Psychological theories, especially the theory of self-regulated learning (SRL), focus on the individual learner and view learning as a process in which cognitive, motivational, emotional, and contextual aspects are considered [6,7,8]. To gain a better understanding of undergraduates’ learning processes at the workplace, multivariate and longitudinal studies are needed. Such studies require fewer items per construct, which helps avoid survey fatigue and increases applicability in workplace settings, especially stressful ones.

We assessed the psychometric properties of single-item measures of constructs related to self-regulated learning at the workplace and this paper discusses the items’ role in health sciences education research. We selected some single items from the Workplace Learning Inventory in Health Sciences Education (WLI) [9] scales and specifically developed others using more general wording.

Self-regulated learning in the workplace

The study was based on SRL research [10, 11]. In health sciences education, the most prominent SRL theory is a process-based model, namely Zimmerman’s cyclical phases model, which differentiates between the forethought, performance, and reflection phases [12, 13]. Besides process-based models, there are also component-based models that integrate different areas (e.g., Pintrich’s conceptual framework for assessing motivation and SRL) [14] and levels (e.g., Boekaerts’ six-component model of SRL) [15]. The present study adopted Steinberg et al.‘s [9] component-based model as its conceptual framework. The model integrates four areas, namely cognition, motivation, emotion, and context, at two levels, namely the learning process level and the metalevel, resulting in a total of eight components (see Fig. 1).

Fig. 1
figure 1

The four areas of workplace SRL at the learning process level (inner components) and the metalevel (outer components), based on Steinberg et al. [9]

Cognition refers to learning strategies focused on workplace learning [16]. Motivation means instigating and sustaining goal-directed activity [17]. Emotions are defined within the broader concept of affect but are distinguished from other affective phenomena, such as moods, in that emotions are more intense, have a clearer object focus and a more salient cause, and are typically experienced for a shorter duration [18,19,20]. Context means undergraduate medical students’ perceptions of multiple dimensions of the educational environment in the clinical practice setting [21]. The metalevel of cognition, motivation, emotion, and context means regulating those respective aspects of the learning process [14, 22,23,24,25]. For more details on the model, we refer to Steinberg et al. [9].

Steinberg et al. identified aspects relevant to the eight components of undergraduates’ workplace learning and developed corresponding scales, resulting in the WLI [9], which provides 31 scales, each comprising three to six items. Researchers investigating workplace learning can select the scales that are relevant to their research questions. Table 1 lists the constructs with their corresponding definitions.

Van Houten-Schat et al. [13] and Roth et al. [26] reviewed SRL research and respectively identified the need to investigate SRL sub-processes in the workplace to gain a better understanding of the interplay of the different SRL aspects, as well as the need to use more diverse methodologies in SRL research, including multivariate longitudinal and diary studies. Single-item measures of the WLI constructs could facilitate such studies.

Table 1 WLI constructs and definitions (based on Steinberg et al.) [9]

Single-item measures

We summarize the discussion on the advantages and disadvantages of using single items in scientific studies, based on overviews provided in the literature [27,28,29]. Arguments in favor of the use of single items include parsimony, which is relevant in holistic studies considering the large number of theoretical constructs, as well as in diary studies with many measurement points and in time-limited settings such as data collection in the workplace. Parsimony is also associated with increased participant motivation and cognitive involvement, resulting in fewer missing values and higher validity. Moreover, parsimony addresses researchers’ ethical commitment to participants; that is, researchers strive not to overburden participants and to avoid their confusion and frustration when answering similar items. Other arguments in favor of the use of single items are their lower ambiguity, better interpretability, higher face validity, and reduced risk of criterion contamination.

Arguments against the use of single items include their lower or unknown reliability, their inability to adequately capture complex psychological constructs, and the less fine-grained distinctions between individuals. Hence, single items are usually acceptable when the construct is concrete, unidimensional, clearly defined, narrow in scope, and used as a moderator or control variable or when the desired precision is low [27, 28, 30]. Fisher et al. summarized successful examples of single items used in organizational psychology [28].

If there is uncertainty about whether a construct meets the above requirements, validation tests can be performed to ensure trustworthiness [30]. The appropriate validation method depends on how the item was developed, that is, whether it was selected from an existing scale or developed anew [28]. For items selected from a scale, Gogol et al. [29] have provided the following best practice for examining the psychometric properties: [27] assessing the reliability, information reproduction, and relationships within the nomological network. For newly developed single items measuring stable characteristics (or traits), the recommendation is to assess the test–retest reliability [28]; however, this method is inappropriate for single items measuring states that are expected to change over time, as in longitudinal studies [31]. To provide evidence of the validity of newly developed single items, assessing relationships within the nomological network is recommended.

Aim

Our study aimed to examine two sets of single items appropriate for research on undergraduate health science students’ learning by analyzing their reliability, their correspondence to the full scale, and their relations with external criteria. These sets of single items could be helpful for economically conducting multivariate longitudinal and intensive longitudinal studies in health workplace settings. First, we investigated 29 single items selected from the WLI [9]. The items address four areas of workplace learning, namely cognition, motivation, emotion, and context, at two levels, namely the learning process level and the metalevel. Each of the eight components is represented by several items, with the exception of emotion on the learning process level since Duffy et al. [20] have already provided single items for that. We systematically compared the single items with their corresponding full scales with respect to the following measurement questions [29]: (1) How reliable are single-item measures? (2) How well do single-item measures reproduce the information that the full scales obtain? (3) How well do single-item measures reproduce the relationships with external criteria in the nomological network that the full scales obtain?

Second, we examined four newly developed and more generally formulated single items measuring states rather than traits [32]. The items represent cognition, motivation, emotion, and context at the learning process level. Although their reliability cannot be tested, we examined the items’ validity with respect to the following measurement questions: (1) How well do single-item measures correlate with their respective full WLI scales? (2) How well do single-item measures relate to external criteria within the nomological network?

Methods

Participants

The outcomes should represent a diverse population of undergraduate health sciences students in the aspects of cognition, motivation, emotion, and learning environment. Consequently, we made a deliberate effort to encompass the majority of a pertinent student cohort from a single institution rather than distributing a questionnaire to students at different institutions, which could have led to a biased sample predominantly comprising highly motivated high achievers. We invited students from a second institution to participate in achieving the predetermined target sample size of n = 200 in adherence to a rule of thumb guideline for the minimum sample size for confirmatory factor analysis [33].

Participants were from two higher education institutions in Austria and Germany. At Institution 1, the target group comprised 200 students enrolled in a Clinical Rotation course as part of a veterinary degree program in which students learn in a clinical practical setting over a relatively long period for the first time. Students take this course in their ninth semester and rotate among highly varied workplace settings (rotations include, e.g., anesthesia/imaging diagnostics, surgery, gynecology, internal medicine, emergency department, reproduction). Although all 200 enrollees participated in the study, 13 students did not consent to their data being used for research purposes, and 11 consenting participants had to be excluded from further analysis owing to a high proportion of missing values (> 50%); the final sample size was n = 176 at Institution 1.

At Institution 2, the target group comprised about 260 students in their practical year of a veterinary degree program. Students usually complete their practical year during the ninth and tenth semesters and familiarize themselves with various workplaces. The questionnaire was opened 91 times; thereof 38 participants completed more than 50% of the items and consented to their data being used for research purposes. Combining both samples, the total sample size was n = 214 (78% [167] female[s], 21% [45] male[s], 1% [2] diverse; age: 21–41 years; M = 24.79, SD = 2.74). There were no statistically significant differences in gender (female: 77.90% [167] and 79.37% [170]) and age (M = 24.76, SD = 2.67 and M = 24.56, SD = 3.99) between Institutions 1 and 2, respectively.

Measures

We tested the psychometric properties of two sets of single items. First, the project team selected 29 single items from the WLI’s 29 full scales [9]. The project team comprised one professor, three senior scientists, two clinical teachers, and two students, all working in health science education and/or educational psychology. Single items were selected based on content and factor loadings (using the data of the study at hand). We preferred items couched in broader terms and considered face validity according to the project team’s ratings as well as those of nine researchers in the field of health sciences education and/or SRL who were not part of the project team. Furthermore, we chose items with high factor loadings (see Steinberg et al. [9] for details on factor loadings). Second, we tested the psychometric properties for four generally formulated single items representing cognition, motivation, emotion, and context on the learning process level. The project team developed these items using established instruments/scales and experiences with the SRL questionnaire and diary items, as well as theoretical assumptions. Table 2 provides an overview of the items.

Table 2 Overview of single-items

To validate the single items, we used the WLI, as well as measures of external criteria within the nomological network, which have also been used to validate the WLI [34,35,36,37,38,39,40,41,42]. A nomological network is a system of related constructs [43]. We excluded external criteria for emotions at the learning process level and context at the metalevel because none were available in German. Table 3 provides an overview of the measures.

Table 3 Overview of measures

Procedure

At Institution 1, the students completed the questionnaires as part of the course, as it was an exercise that supported the course’s learning goal of “reflecting on one’s own learning and practice.” Due to the large number of items, we spread data collection over a week, and students completed the questionnaires in the period December 6–10, 2021, or December 13–17, 2021, using the online survey tool unipark© [44]. Each item was answered once. Every morning during the survey period, the participants received an email with an invitation link to the questionnaire and were encouraged to complete it in the workplace.

At Institution 2, the rectorate invited all students in their practical year to participate in the study via an email with a link to the online questionnaire. Students were allowed to pause at any time and continue completing the questionnaire later within the aforementioned period using the online survey tool unipark© [44]. To improve the response rate at Institution 2, participants who completed the questionnaire were entered into a raffle to win a €50 voucher.

To avoid survey fatigue, the following steps were taken in addition to spreading the data collection over a week: Students were provided with targeted information about the study’s aims and benefits; teachers gave students time to complete the survey at their workplace; and students received individual feedback on their results, with tips and tricks for further developing their SRL skills.

Data analysis

We analyzed the first set of single items, which were selected from existing scales, according to Gogol et al.’s [29] recommendations for single-item measures in psychological science [27]. Accordingly, we assessed (1) the items’ reliability by computing the coefficient ω reflecting the proportion of item variance accounted for by the latent construct (Note that the items‘ reliability is the square of the standardized factor loading, see Brown (, p.115) [45], (2) the amount of reproduced information by computing the product–moment correlation between the scores obtained by the full scales and the scores for every single item, while accounting for the overlapping error variance, [46] and (3) relationships within the nomological network by computing product–moment correlations between the single-item measures and measures of external criteria within the nomological network. Similarly, we examined the second set of single items, which were generally formulated, by assessing their relationships with the full WLI scales and their relationships within the nomological network. These items’ reliability could not be assessed because they were not derived from a scale; their ω could not be calculated, and test–retest reliability is inappropriate for measures of states. Note that there is no clear cut-off separating good and poor reliability, but it has been suggested that 0.70 is an acceptable lower bound [47], with values between 0.65 and 0.70 considered minimally acceptable [48]. For correlations, r = .10 is considered as small, r = .30 as medium and r = .50 as large [49]. Analyses were conducted in Mplus 8.6 (Muthén and Muthén, Los Angeles, California) and R 4.3.1 (R Core Team, Vienna, Austria) [50, 51]. All analyses are based on the significance level \(\:\alpha\:\) = 0.05.

Results

Table 4 provides an overview of the detailed results, after which we summarized the results for the 29 single items selected from the full WLI [9] scales, as well as those for the four generally formulated single items.

Table 4 Psychometric characteristic of the full-scale (FS) and single-item (SI), including reliability, correlation with full scale (information reproduction) and correlation with external criteria (nomological network)

Single items selected from full scales

The following paragraphs summarize the results for the 29 single items selected from full scales. Twenty items showed acceptable reliability (ω > 0.70), seven showed minimally acceptable reliability (ω = 0.65 to ω = 0.70), and two showed unacceptable reliability (ω < 0.65) [48]. Of the 29 reliability values for the single items, eight differed significantly from the corresponding full-scale value but showed minimally acceptable to relatively high values (ranging from ω = 0.67 to ω = 0.83).

Regarding information reproduction, the single items showed low to substantial correlations (corrected for shared error variance) [34] with the corresponding full scales, with r ranging from 0.20 for reviewing to 0.79 for monitoring on the emotion metalevel. Of the 29 correlations, 27 values were below 0.70, indicating insufficient information reproduction (we considered less than 50% information reproduction to be insufficient; information reproduction expressed as a percentage is the square of the correlation values); two values were above 0.70, indicating substantial information reproduction.

Regarding the relationships of the selected single items within the nomological network, the items showed patterns that were similar to those of the full scales in terms of their correlations with the external criteria, but the correlations were significant less often. A similar pattern was reflected in the small mean absolute differences between the correlations obtained for the full scales and single items (between − 0.03 for all cognition metalevel aspects and 0.09 for all contextual aspects). The respective differences in the correlations ranged from − 0.25 to 0.28, but only 7 of the 124 correlations between the single items and the external criteria differed significantly from the correlations between the corresponding full scales and these external variables. See Table 5 for an overview of the results.

Generally formulated single items

The following paragraphs summarize the results for the four general single items shown in Table 4 (cognition general, motivation general, emotion general and context general). The correlations between the generally formulated single items (see lines “correlation with … general”) and their respective full scales (see columns “FS”) ranged between 0.16 and 0.51 for general cognition, − 0.51 and 0.67 for general motivation, − 0.54 and 0.66 for general emotion and 0.21 and 0.74 for general context.

Regarding the general single items’ (see column “… general”) relationships within the nomological network (see lines below “correlation with external criteria”), the items showed low to substantial correlations with external criteria. Analyses showed significant correlations between the general cognition single item and the external criterion elaboration (r = .22), single-item general motivation and the external criterion learning goal approach, and single-item general context and the external criteria perception of teachers (r = .61) and perception of atmosphere (r = .68). See Table 5 for an overview of the results.

Table 5 Overview of results

Discussion

In this study, we analyzed the psychometric properties of single items measuring different aspects of undergraduate health sciences students’ self-regulated learning (SRL) in the workplace. First, we assessed the psychometric properties of 29 single items selected from full WLI scales [29], of which 27 items showed sufficient reliability; however, the results regarding validity were heterogeneous. Second, we assessed the psychometric properties of four generally formulated single items [28], which showed acceptable validity, although their reliability could not be assessed. Consequently, this study provides evidence to inform decision-making regarding whether to use single-item measures rather than full scales when investigating the various aspects of workplace learning.

Single items’ psychometric properties

Reliability was acceptable for most of the 29 single items selected from the WLI. The broad range of reliability results is in line with Gogol et al. [29], who found low-reliability values for different types of academic anxiety and low- to acceptable-reliability values for different types of self-concept. If higher reliability is desired, single items can be used as daily measures aggregated to weekly measures in diary studies.

Regarding the items’ validity in terms of information reproduction, 27 of the 29 selected single items showed limited validity. This result aligns with Gogol et al. [29], who also found low correlations of their anxiety single items with the corresponding full scales (but acceptable correlations of their self-concept single items with the corresponding full scales). The possible reasons for low information reproduction are manifold [52]. For example, the constructs might be too complex [27, 52], and the items may not be representative [28, 29]. That could be the case for the single-item of the construct ‘reviewing’, whose information reproduction was particularly low. Additionally, the response format might not have enough categories and might, therefore, lack sufficient sensitivity [27]. We recommend cautious interpretation of such items, particularly if the items used do not represent the construct, as defined above. For example, although planning, as defined in Table 1, includes both anticipation and planning, the corresponding single item only addresses anticipation, necessitating a narrow interpretation using the single item’s wording.

The 29 single items selected from the WLI showed acceptable validity in terms of similar relationships with external criteria of the nomological network compared to the full scales. Similarly, the absolute differences between the correlations obtained for the full scales and those for the single items were small. The range of mean absolute differences is similar to Gogol et al.’s [29].

For the four newly developed single items, we analyzed how well they correlated with their corresponding WLI full scales and with the external criteria within the nomological network. In summary, the correlations were as expected. For example, the correlations of ‘cognition general’ or ‘motivation general’ with their respective full scales were significant while the correlation with external criteria were not always. This is plausible as the respective full scales were developed for the workplace setting while the scales that measured external criteria were developed for the classroom setting. In contrast, the correlations of ‘context general’ with both respective full scales and external criteria were significant as both were developed for the workplace setting. This study’s results should be used to interpret future studies’ results derived from the newly developed single-item measures. For example, the generally formulated single item “I am motivated today” showed a high correlation with situational interest and the mastery goal approach but a low correlation with the expectancy of success. Hence, the item represents the value rather than the expectancy component of motivation.

For constructs where the single items’ psychometric properties are insufficient, we recommend the use of full scales, such as for ‘reviewing’ and ‘control cognition’ which have low reliability or for further constructs if it is important to include multiple facets of the construct. However, researchers often need to balance the number of constructs measured with ethical standards to avoid overburdening participants, as well as to obtain complete and valid data. Several scenarios might justify the use of single items with limited information reproduction: (1) when the single item represents a control or moderator variable [30], (2) when a narrower definition of the construct is justified and the item represents the study’s aspect of interest [27], or (3) when the measure’s desired precision is low [28, 30].

Strengths, limitations, and implications

Our study’s strength is its rigorous methodology to test the psychometric properties of 33 single items [28, 29]. Furthermore, our data represent students’ heterogeneity in terms of cognition, motivation, and emotions, as we collected the data from an almost full cohort of students at one institution. The data also represent heterogeneous learning environments, as the students were in very different workplace settings.

A limitation is that our respondents were from two institutions only. This was necessary because we aimed to collect high-quality data using rigorous implementation management, such as ensuring support from all stakeholders and adequate time for the respondents to complete the questionnaires at the workplace. Furthermore, although preliminary analysis showed measurement invariance regarding gender, this result is limited due to the limited number of male participants. Future studies need sufficient participants to test for measurement invariance. Another limitation is the self-report aspect of our measurement instrument, as the reported information can sometimes differ from actual lived experiences [53, 54]. This study was an important first step in assessing the psychometric properties of single items for measuring different aspects of health sciences undergraduates’ SRL at the workplace. Further research should validate the items using alternative measures for comparison. Furthermore, items with different wording should be tested to improve the quality of the single items [28].

The scientific implication of our study lies in its provision of evidence to inform decision-making regarding whether to use scales versus single-item measures to investigate undergraduates’ workplace learning in health sciences education. This study took a very differentiated view of SRL by providing items on its various aspects. Such a differentiated view with corresponding single items enables future researchers to investigate SRL sub-processes. However, the use of single items also makes it possible to consider several aspects of SRL simultaneously and thus take a more holistic view of SRL. This allows researchers to be more economical in their data collection and to include more constructs. It also supports multivariate longitudinal and diary studies of workplace learning.

The research’s practical implication is its contribution to building scientific evidence that can serve as a foundation for developing interventions to enhance workplace learning. The single items can be used for screenings to further probe cognition, motivation, emotions, and contexts in workplace learning within a particular cohort toward the evaluation of different workplace learning curricula. The items can also be used for learning analytics.

Conclusion

The present study has enhanced knowledge of the psychometric properties of single items measuring different aspects of undergraduates’ self-regulated learning (SRL) at the workplace. Most single-items showed acceptable reliability but the results regarding validity were mixed. While the single-items reproduced the relationships with external criteria in the nomological network that the full scales obtain, most single-items insufficiently reproduced the information that the full scales obtained. The results provide evidence for health sciences education researchers to decide between using full scales and single items. The present study supports further investigation of health sciences undergraduates’ SRL at the workplace.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

Abbreviations

WLI:

Workplace learning inventory in health sciences education

SRL:

Self-regulated learning

References

  1. Mayer RE. Applying the science of learning to medical education. Med Educ. 2010;44(6):543–9. https://doi.org/10.1111/j.1365-2923.2010.03624.x

    Article  Google Scholar 

  2. Schumacher DJ, Englander R, Carraccio C. Developing the master learner: applying learning theory to the learner, the teacher, and the learning environment. Acad Med. 2013;88(11):1635–45. https://doi.org/10.1097/ACM.0b013e3182a6e8f8

    Article  Google Scholar 

  3. Morris C, Behrens M. Work-based learning. In: Walsh K, editor. Oxford textbook of medical education. 1 ed. Oxford [u.a.]: Oxford Univ. Press; 2013. pp. 209–20.

    Google Scholar 

  4. Westerman M, Teunissen PW. Transitions in medical education. In: Walsh K, editor. Oxford textbook of medical education. 1 ed. Oxford [u.a.]: Oxford Univ. Press; 2013. pp. 372–82.

    Google Scholar 

  5. Hager P. Theories of workplace learning. In: Malloch M, Cairns L, Evans K, editors. The sage handbook of workplace learning. This paperback edition first published. Los Angeles: Sage; 2013. pp. 17–31.

    Google Scholar 

  6. Dai DY, Sternberg RJ. Motivation, emotion, and cognition: integrative perspectives on intellectual functioning and development. Mahwah, N.J: Lawrence Erlbaum Associates; 2004.

    Book  Google Scholar 

  7. Winne PH, Nesbit JC. The psychology of academic achievement. Annu Rev Psychol. 2010;61:653–78. https://doi.org/10.1146/annurev.psych.093008.100348

    Article  Google Scholar 

  8. Harris KR. APA educational psychology handbook, volume 2: individual differences and cultural and contextual factors. 1 ed. Washington, D.C.: American Psychological Association; 2012.

    Book  Google Scholar 

  9. Steinberg E, Marsch S, Yanagida T, et al. Development and validation of the Workplace Learning Inventory in Health Sciences Education: a multimethod study. Adv Health Sci Educ. 2023. https://doi.org/10.1007/s10459-023-10295-y

    Article  Google Scholar 

  10. Boekaerts M, Pintrich PR, Zeidner M. Handbook of self-regulation. San Diego, CA: Academic; 2010.

    Google Scholar 

  11. Schunk DH, Greene JA. Handbook of self-regulation of learning and performance. In:, editors. Second edition. New York, NY, London: Routledge/Taylor & Francis Group; 2018.

    Google Scholar 

  12. Zimmerman BJ. Attaining self-regulation. In: Boekaerts M, editor. Handbook of self-regulation. [Nachdr]. San Diego, Calif.: Academic; 2008. pp. 13–39.

    Google Scholar 

  13. van Houten-Schat MA, Berkhout JJ, van Dijk N, Endedijk MD, Jaarsma ADC, Diemers AD. Self-regulated learning in the clinical context: a systematic review. Med Educ. 2018;52(10):1008–15. https://doi.org/10.1111/medu.13615

    Article  Google Scholar 

  14. Pintrich PR. A conceptual Framework for assessing motivation and self-regulated learning in College Students. Educational Psychol Rev. 2004;16(4):385–407. https://doi.org/10.1007/s10648-004-0006-x

    Article  Google Scholar 

  15. Boekaerts M. Self-regulated learning at the Junction of Cognition and Motivation. Eur Psychol. 1996;1(2):100–12. https://doi.org/10.1027/1016-9040.1.2.100

    Article  Google Scholar 

  16. Weinstein CE, Acee TW, Jung J. Self-regulation and learning strategies. New Dir Teach Learn. 2011;2011(126):45–53. https://doi.org/10.1002/tl.443

    Article  Google Scholar 

  17. Koenka AC. Academic motivation theories revisited: an interactive dialog between motivation scholars on recent contributions, underexplored issues, and future directions. Contemp Educ Psychol. 2020;61:101831. https://doi.org/10.1016/j.cedpsych.2019.101831

    Article  Google Scholar 

  18. Scherer KR. What are emotions? And how can they be measured? Social Sci Inform. 2005;44(4):695–729. https://doi.org/10.1177/0539018405058216

    Article  Google Scholar 

  19. Shuman V, Scherer KR. Concepts and structures of emotions. In: Linnenbrink-Garcia L, Pekrun R, editors. International handbook of emotions in education. New York: Routledge Taylor & Francis Group; 2014. pp. 13–35.

    Google Scholar 

  20. Duffy MC, Lajoie SP, Pekrun R, Lachapelle K. Emotions in medical education: examining the validity of the medical emotion scale (MES) across authentic medical learning environments. Learn Instruction. 2018. https://doi.org/10.1016/j.learninstruc.2018.07.001

    Article  Google Scholar 

  21. Strand P, Sjöborg K, Stalmeijer R, Wichmann-Hansen G, Jakobsson U, Edgren G. Development and psychometric evaluation of the undergraduate Clinical Education Environment measure (UCEEM). Med Teach. 2013;35(12):1014–26. https://doi.org/10.3109/0142159X.2013.835389

    Article  Google Scholar 

  22. Ben-Eliyahu A. Academic emotional learning: a critical component of self-regulated learning in the emotional learning cycle. Educational Psychol. 2019;54(2):84–105. https://doi.org/10.1080/00461520.2019.1582345

    Article  Google Scholar 

  23. Ben-Eliyahu A, Bernacki ML. Addressing complexities in self-regulated learning: a focus on contextual factors, contingencies, and dynamic relations. Metacognition Learn. 2015;10(1):1–13. https://doi.org/10.1007/s11409-015-9134-6

    Article  Google Scholar 

  24. Panadero E. A review of self-regulated learning: six models and four directions for Research. Front Psychol. 2017;8:422. https://doi.org/10.3389/fpsyg.2017.00422

    Article  Google Scholar 

  25. Wolters CA. Regulation of motivation: evaluating an underemphasized aspect of self-regulated learning. Educational Psychol. 2003;38(4):189–205. https://doi.org/10.1207/S15326985EP3804_1

    Article  Google Scholar 

  26. Roth A, Ogrin S, Schmitz B. Assessing self-regulated learning in higher education: a systematic literature review of self-report instruments. Educational Assess Evaluation Account. 2016;28(3):225–50. https://doi.org/10.1007/s11092-015-9229-2

    Article  Google Scholar 

  27. Allen MS, Iliescu D, Greiff S. Single item measures in Psychological Science. Eur J Psychol Assess. 2022;38(1):1–5. https://doi.org/10.1027/1015-5759/a000699

    Article  Google Scholar 

  28. Fisher GG, Matthews RA, Gibbons AM. Developing and investigating the use of single-item measures in organizational research. J Occup Health Psychol. 2016;21(1):3–23. https://doi.org/10.1037/a0039139

    Article  Google Scholar 

  29. Gogol K, Brunner M, Goetz T, et al. My questionnaire is too long! The assessments of motivational-affective constructs with three-item and single-item measures. Contemp Educ Psychol. 2014;39(3):188–205. https://doi.org/10.1016/j.cedpsych.2014.04.002

    Article  Google Scholar 

  30. Fuchs C, Diamantopoulos A. Using single-item measures for construct measurement in management research. Conceptual issues and application guidelines; 2009.

  31. Gabriel AS, Podsakoff NP, Beal DJ, et al. Experience sampling methods: a discussion of critical trends and considerations for Scholarly Advancement. Organizational Res Methods. 2019;22(4):969–1006. https://doi.org/10.1177/1094428118802626

    Article  Google Scholar 

  32. Geiser C, Götz T, Preckel F, Freund PA. States and traits. Eur J Psychol Assess. 2017;33(4):219–23. https://doi.org/10.1027/1015-5759/a000413

    Article  Google Scholar 

  33. Kline RB, editor. Principles and practice of structural equation modeling. Fourth edition. New York, London: The Guilford Press; 2016.

  34. Klingsieck KB. Kurz Und knapp – die Kurzskala Des Fragebogens „Lernstrategien Im Studium (LIST) [A short version of the questionnaire on learning strategies in higher education (LIST)]. Z für Pädagogische Psychologie. 2018;32(4):249–59. https://doi.org/10.1024/1010-0652/a000230

    Article  Google Scholar 

  35. Boerner S, Seeber G, Keller H, Beinborn P. Lernstrategien Und Lernerfolg Im Studium [Lerning strategies and academic success in higher education]. Zeitschrift für Entwicklungspsychologie Und Pädagogische Psychologie. 2005;37(1):17–26. https://doi.org/10.1026/0049-8637.37.1.17

    Article  Google Scholar 

  36. Kunter M, Schümer G, Artelt C, et al. PISA 2000: Dokumentation Der Erhebungsinstrumente [PISA 2000: Documention of Assessment Instruments]. Berlin: Max-Planck-Institut für Bildungsforschung; 2002.

    Google Scholar 

  37. Schwarzer R, Jerusalem M, editors. Skalen Zur Erfassungvon Lehrer- Und Schülermerkmalen. Dokumentation Der Psychometrischen Verfahren Im Rahmen Der Wissenschaftlichen Begleitung Des Modellversuchs Selbstwirksame Schulen. [Scales for assessing teacher- and student characteristics. Documentation of assessment instruments of the pilot project on self-effective schools]. Berlin: Freie Universität Berlin; 1999.

  38. Rotthoff T, Ostapczuk M, Kröncke K, Schneider M, Decking U, Ritz-Timme S, editors. Erfassung Des Lehr-/Lernklimas in Der Medizinischen Ausbildung - Validierung Des DREEM (Dundee Ready Education Environment Measure) in deutscher sprache und ergänzender Messinstrumente. [German Medical Science GMS Publishing House]; 2010.

  39. Rotthoff T, Ostapczuk MS, de Bruin J, Decking U, Schneider M, Ritz-Timme S. Assessing the learning environment of a faculty: psychometric validation of the German version of the Dundee Ready Education Environment Measure with students and teachers. Med Teach. 2011;33(11):e624–36. https://doi.org/10.3109/0142159x.2011.610841

    Article  Google Scholar 

  40. Schwinger M, von der Laden T, Spinath B. Strategien Zur Motivationsregulation und ihre erfassung [Strategies for the regulation of motivation and their assessment]. Zeitschrift für Entwicklungspsychologie Und Pädagogische Psychologie. 2007;39(2):57–69. https://doi.org/10.1026/0049-8637.39.2.57

    Article  Google Scholar 

  41. Loch N, Hiller W, Witthöft M. Der cognitive emotion regulation questionnaire (CERQ). Z für Klinische Psychologie Und Psychother. 2011;40(2):94–106. https://doi.org/10.1026/1616-3443/a000079

    Article  Google Scholar 

  42. Garnefski N, Kraaij V, Spinhoven P. Negative life events, cognitive emotion regulation and emotional problems. Pers Indiv Differ. 2001;30(8):1311–27. https://doi.org/10.1016/s0191-8869(00)00113-6

    Article  Google Scholar 

  43. Cronbach LJ, Meehl PE. Construct validity in psychological tests. Psychol Bull. 1955;52(4):281.

    Article  Google Scholar 

  44. Unipark, Cologne. Germany: Globalpark; 2022.

  45. Brown TA. Confirmatory factor analysis for applied research. 2nd edition. The Guilford Press; 2015.

  46. Levy P. The correction for spurious correlation in the evaluation of short-form tests. J. Clin. Psychol. 1967;23(1):84–86. https://doi.org/10.1002/1097-4679(196701)23:1%3C84:AID-JCLP2270230123%3E3.0.CO;2-2

  47. Nunnally J. Psychometric theory. 2nd edition. New York: McGraw-Hill; 1978.

  48. DeVellis RF. Scale development: Theory and applications. 4th edition. Thousand Oaks, CA: SAGE Publications; 2016.

  49. Cohen J. Statistical Power Analysis for the behavioral sciences. New York: Routledge; 1988.

    Google Scholar 

  50. Muthén LK, Muthén BO. (1998–2017). Mplus user’s guide. 8th edition. Los Angeles, CA: Muthén & Muthén.

  51. R Core Team. (2023). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/

  52. Sarstedt M, Wilczynski P. More for less? A comparison of single-item and multi-item measures. Die Betriebswirtschaft. 2009;69(2):211–27. https://www.researchgate.net/profile/marko-sarstedt/publication/281306739_more_for_less_a_comparison_of_single-item_and_multi-item_measures/links/5623546608ae70315b5a72e6/more-for-less-a-comparison-of-single-item-and-multi-item-measures.pdf

    Google Scholar 

  53. Cuvillier M, Léger P-M, Sénécal S. Quantity over quality: do single-item scales reflect what users truly experienced? Computers Hum Behav Rep. 2021;4:100097. https://doi.org/10.1016/j.chbr.2021.100097

    Article  Google Scholar 

  54. Wolters CA, Won S. Validity and the use of self-report questionnaires to asess self-regulated learning. In: Schunk DH, Greene JA, eds. Handbook of self-regulation of learning and performance. Second edition. New York, NY, London: Routledge/Taylor & Francis Group; 2018.

Download references

Acknowledgements

The authors wish to thank the students for participating and also extend thanks to the teachers for their support in conducting this research.

Funding

This research received funding from the Austrian Science Fund, P 33913.

Author information

Authors and Affiliations

Authors

Contributions

ES is responsible for the study design, data collection, data analysis, results interpretation, writing and review of the manuscript. TY is responsible for data analysis and contributed to the study design, results interpretation, writing and review of the manuscript. LDU and FP contributed to the study design, results interpretation, and review of the manuscript. CK was responsible for data collection at Institution 2. SM, LS, UA, CP and PB contributed to the data collection at Institution 1. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Evelyn Steinberg.

Ethics declarations

Ethical approval

The study was submitted to the Medical University of Vienna’s Ethics Committee, which takes decisions on studies involving human subjects at the University of Veterinary Medicine in Vienna. Its decision was that no ethical approval was required according to the Declaration of Helsinki. The following ethical standards were met: Participation in the study was completely voluntary; written informed consent was obtained for participation in the study and use of the data; participants were assured that their responses would remain confidential and would only be used for scientific purposes; and complete anonymity was ensured by refraining from publishing any data that would allow conclusions to be drawn about the participants’ identities.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Steinberg, E., Yanagida, T., Marsch, S. et al. Undergraduates’ workplace learning in health sciences education: psychometric properties of single-item measures. BMC Med Educ 24, 861 (2024). https://doi.org/10.1186/s12909-024-05848-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12909-024-05848-7

Keywords