Indigenizing and co-producing the ACGME anesthesiology milestone in Taiwan: a Delphi study and subgroup analysis

Kang, Enoch Yi-No; Chi, Kuan-Yu; Liao, Faith; Liu, Chih-Chung; Lin, Chih-Peng; Chen, Ta-Liang; Tanaka, Pedro; Chen, Chien-Yu

doi:10.1186/s12909-024-05081-2

Research
Open access
Published: 19 February 2024

Indigenizing and co-producing the ACGME anesthesiology milestone in Taiwan: a Delphi study and subgroup analysis

Enoch Yi-No Kang^1,2,3^na1,
Kuan-Yu Chi^4,5^na1,
Faith Liao^3,6,
Chih-Chung Liu^1,7,
Chih-Peng Lin⁸,
Ta-Liang Chen^1,7,9,
Pedro Tanaka¹⁰ &
…
Chien-Yu Chen^3,6,7,8

BMC Medical Education volume 24, Article number: 154 (2024) Cite this article

716 Accesses
Metrics details

Abstract

Background

To implement the ACGME Anesthesiology Milestone Project in a non-North American context, a process of indigenization is essential. In this study, we aim to explore the differences in perspective toward the anesthesiology competencies among residents and junior and senior visiting staff members and co-produce a preliminary framework for the following nation-wide survey in Taiwan.

Methods

The expert committee translation and Delphi technique were adopted to co-construct an indigenized draft of milestones. Descriptive analysis, chi-square testing, Pearson correlation testing, and repeated-measures analysis of variance in the general linear model were employed to calculate the F values and mean differences (MDs).

Results

The translation committee included three experts and the consensus panel recruited 37 participants from four hospitals in Taiwan: 9 residents, 13 junior visiting staff members (JVSs), and 15 senior visiting staff members (SVSs). The consensus on the content of the 285 milestones was achieved after 271 minor and 6 major modifications in 3 rounds of the Delphi survey. Moreover, JVSs were more concerned regarding patient care than were both residents (MD = − 0.095, P < 0.001) and SVSs (MD = 0.075, P < 0.001). Residents were more concerned regarding practice-based learning improvement than were JVSs (MD = 0.081; P < 0.01); they also acknowledged professionalism more than JVSs (MD = 0.072; P < 0.05) and SVSs (MD = 0.12; P < 0.01). Finally, SVSs graded interpersonal and communication skills lower than both residents (MD = 0.068; P < 0.05) and JVSs (MD = 0.065; P < 0.05) did.

Conclusions

Most ACGME anesthesiology milestones are applicable and feasible in Taiwan. Incorporating residents’ perspectives may bring insight and facilitate shared understanding to a new educational implementation. This study helped Taiwan generate a well-informed and indigenized draft of a competency-based framework for the following nation-wide Delphi survey.

Highlight

1. Most ACGME anesthesiology milestones are applicable in Taiwan.

2. Experienced anesthesiologists achieved consensus faster than young practitioners.

3. Residents mirrored milestone competencies through participation.

4. Experience status affected the weight that anesthesiologists gave to milestone competencies.

Peer Review reports

Introduction

With the continuous development of science and technology, the format of residency training has encountered a paradigm shift from time-based medical education to competency-based medical education (CBME) over the last two decades [1]. This shift has had an enormous impact on resident training programs, reflecting the efforts of the Accreditation Council for Graduate Medical Education (ACGME) to establish CBME for all physicians. For instance, milestones competencies, known as ACGME reporting milestones, include patient care (PC), medical knowledge (MK), systems-based practice (SBP), practice-based learning and improvement (PBLI), professionalism (PROF), and interpersonal and communication skills (ICS), which are considered the criteria to indicate well-developed medical professionals. Notably, the ACGME offers more than 100 specialties and subspecialties, including the Anesthesiology Milestone Project (a workable CBME framework), all of which facilitate learners’ progress from the novice to the expert level with the expected proficiency [2]. Only by explicitly articulating residents’ developmental milestones and outcome competencies could the competency-based teaching, learning, and assessment strategies be developed and performed sequentially and deliberately [3].

The Anesthesiology Milestone Project, initiated conjointly by the ACGME and the American Board of Anesthesiology, has been officially implemented in all the residency training programs in the United States since 2015 [4]. To adopt the CBME conceptual model crossculturally in a foreign clinical and educational system, the indigenizing process such as mixed method exploratory triangulation [5, 6], back-translation [7], or two-step validation [8, 9] were borrowed. Indigenization, in the context of document translation, refers to the process of adapting a document to the cultural and linguistic context of an “indigenous” community. It goes beyond a simple translation of the content and involves considering the cultural nuances, such as professional ethos and hidden curriculum. This ensures that the translated document is culturally appropriate, respectful, and resonates with the target audience. Three issues have been addressed in our indigenization process. First, since the ACGME milestone is reported in English, it needs to be carefully translated and interpreted to the common “language” used and understood by the local practitioners [10]. Expert committee translation may ensure content accuracy and prevent cultural loss in the local context during CBME adoption [11]. Second, the co-production model of healthcare and its education, which values patients’ and learners’ engagement, was urged nowadays to facilitate sustainability and desired outcomes [5, 6, 8, 12]. Such a conceptual model could be applied to a consensus study by deliberately recruiting different categories of key stakeholders [8, 13, 14]. Considering the junior members’ perspectives is essential to meeting the learners’ needs. Co-production model has been proposed as an appropriate approach to develop trainees’ competencies, and ACGME also encourages attendings to guide residents using co-production model [15, 16]. A well-structured medical education program, which requires co-planning, co-executing, and co-producing the learning experience between the trainers and trainees collaboratively, may provide a substantial learning outcome and increase health care quality [12, 15, 16].

Third, the Delphi technique, considered a consensus method, has been used to establish consensus, develop concepts, and formulate future research directions across a range of subjects [17]. The opinions of panelists are iteratively proposed, collected, and analyzed by using this method until any disagreement is resolved [18]. This approach is primarily used for curriculum development, policy-setting, criterion-setting, and goal-setting and thus has been widely applied in medical education to develop curricula [19,20,21,22] and assess learning outcomes [23, 24]. Furthermore, subgroup analysis could be employed in a Delphi survey to unfold the differences between panels [13, 25, 26]. Bringing in such contextual insight may facilitate trainer–trainee cooperation for designing, developing, and delivering medical education and improving the overall learning process and outcome.

This study is a preliminary work of a two-year project (funded by the Taiwan Ministry of Science and Technology, MOST 105-2511-S-038-003), piloted in four teaching hospitals for preparing the following nation-wide Delphi survey [27]. The primary objective of this study is to review the relevance and evaluability of the ACGME 25 sub-competencies and their 285 milestones by all stakeholders, including senior visiting staff members (SVSs), junior visiting staff members (JVSs), and residents. The secondary research objective is to compare the differences between generations and investigate the trainers’ and trainees’ conceptual diversity during the multi-step consensus development process.

Methods

The expert committee was employed to ensure the translation quality before the survey. The Delphi technique was applied for adapting and co-producing the ACGME milestones. The qualitative feedback, achievement of consensus, and trend consistencies were analyzed, presented, and replied to participants during the multiple rounds of the Delphi survey. This study was approved by the Taipei Medical University–Joint Institutional Review Board (TMU-JIRB) with serial number N201604060, and written informed consent was obtained from all participants.

Expert committee translation and questionnaire design

The strategy of expert committee translation [11] was employed to ensure content accuracy from English to Traditional Chinese. Three experts (Chen CY, Lin CP, Liu CC) who are bilingually knowledgeable in anesthesiology and medical education were recruited to work separately (1st step) and together (2nd step) until consensus was achieved (3rd step). After the first Chinese draft of the ACGME anesthesiology milestone was developed, three experts (Chen CY, Kang YN, Liu CC) co-designed an online Delphi questionnaire. We used the Google Docs platform to investigate the relevance and evaluability of all 285 milestones under 25 sub-competencies. The other two questionnaire rounds were re-designed by the same team after each Delphi round. Figure 1 illustrates the research steps of this study.

No specific structure was established prior to data analysis to facilitate the natural development of a theoretical framework from the comments, thus mitigating potential bias stemming from the subjective inclinations and opinions of the researchers. The analytical process comprised three sequential steps, leveraging grounded theory, an inductive methodology that provides systematic guidance for the collection, synthesis, analysis, and conceptualization of qualitative data, ultimately aiming to construct theory [28]. In practice, experts were asked to provide their opinions on any inappropriate items in the Mandarin version, and researchers categorized those opinions into sub-themes following a comprehensive review. They subsequently grouped similar sub-themes together to form main themes.

Delphi panels

The sample size required for the Delphi technique typically ranges from 15 to 30 experts from the same discipline [29,30,31]. We targeted to recruit three panels (i.e., R, JVS, SVS) to incorporate various perspectives and further compare the panels’ differences. To collect useful opinions from the Delphi survey, our inclusion criteria were: (a) anesthesiologists work in teaching hospitals or medical centers during our study, and (b) they received relevant lectures or workshops about the ACGME milestone. We did not recruit year-one residents because they had no whole picture of anesthesia in clinical practice. Therefore, we invited 40 anesthesiologists to participate; of them, the 39 who agreed to participate included 15 SVSs, 13 JVSs, and 11 residents from two medical centers and two teaching hospitals. Two residents (5.1%) lost of following up in round 2 and 3, respectively. The SVSs and JVSs had an experience of ≥ 10 and < 10 years, respectively.

Data collection and analysis

Delphi survey was used for qualitative and quantitative data collection, whereby scores and interpretations of individual ACGME anesthesiology milestone items were explored separately. Moreover, this study examined the differences in applicable scores of the six domains and five levels among three anesthesiologist experience statuses in the final round (round 3) of the Delphi survey. The Delphi survey used a 4-point Likert scale, ranging from 1 (disagree) and 4 (agree). Research has shown that using fewer scale points can lead to higher reliability [32], with four to seven points commonly used in studies [33]. Additionally, evidence suggests that cultural differences play a role in survey studies [34, 35], and in our cultural context, a 4-point scale was chosen for our expert survey. Because social expectations and hierarchical pressure in their hospital may have led to bias among our anesthesiologist experts, online questionnaire and anonymity were deliberately employed. A research assistant was trained to manage the Delphi survey by using a standard approach; this involved the construction of an anonymous Google questionnaire for the Delphi survey. After completing qualitative and quantitative data collection, the research assistant sent a summary of each survey round to all participants; moreover, this summary report was completely anonymized. All qualitative comments were carefully reviewed and taken into consideration for co-producing an indigenized anesthesiology milestone. The qualitative feedback, achievement of consensus, and trend consistencies (i.e., consistency between different rounds) were analyzed, presented, and replied to participants during the multiple rounds of the Delphi survey.

Subgroup analysis

The Delphi method facilitated the forecasting process of negotiating nonconsensus items among the 37 anesthesiologists’ perspectives. We judged nonconsensus by using interquartile ranges (IQR) with the relevant methodology. Because an IQR demonstrates dispersion from the median and consists of the middle 50% of the cases, an IQR of ≤ 1 indicates that > 50% of the expert responses fall within 1 point [36]. By contrast, if an IQR of > 1 is obtained in each study round, the nonconsensus items are excluded from or revised in each subsequent round. The results of each round were presented using a 2 × 3 contingency table reporting the results of the chi-square test of independence and displaying Cramer’s V. Trend consistencies were also examined using Pearson correlation testing. Multiple pairwise comparisons were performed using the general linear model (GLM) for the differences in the applicable scores of the six domains and five levels among the three experience statuses. The results of the GLM analyses are presented as F values, mean differences (MDs), and standard errors (SEs), all with their corresponding 95% confidence intervals (CIs).

Results

After three rounds of Delphi surveys, all 285 milestones have achieved either consensus or stability. 274 (96.1%) and 172 (60.3%) milestones were regarded as relevant and evaluable, respectively, in round one (Fig. 1). Seven milestones did not reach consensus in the first round, and we raised three of them as examples as follows:

After adding six newly proposed milestones, another 8 and 39 milestones achieved consensus in round two. 2.5% (n = 7) and 14.7% (n = 42) of the anesthesiology milestones were regarded as irrelevant and non-evaluable, respectively. 271 (95.1%) milestones were revised or rephrased without changing their original meaning, whereas 6 (2.1%) milestones in medical knowledge have been replaced because all of them are the exams held in the US. The qualitative comments (n = 1145) were collected, analyzed, and categorized into four main themes: definition, relevance, evaluability, and others; and their sub-themes and representative comments were presented in Table 1. The qualitative analysis for theme development provided contextual meanings and practical insights into our milestone indigenization process. For instance, the comments in the sub-theme of irrelevant offered the rationale why a milestone should be deleted, and those in the sub-theme of translation provided a more accurate wording in Mandarin to rephrase a milestone.

Table 1 Themes, sub-themes, and representative comments

Full size table

We further performed chi-squared test for subgroup analysis on the panels of residents, JVSs, and SVSs separately (Table 2). In the final round of this separate Delphi analysis, all experts achieved significant consensus regardless of their experience status (P = 0.059). However, in round 1 of this analysis, residents, JVSs, and SVSs had 41 (14.44%), 22 (7.72%), and 6 (2.11%) nonconsensus items, respectively (P < 0.01). To address the discrepancy, we conducted round 2. Then, SVSs attained 100% consensus in round 2 with the lowest number of nonconsensus items, whereas JVSs and residents had one and five nonconsensus items, respectively. The difference was significant (P = 0.013) for one nonstable nonconsensus item out of the three nonconsensus items among JVSs. Consequently, we performed round 3 to minimize nonstable nonconsensus. Though there is still one nonconsensus item, which was a stable event, all statuses are comparable for the consensus throughout round 3.

Table 2 Differences in non-consensus items among levels of anesthesiologists in each round of Delphi study

Full size table

During all three rounds above, the correlation coefficients between rounds increased round by round, consistently and significantly (P < 0.001). Among residents, JVSs and SVSs, the correlation coefficients increased from 0.965, 0.896, and 0.962, respectively, for rounds 1 and 2 to 0.984, 0.998, and 1.00, respectively, for rounds 2 and 3. Notably, SVSs attained a positive correlation at the end of all rounds; in other words, all 15 SVSs reached consensus for each item (Table 3). By contrast, the correlation coefficients between experience statuses for each round demonstrated a rather decremental trend with significantly positive correlation. For rounds 1, 2, and 3, the resident–JVS correlation coefficients were 0.834, 0.735, and 0.724, respectively; similarly, resident–SVS correlation coefficients were 0.817 and 0.799 for the rounds 1 and 2, respectively.

Table 3 Correlation between levels of anesthesiologists in medical hierarchy in each round of Delphi study

Full size table

Regarding the six core competencies of the ACGME milestones, the pair comparisons among the experience statuses in round 3 are presented in Table 4. The difference in every pair of comparison was small for the MK and SBP domains, with four pairs presenting marginal difference: resident–JVS in MK (MD = − 0.37; 95% CI = − 0.79, 0.04; P < 0.10), JVS–SVS in SBP (MD = − 0.06; 95% CI = − 0.14, 0.01), resident–SVS in PBLI (MD = 0.072; 95% CI = 0.00, 0.15), and JVS–SVS in PROF (MD = 0.047; 95% CI = − 0.01, 0.10; all P < 0.10). By contrast, for the PC domain, values of JVSs were significantly higher than those of both residents (MD = − 0.095; 95% CI = − 0.15, − 0.04) and SVSs (MD = 0.075; 95% CI = 0.04, 0.11; both P < 0.001). Regarding PBLI, values of residents were significantly higher than those of JVSs (MD = 0.081; 95% CI = 0.03, 0.14; P < 0.01) and were less significantly higher than those of SVSs (MD = 0.072; 95% CI = 0.00, 0.15; P < 0.1). Moreover, residents acknowledged PROF more than either JVSs (MD = 0.072; 95% CI = 0.02, 0.13; P < 0.05) or SVSs (MD = 0.12; 95% CI = 0.04, 0.2; P < 0.01) did. Finally, SVSs graded ICS significantly lower than both residents (MD = 0.068; 95% CI = 0.01, 0.13; P < 0.05) and JVSs (MD = 0.065; 95% CI = 0.01, 0.12; P < 0.05) did.

Table 4 Multiple comparisons between statuses in the final Delphi survey round on six domains

Full size table

Regarding the five levels of the six domains of ACGME milestones (Table 5), the pair comparisons between experience statuses revealed almost no effect at levels 1 and 3, except for marginal differences in the resident–JVS pair at level 1 (MD = − 0.04; 95%CI = − 0.10, 0.00) and the resident–SVS pair at level 3 (MD = 0.057; 95%CI = − 0.01, 0.12; both P < 0.1). Nevertheless, residents put significantly less emphases on level 2 than JVSs (MD = − 0.76; 95%CI = − 0.14, − 0.02) and SVSs (MD = − 0.58; 95%CI = − 0.12, 0.00) did (both P < 0.05); nevertheless, both residents and JVSs put significantly more emphases on levels 4 and 5 than SVSs did.

Table 5 Multiple comparisons between statuses in the final Delphi survey round on five levels

Full size table

Discussion

Implementing a milestone project outside the North America is challenging due to the difference in culture and healthcare system. Not only cultural diversity but also medical education are potential factors influencing PC [37]. Thus, cultural competence training could be exploited as a strategy to improve practitioners’ PC knowledge, attitudes, skills, and most importantly, competency [38]. The primary purpose of our study was to translate, co-produce, and indigenize the present ACGME milestones, a competency-based assessment, in anesthesiology for developing a well-informed draft for the later nation-wide consensus survey and investigating whether they are applicable for local practice. Four phenomena were thus noted: (a) indigenizing the ACGME anesthesiology milestones in Taiwan is feasible; (b) experienced anesthesiologists achieved consensus fast; (c) experience status affected the weight that anesthesiologists gave to milestone competencies; and (d) residents mirrored milestone competencies through participation.

Indigenizing the ACGME anesthesiology milestones in Taiwan is feasible

This study’s primary finding showed that the ACGME anesthesiology milestones are highly feasible in Taiwan after indigenization. The expert committee and the Delphi process ensured the cross-cultural translation quality and validity. Most of the milestones are relevant to clinical practice and residents’ training, especially after the co-production process. Yet, one-seventh of them are difficult to be evaluated and might need further considerations in their implementation. Compared to a similar Delphi study, the cross-country differences in Anesthesiology are not as significant those in Emergency Medicine (14.7% versus 21%) [39]. The co-production model we adopted here helped us engage in learners’ voices, reflect democratic principles, and hopefully, contribute to a better educational outcome. We recruited trainees and junior specialists as representatives, which was also done in several recent Delphi studies [8, 40,41,42]. Although this indigenized document is only an introductory version, it is the first competency-based framework developed for the Taiwan anesthesia curriculum. Moreover, our preliminary results have later encouraged the Taiwan Society of Anesthesiologists (TSA) to set up a CBME taskforce and develop a nation-wide anesthesiology milestone for all training centers. This two-step approach, which started from a single-center development process followed by a national consensus, was also performed in other specialties [8, 9].

Experience status affects the consensus and weight

The secondary finding of our study was that all three experience status groups, consisting of 37 anesthesiologists, reached a consensus on the six competencies domains at the end of the Delphi analysis with little difference (P = 0.059), even though some milestone items may not be directly applicable to the local context (five stable nonconsensus items among residents and one stable nonconsensus item among JVSs). Notably, we found gradients in consensus and nonconsensus items during round 1. SVSs, with more experience, demonstrated more consensus items (Table 2), and they attained complete consensus in round 2—which was compatible with the strong correlation between their rounds 2 and 3—in contrast to resident and JVS who achieved complete agreement only in round 3. This phenomenon probably resulted from the fact that experienced anesthesiologists (i.e., SVSs) tend to possess a higher quality of anesthesiology knowledge, skills, and communication ability than JVSs and residents do, which aids SVSs in cultivating competent anesthesiologists. The early consensus also indicated that these anesthesiology experts agreed regarding the anesthesiology training process.

As shown in Table 3, slightly decreasing trends were noted in the correlation coefficients between experience statuses, potentially because of different weights for the six competencies among the five levels and experience statuses (Tables 4 and 5). A possible factor underlying the domains having significant differences in weights based on the experience statuses is PC. As the first component of six domains, PC, is the most fundamental element in daily medical practice, highlighting the importance of patient-centered care as a mainstay approach to health care [43]. However, trainees, including medical students and residents, commonly overlook the importance of PC, mainly because trainees have a propensity to pursue higher achievements. For instance, in Taiwan, medical students ask to participate in clinical research to obtain more publications and apply for highly competitive specialties, such as anesthesiology. Residents’ academic activities are positively related to their clinical performance, and thus, they should be encouraged to utilize evidence-based medicine to enhance PC quality [44]. Moreover, PC should always remain prioritized during clinical practice and course training.

Our study also indicated that residents valued PROF more than the visiting staff. PROF represents the behaviors and attitudes toward patients, surgeons, and colleagues [45]. Multiple factors could explain our result: First, compared with surgeons or generalists, anesthesiologists rarely deal with patients who assume them to have a comprehensive perception of their chief complaint, present illness, medical history, and surgical procedure at their first preoperative meeting—where rapid evaluation of patient conditions is performed and perioperative reassurance is provided [45]. Second, coping with the surgical staff could be challenging for anesthesiology residents because some surgical staff members tend to refer to anesthesiologists as merely perioperative “consultants” rather than experts, even though anesthesiologists may have more thorough knowledge regarding patient status; as a result, residents, particularly those with less experience, find it difficult to simultaneously communicate with the surgical staff and provide good care to the patients during the perioperative period [45]. Taken together, all these factors tend to make younger residents consider themselves to have sufficient professionalism and anticipate fitting into surgical teams and treating patients well.

Residents mirrored milestone competencies through participation

Another major highlight of our study was that although all nine residents had the highest number of nonconsensus items than JVS and SVS, the difference was insignificant during round 3 (n = 5) as stated above (P = 0.059). Moreover, the correlation coefficients between rounds 2 and 3 demonstrated a highly positive and significant correlation, with a trend of an increase in correlation in each round. This indicated that the residents (i.e., trainees) agreed with the contents of their training program. A study suggested that it is vital to include the trainees’ voices when developing evaluation tools for competency and performance because it is “likely to inform our understanding of whether and how assessments can serve the purpose of learning” [46, 47].

Another study on ACGME plastic surgery milestone evaluation reported that all residents, except for chief residents, significantly self-assessed higher than their attendings did; nevertheless, their evaluation results provided significant correlation coefficients. Thus, expectations for competency and performance standards differ between residents and their attendings [48]. The same effect was noted in an emergency department by Goldfam et al.; the residents there tended to overestimate their sub-competencies [49]. Notably, in the report above regarding plastic surgery milestones, [48] residents’ self-assessments throughout each year demonstrated the Dunning–Kruger effect [50]—a phenomenon related to the cognitive bias that causes people with low ability to overestimate their actual performance or competency. Therefore, letting residents know how they will be assessed and how they may rate themselves during courses could help them construct an image of a competent anesthesiologist, thus enhancing their confidence and easing their progress.

Limitations and future directions

This study had several limitations. First, it was mainly conducted in two medical centers and two teaching hospitals in Taipei City, the capital of Taiwan, which has high resource availability and demand for health care services and related research and teaching. Second, the representative issue, such as whether residents or junior faculty members should be recruited into the panel as regarded as “experts”, might be challenged and criticized. Thus, our findings may not be generalizable to other situations, such as in areas with limited access to general health care (e.g., hospitals in remote areas). Future investigations that incorporate a nationally representative sample may be necessary for effective educational implementation.

Conclusion

Most ACGME anesthesiology milestones are applicable in Taiwan. However, the five nonconsensus items noted in the current study warrant further detailed discussion during the implementation of the milestones. Moreover, our results contributed to the relevant perspectives of both senior staff members and residents. Our empirical results may guide medical educators when planning anesthesiology training curricula and raise their awareness regarding anesthesiology trainees’ learning process and understanding.

Data availability

Data described in the manuscript, code book, and analytic code will be made available upon request to the corresponding author Dr. Chien-Yu Chen (email: jc2jc@tmu.edu.tw).

Abbreviations

ACGME:: Accreditation Council for Graduate Medical Education
CBME:: Competency-based medical education
CI:: Confidence interval
ICS:: Interpersonal and communication skills
IQR:: Interquartile range
JVR:: Junior visiting staff
MD:: Mean difference
MK:: Medical knowledge
PBLI:: Practice-based learning and improvement
PC:: Patient care
PROF:: Professionalism
SBP:: Systems-based practice
SD:: Standard deviation
SE:: Standard error
SVS:: Senior visiting staff

References

Carraccio C, Wolfsthal SD, Englander R, Ferentz K, Martin C. Shifting paradigms: from Flexner to competencies. Acad Med. 2002;77(5):361–7.
Article PubMed Google Scholar
Milestones S. [https://www.acgme.org/What-We-Do/Accreditation/Milestones/Milestones-by-Specialty].
Van Melle E, Frank JR, Holmboe ES, Dagnone D, Stockley D, Sherbino J. International Competency-based Medical Education C: a Core Components Framework for evaluating implementation of competency-based Medical Education Programs. Acad Med. 2019;94(7):1002–9.
Article PubMed Google Scholar
ACGME.: The Anesthesiology Milestone Project. In.; 2015.
Hamui-Sutton A, Monterrosas-Rojas AM, Ortiz-Montalvo A, Flores-Morones F, Torruco-García U, Navarrete-Martínez A, Arrioja-Guerrero A. Specific entrustable professional activities for undergraduate medical internships: a method compatible with the academic curriculum. BMC Med Educ. 2017;17(1):143.
Article PubMed PubMed Central Google Scholar
Gutiérrez-Barreto SE, Durán-Pérez VD, Flores-Morones F, Esqueda-Nuñez RI, Sánchez-Mojica CA, Hamui-Sutton A. Importance of context in entrustable professional activities on surgical undergraduate medical education. MedEdPublish 2018, 7:109.
Castro UB, Gomes GR, Simão KFR, Egito L, Figueiredo S, Júnior RZB. Translation and Transcultural Adaptation of the Milestones Instrument To Assess Teaching in Medical Residency Services on Orthopedics and Traumatology. Rev Bras Ortop (Sao Paulo). 2022;57(5):795–801.
PubMed Google Scholar
Ganzhorn A, Schulte-Uentrop L, Küllmei J, Zöllner C, Moll-Khosrawi P. National consensus on entrustable professional activities for competency-based training in anaesthesiology. PLoS ONE. 2023;18(7):e0288197.
Article CAS PubMed PubMed Central Google Scholar
Schmidbauer ML, Pinilla S, Kunst S, Biesalski A-S, Bösel J, Niesen W-D, Schramm P, Wartenberg K, Dimitriadis K. The Isg: fit for service: preparing residents for Neurointensive Care with Entrustable Professional activities: a Delphi Study. Neurocrit Care. 2023.
ten Cate O. Competency-based education, Entrustable Professional activities, and the power of Language. J Graduate Med Educ. 2013;5(1):6–7.
Article Google Scholar
Epstein J, Osborne RH, Elsworth GR, Beaton DE, Guillemin F. Cross-cultural adaptation of the Health Education Impact Questionnaire: experimental study showed expert committee, not back-translation, added value. J Clin Epidemiol. 2015;68(4):360–9.
Article PubMed Google Scholar
Batalden M, Batalden P, Margolis P, Seid M, Armstrong G, Opipari-Arrigan L, Hartung H. Coproduction of healthcare service. BMJ Qual Saf. 2016;25(7):509–17.
Article PubMed Google Scholar
Howarth E, Vainre M, Humphrey A, Lombardo C, Hanafiah AN, Anderson JK, Jones PB. Delphi study to identify key features of community-based child and adolescent mental health services in the East of England. BMJ Open. 2019;9(6):e022936.
Article PubMed PubMed Central Google Scholar
de Villiers MR, de Villiers PJT, Kent AP. The Delphi technique in health sciences education research. Med Teach. 2005;27(7):639–43.
Article PubMed Google Scholar
Edgar L, McLean S, Hogan S, Hamstra S, Holmboe ES. The milestones guidebook. Accreditation Council for Graduate Medical Education. 2020.
Warm EJ, Edgar L, Kelleher M. A guidebook for implementing and changing assessment in the milestones era. Accreditation Council for Graduate Medical Education; 2020.
Rowe G, Wright G. The Delphi technique as a forecasting tool: issues and analysis. Int J Forecasting.1999, 15(4):353–75.
Jones J, Hunter D. Consensus methods for medical and health services research. Brit Med J.1995, 311(7001):376-80.
Almeland SK, Lindford A, Berg JO, Hansson E. A core undergraduate curriculum in plastic surgery - a Delphi consensus study in Scandinavia. J Plast Surg Hand Surg. 2018;52(2):97–105.
Article PubMed Google Scholar
Craig C, Posner GD. Developing a Canadian curriculum for Simulation-Based Education in Obstetrics and Gynaecology: a Delphi Study. J Obstet Gynaecol Can. 2017;39(9):757–63.
Article PubMed Google Scholar
Francis NK, Walker T, Carter F, Hubner M, Balfour A, Jakobsen DH, Burch J, Wasylak T, Demartines N, Lobo DN, et al. Consensus on training and implementation of enhanced recovery after surgery: a Delphi Study. World J Surg. 2018;42(7):1919–28.
Article PubMed Google Scholar
Veronesi G, Dorn P, Dunning J, Cardillo G, Schmid RA, Collins J, Baste JM, Limmer S, Shahin GMM, Egberts JH, et al. Outcomes from the Delphi process of the Thoracic Robotic Curriculum Development Committee. Eur J Cardio-thoracic Surg.. 2018;53(6):1173–9.
Article Google Scholar
Hasselager AB, Lauritsen T, Kristensen T, Bohnstedt C, Sonderskov C, Ostergaard D, Tolsgaard MG. What should be included in the assessment of laypersons’ paediatric basic life support skills? Results from a Delphi consensus study. Scand J Trauma Resusc Emerg Med. 2018;26(1):9.
Article PubMed PubMed Central Google Scholar
Knight S, Aggarwal R, Agostini A, Loundou A, Berdah S, Crochet P. Development of an objective assessment tool for total laparoscopic hysterectomy: a Delphi method among experts and evaluation on a virtual reality simulator. PLoS ONE. 2018;13(1):e0190580.
Article PubMed PubMed Central Google Scholar
Boote J, Barber R, Cooper C. Principles and indicators of successful consumer involvement in NHS research: results of a Delphi study and subgroup analysis. Health Policy. 2006;75(3):280–97.
Article PubMed Google Scholar
Zill JM, Scholl I, Härter M, Dirmaier J. Which dimensions of patient-centeredness matter? - results of a web-based Expert Delphi Survey. PLoS ONE. 2015;10(11):e0141978.
Article PubMed PubMed Central Google Scholar
Chen C-Y, Tang K-P, Lin F-S, Cherng Y-G, Tai Y-T, Chou FC-C. Implementation of competency-based Medical Education with a local context in a Taiwan Emergency Medicine Residency Training Program. Formosal Med J. 2018;22(1):62–70.
CAS Google Scholar
Jørgensen U. Grounded theory: methodology and theory construction. Int Encyclopedia Social Behav Sci. 2001;1:6396–9.
Google Scholar
De Villiers MR, De Villiers PJ. Kent AP: The Delphi technique in health sciences education research. Med Teacher. 2005, 27(7):639–643.
Linstone HA, Turoff M. The delphi method. Addison-Wesley Reading, MA; 1975.
Moore CM. Group techniques for idea building. Sage Publications, Inc; 1987.
Chang L. A psychometric evaluation of 4-point and 6-point likert-type scales in relation to reliability and validity. Appl Psychol Meas. 1994;18(3):205–15.
Article Google Scholar
Leung S-O. A comparison of psychometric properties and normality in 4-, 5-, 6-, and 11-point likert scales. J Social Service Res. 2011;37(4):412–21.
Article ADS Google Scholar
Wong C-S, Peng KZ, Shi J, Mao Y. Differences between odd number and even number response formats: evidence from mainland Chinese respondents. Asia Pac J Manage. 2011;28:379–99.
Article Google Scholar
Lee JW, Jones PS, Mineyama Y, Zhang XE. Cultural differences in responses to a likert scale. Res Nurs Health. 2002;25(4):295–306.
Article PubMed Google Scholar
De Vet E, Brug J, De Nooijer J, Dijkstra A. De Vries NK. determinants of forward stage transitions: a Delphi study. Health Educ Res. 2005, 20(2):195–205.
Sorensen J, Norredam M, Dogra N, Essink-Bot ML, Suurmond J, Krasnik A. Enhancing cultural competence in medical education. Int J Med Educ. 2017;8:28–30.
Article PubMed PubMed Central Google Scholar
Mary Catherine Beach EGP, Tiffany L, Gary, Karen A, Robinson A, Gozu A, Palacio C, Smarth MW, Jenckes C, Feuerstein EB, Bass NR, Powe, Lisa A. Cooper: Cultural Competency: a systematic review of Health Care Provider Educational interventions. Med Care. 2005;43(4):356–73.
Article PubMed PubMed Central Google Scholar
Chou FC, Hsiao C-T, Yang C-W, Frank JR. Glocalization in medical education: a framework underlying implementing CBME in a local context. J Formos Med Assoc. 2022;121(8):1523–31.
Article PubMed Google Scholar
Timmerberg JF, Dole R, Silberman N, Goffar SL, Mathur D, Miller A, Murray L, Pelletier D, Simpson MS, Stolfi A, et al. Physical Therapist Student Readiness for Entrance into the first full-time clinical experience: a Delphi Study. Phys Ther. 2018;99(2):131–46.
Article Google Scholar
Liu C-H, Hsu L-L, Hsiao C-T, Hsieh S-I, Chang C-W, Huang ES, Chang Y-J. Core neurological examination items for neurology clerks: a modified Delphi study with a grass-roots approach. PLoS ONE. 2018;13(5):e0197463.
Article PubMed PubMed Central Google Scholar
Younas A, Khan RA, Yasmin R. Entrustment in physician-patient communication: a modified Delphi study using the EPA approach. BMC Med Educ. 2021;21(1):497.
Article PubMed PubMed Central Google Scholar
Epstein RM, Street RL Jr. The values and value of patient-centered care. Ann Fam Med. 2011;9(2):100–3.
Article PubMed PubMed Central Google Scholar
Seaburg LA, Wang AT, West CP, Reed DA, Halvorsen AJ, Engstler G, Oxentenko AS, Beckman TJ. Associations between resident physicians’ publications and clinical performance during residency training. BMC Med Educ. 2016;16:22.
Article PubMed PubMed Central Google Scholar
Dorotta I, Staszak J, Takla A, Tetzlaff JE. Teaching and evaluating professionalism for anesthesiology residents. J Clin Anesth. 2006;18(2):148–60.
Article PubMed Google Scholar
Andrade HLD. Ying: Student perspectives on Rubric-Referenced Assessment. Educational & Counseling Psychology Faculty Scholarship 2 2005.
Mertler C. Designing scoring rubrics for yourclassroom. Practical Assess Res &Evaluation 2001, 7(25).
Yao A, Massenburg BB, Silver L, Taub PJ. Initial comparison of Resident and attending milestones evaluations in plastic surgery. J Surg Educ. 2017;74(5):773–9.
Article PubMed Google Scholar
Goldflam K, Bod J, Della-Giustina D, Tsyrulnik A. Emergency Medicine residents consistently rate themselves higher than attending assessments on ACGME milestones. West J Emerg Med. 2015;16(6):931–5.
Article PubMed PubMed Central Google Scholar
Kruger J, Dunning D. Unskilled and unaware of it: how difficulties in recognizing one’s own incompetence lead to inflated self-assessments. J Personal Soc Psychol. 1999;77(6):1121–34.
Article CAS Google Scholar

Download references

Acknowledgements

We want to thank Director Chih-Chen Chou, Associate Professor Yih-Giun Cherng, Professor Ya-Jung Cheng, Associate Professor Yu-Ting Tai, and Assistant Professor Kung-Pei Tang for their advice; Chen-Yang Chao for administrating the research; and all anesthesiology experts for their participation. This manuscript was edited by Wallace Academic Editing.

Funding

This work was supported by the Ministry of Science and Technology, Taiwan [grant number MOST105-2511-S038-003-MY2].

Author information

Enoch Yi-No Kang and Kuan-Yu Chi equally contribute to the study.

Authors and Affiliations

Department of Anesthesiology, School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
Enoch Yi-No Kang, Chih-Chung Liu & Ta-Liang Chen
Institute of Health Policy & Management, College of Public Health, National Taiwan University, Taipei, Taiwan
Enoch Yi-No Kang
Department of Education and Humanities in Medicine, School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
Enoch Yi-No Kang, Faith Liao & Chien-Yu Chen
Department of Medicine, Jacobi Medical Center, Bronx, NY, USA
Kuan-Yu Chi
Albert Einstein College of Medicine, Bronx, NY, USA
Kuan-Yu Chi
Department of Education, Taipei Medical University Hospital, Taipei, Taiwan
Faith Liao & Chien-Yu Chen
Department of Anesthesiology, Taipei Medical University Hospital, Taipei, Taiwan
Chih-Chung Liu, Ta-Liang Chen & Chien-Yu Chen
Department of Anesthesiology, National Taiwan University Hospital, National Taiwan University College of Medicine, Taipei, Taiwan
Chih-Peng Lin & Chien-Yu Chen
Department of Anesthesiology, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan
Ta-Liang Chen
Department of Anesthesia, Stanford University Medical School, Palo Alto, CA, USA
Pedro Tanaka

Authors

Enoch Yi-No Kang
View author publications
You can also search for this author in PubMed Google Scholar
Kuan-Yu Chi
View author publications
You can also search for this author in PubMed Google Scholar
Faith Liao
View author publications
You can also search for this author in PubMed Google Scholar
Chih-Chung Liu
View author publications
You can also search for this author in PubMed Google Scholar
Chih-Peng Lin
View author publications
You can also search for this author in PubMed Google Scholar
Ta-Liang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Pedro Tanaka
View author publications
You can also search for this author in PubMed Google Scholar
Chien-Yu Chen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: Yi-No Kang and Chien-Yu ChenData curation: Chih-Chung Liu and Ta-Liang ChenExpert committee translation: Chih-Chung Liu, Chih-Peng Lin, and Chien-Yu ChenQuestionnaire design: Chih-Chung Liu, Yi-No Kang, and Chien-Yu ChenFormal analysis: Yi-No Kang and Chien-Yu ChenInvestigation: Kuan-Yu Chi, Faith Liao, Chih-Chung Liu, and Chien-Yu ChenMethodology: Yi-No KangSupervision: Ta-Liang ChenVisualization: Yi-No Kang.Writing– original draft: Kuan-Yu Chi and Faith LiaoWriting– review & editing: Kuan-Yu Chi, Yi-No Kang, Chien-Yu Chen, Pedro Tanaka.

Corresponding author

Correspondence to Chien-Yu Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval and consent to participate

All authors confirm that all methods were carried out in accordance with Declaration of Helsinki guidelines and regulations. This study is approved by the Taipei Medical University (TMU)-Joint Institutional Review Board (IRB) (#201604060). Informed written consent was obtained by all participants.

Consent for publication

Not applicable.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Kang, E.YN., Chi, KY., Liao, F. et al. Indigenizing and co-producing the ACGME anesthesiology milestone in Taiwan: a Delphi study and subgroup analysis. BMC Med Educ 24, 154 (2024). https://doi.org/10.1186/s12909-024-05081-2

Download citation

Received: 14 May 2022
Accepted: 22 January 2024
Published: 19 February 2024
DOI: https://doi.org/10.1186/s12909-024-05081-2

Indigenizing and co-producing the ACGME anesthesiology milestone in Taiwan: a Delphi study and subgroup analysis

Abstract

Background

Methods

Results

Conclusions

Highlight

Introduction

Methods

Expert committee translation and questionnaire design

Delphi panels

Data collection and analysis

Subgroup analysis

Results

Discussion

Indigenizing the ACGME anesthesiology milestones in Taiwan is feasible

Experience status affects the consensus and weight

Residents mirrored milestone competencies through participation

Limitations and future directions

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethics approval and consent to participate

Consent for publication

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Education

Contact us