Development and initial validation of a dual-purpose questionnaire capturing mentors’ and mentees’ perceptions and expectations of the mentoring process

Background In health profession education, learners are often coached by mentors for development of competencies, self-direction of learning and professionalism. It is important that the mentee-mentor relationship is aligned in terms of mutual expectations. Methods A dual-purpose questionnaire capturing both the mentor and mentee perceptions on the actual and preferred mentoring functions was designed and validated, by performing a principal component analysis (PCA) using the data of mentees (n = 103) and mentors (n = 23) of a medical course. As a proof of concept, alignment of needs and changes in the mentoring perceptions in mentee groups of different years were determined. Results PCA showed that specific sets of questions addressed important elements in the mentoring process, such as self-direction of learning and reflection (Scale 1), guidance of behavioural change (Scale 4), addressing personal issues and professional identity development (Scale 3 and 5) and how the mentor and mentee presents oneself in the mentoring relationship (Scale 2). Mentors and mentees perceived comparable situations as critical for an effective mentoring process, such as mentor presence and guidance of reflection, although there was also evidence of gaps, such as perception of cultural issues. By comparison of the mentee groups in the different years of the program, the dynamic or evolving nature of the mentor process became evident, mentees experienced more emphasis by the mentor on reflection (Scale 1), at a constant level of mentor presence (Scale 2). Conclusion Given the individualized, context-specific, and dynamic nature of mentoring, programmes would benefit from a regular evaluation of mentoring practices, e.g. by using questionnaires, in order to facilitate organizational revisions and further development of the mentoring competencies.


Background
Mentoring is perceived as an important element of health profession education. Although varying definitions are used, and mentoring is often situational and context-specific, recurrent components of a mentoring relationship in health profession education are support and guidance for the mentees' professional and personal development, by a faculty member or a peer, through the sharing of experiences, knowledge, skills, and wisdom [1,2]. There is general consensus that mentoring programmes have beneficial effects on job satisfaction, career success, personal development, reflection and self-direction of learning. These benefits have been described for the mentee and for the mentor [1][2][3][4][5][6][7]. However, negative experiences have also been described, such as a low quality of the relation or dominance of the mentor, and have been called the 'dark side' of mentoring [8], which affects both the mentor and mentee. The underlying causes for negative mentoring experiences are likely to be multidimensional and can lie in the domain of the mentee, in terms of lack of trust, or bad intent in the relationship, or can be at the side of the mentor, in terms of dysfunctional behaviour, due to lack or insufficient training, or the inability to adapt to changing needs of the mentee [9,10]. The latter is important as increasingly mentor relationships are long-term, spanning the length of the program, using tools such as a portfolio in the context of some form of assessment [11][12][13], especially in health profession education such as nursing and medical education. In addition, it was shown that satisfaction in the mentoring process was the best predictor for a successful relationship in the long term [14]. These studies show that it is extremely important that expectations of mentors and mentees on the mentoring process are aligned and especially that the mentor can adapt to the needs of the mentee during different phases of the education. There is limited information how to evaluate and subsequently guide and support alignment and mutual expectations in the mentoring process.
In terms of evaluation of the mentoring process, several mentorship measurement scales have been described. In a recent literature review by Chen et al. [15], 22 scales for measuring mentors' function, behaviour and relationships were found. It was shown that scales from the field of business and organisation are founded in a common theoretical framework with a 3-function structure: career, psychosocial, and role modelling function. This was resonated in scales such as the Mentorship Relationship Challenges Scale and the Mentoring Functions Questionnaire -9 [16,17]. In these questionnaires, career support is represented as a personal interest of the mentor in career and coordination of professional goals. Psychosocial support is represented by a safe environment in which the mentee can share personal problems and exchange confidences. In addition, the mentor is an example for behaviour and is respected for his/her ability to motivate and teach others. In the health profession educational field however, no universal recognised framework for mentoring was yet found, implying that in education, mentoring is conceptualized context-dependently [18], and used in varying situations, also suggesting that more measurement tools will need to be developed [15]. Recently Schäfer et al. [19] developed a questionnaire for evaluating mentees' satisfaction with the mentoring relationship in the medical education field. Although addressing an important determinant of a successful mentoring relationship in the long term; the satisfaction in the relationship, this questionnaire addressed the mentees' needs and expectations, while it is equally important to get insight in the mentors' perspective and needs, to align mutual expectations and guide mentoring support.
In their review about available mentorship measurement tools, Chen et al. [15] also noted that there was a trend in scale development from focusing on the positive aspects of mentoring to (negative) mentoring experiences and challenges [16,20]. Stories of critical incidents by mentors also revealed that challenges such as a mentee resisting mentoring or not attending meetings led to a dynamic nature of the mentoring process and fluctuations in reasoning and behaviour of the mentor [21]. In a previous study, we interviewed mentors in a medical education program and also found that student-centred critical incidents, such as students with poor reflective, self-directed learning styles or resenting the use of the portfolio, were perceived as a challenge, as well as an opportunity for professional development [12]. Here, a critical incident is the narrative of a challenge, a situation or behaviour that is perceived as potential risk for an effective mentor meeting [22]. Mentoring success is critically dependent on an effective relationship that is characterized by a safe setting and a nurturing and trusting personal connection [18], which could be hampered by critical incidents. For an effective relationship, it is vital that there is a congruency between the expectations and actual experiences of critical incidents for both mentors and mentees. In addition, mentoring is a dynamic and developmental process in which the relationship and interaction between mentor and mentee change over time [23]. Therefore, there is a need for an instrument that can evaluate the dual perspective and level of alignment. The aim of the current study was the development and initial validation of a dual-purpose questionnaire capturing both the mentor and mentee perceptions on the actual and preferred mentoring functions and critical incidents. As a proof of concept, the questionnaire was distributed amongst mentees and mentors of a medical course to perform a needs analysis and to determine whether use of the questionnaire would demonstrate important gaps or misalignments in the mentoring process. This study was done in the context of a longitudinal mentoring programme, in which a portfolio was used for self-regulation of learning (by the mentee), and guidance (by the mentor). The portfolio was assessed at the end of each year. The research questions were: (i) What are the underlying dimensions of the questionnaire? (ii) What are critical incidents in the mentoring process according to mentees and mentors and what is the level of alignment? And (iii) What are the perceptions of mentees on actual and preferred mentoring function in different years of the program?

Setting and participants
This study was performed at Maastricht University, the Netherlands. The participants were mentees and mentors in the graduate-entry program Physician-Clinical Investigator, which is a 4 Year medical course. The set-up of the medical program and mentorship program is described in Heeneman and de Grave [12]. Briefly, the distinguishing features of this medical course are the use of programmatic assessment, and a longitudinal mentoring relationship during the entire course. The assessment of the course as a whole is designed according to the principles of programmatic assessment [24,25]. All assessment and feedback information is collected and aggregated in an electronic portfolio. Students use all assessment and feedback information that the program generates, to self-regulate their learning, in a regular process of reflection captured in the portfolio, which is used in the mentor meetings throughout the years. Students are appointed the same mentor for 4 years. Mentors meet with their students 5 times in Year 1 (39 weeks of pre-clinical education according to the principles of problem-based learning), 3 times in Year 2 (27 weeks, patient-based preclinical education), 4 times in Year 3 (64 weeks, clinical placements/ clerkships) and 2 times in Year 4 (30 weeks, senior elective clinical placement combining clinical care and translational research). The topics of the mentor meetings are not strictly defined (i.e. no set agenda for each meeting) and the content might vary depending on the questions or topics of both the mentee and the mentor. Topics that are regularly discussed are the performance of the student (based on the assessment and feedback information in the portfolio), the subsequent analysis and reflection and follow-up of learning goals, choices that the mentee can make in the curriculum (e.g. content and topic of electives, vocational interest, career choices), personal development, and personal problems. At the end of the Year, the portfolio is assessed by an independent portfolio assessment committee, leading to a high-stake (pass or fail) promotion decision to the next year or graduation. The mentors are faculty members with a PhD degree, working as a clinician-investigator in the university hospital or as an investigator/ scientist in a research group of the university. Every mentor has 4-10 students, with no more than 5 students in the same year group. There is a yearly training in coaching and interviewing techniques at the beginning of the year, that all mentors attend. During the year, there are 5 mentor meetings, in which a topic (e.g. how to address certain assessment results) is discussed and there is opportunity to exchange experiences and sharing of stories.

Design of the questionnaire
The questionnaire was developed using the results of an earlier qualitative interview study with the mentors [12] and mentoring dimensions found in the literature [1,3,5,11,23,[26][27][28][29][30][31][32][33]. Mentoring dimensions included were performance indicators and mentees analysis thereof, choices to be made in curriculum (e.g. electives, research opportunities), vocational interests/ career planning, and personal development and wellness. This resonated with the critical situations as perceived by the mentors concerning the mentees, and the critical incidents that the mentors perceived in their own behaviour and function, and the combination led to the construction of a 38 item questionnaire, in an edition to be used for mentors and an edition to be used for students (see Table 1). The items were verbalised as critical situations in the mentor version and in terms of mentoring functions or behaviours in the mentee version, e.g. 'The mentee has difficulties translating feedback and assessment information for certain competencies (such as the health advocate) into learning objectives' (Mentor version) or 'The mentor helps me to formulate learning objectives in competency domains that are rather thorny (health advocate, for example)' (Mentee version). For each critical situation the respondent indicated in a Likert scale 1-5 whether [1] The situation was experienced in a mentor meeting (1 = never, 5 = at multiple occasions/ repeatedly), as an indication if the situation or critical incident was (actually) experienced (by mentor and mentee); [2] The situation prevents the mentor meeting from being effective (mentor version), or in the mentee version: This situation is important for an effective mentor meeting (1 = disagree, 5 = totally agree) as an indication of what mentor behaviour or function was disruptive or wanted; [3] It is mentors' responsibility/ job to take action when this situation occurs (1 = disagree, not mentors' responsibility, 3 = sometimes, depends on the situation, 5 = agree, this should always be discussed, the mentor should act), as an indication of what mentor behaviour or function was perceived as important by mentor or mentee.
The questionnaire ended with one open question in which the respondents could indicate whether situations that they perceived as important or critical were not present in the items of the questionnaire.
As a pilot, the questionnaire was completed by n = 3 experienced mentors and n = 6 mentees and their feedback was used in the final version of the questionnaire.

Data collection
The questionnaires were distributed to students at the end of Year 1 (n = 38, response rate 76%), end of Year 2 (n = 31, response rate 62%) and end of Year 3 (n = 33, response rate 66%), and to the mentors that completed at least 1 year of mentoring (n = 20, response rate 67%). Response rates were considered acceptable. The questionnaires were distributed and immediately collected in plenary sessions/ lectures; the non-responders were students or mentors that did not attend these sessions.

Statistical analysis
To evaluate whether potential constructs or scales could be identified in the questionnaire, principal component analysis (PCA) was used on the collective data of the mentees (n = 103) [34]. Cut-off for item loading was set at 0.400. After PCA, average scale scores were generated for the sets of questions that showed component loading on the different scale themes. Cronbach's Alpha for these average scale themes were calculated to assess whether reliability of the sub-scales was sufficient. Descriptive statistics (mean ± standard error of the mean (sem)) were used to determine the highest scoring situations from the mentees' (n = 103) and mentors' (n = 20) perspective. Item 23 and 27 were reversed before calculation given the negative factor loadings and negative corrected item-total correlation values in the item-total statistics in the reliability statistics.
To get an impression of the gap between the actual experienced situation (question 1) and what was deemed important for an efficient meeting (question 2), the individual value of question 1 was distracted from the value of question 2. A positive difference was used as an indication of the gap, then what was wanted or deemed important, did not align with what was experienced.
One-way ANOVA with post-hoc Bonferroni testing was used to determine which situations of actual and preferred mentoring functions were distinctive in the mentee groups of different years.
Analysis of open-ended question The remarks were read and aligned according to similar themes and the themes that were mentioned most frequently were reported.

Identified scales in the questionnaire
A PCA was done on the 38 items of the questionnaire, using the responses of the mentees (n = 103) for the question 1: The situation was experienced in a mentor meeting. The rationale was that this question expressed the need and expectations of the mentee. The KMO measure verified the sampling adequacy for the analysis, KMO = .693 (above the minimum of .5). The Bartlett's test of sphericity confirmed that the original correlation matrix was an identity matrix (Chi-square = 1513, df = 703, p < .001) [34]. An initial analysis was done to obtain eigenvalues for each component in the data. Eleven components had eigenvalues over 1 and in combination explained 69.6% of the variance. The scree plot was ambiguous and showed inflexions that would justify retaining less components (not shown). A PCA was performed with 4, 5 and 6 factors, using a Varimax rotation, the results compared and assessed for best fit. Given a certain subjectivity and considering the sample size, a rotated solution with 5 factors was considered to produce an acceptable fit and factorial suitability, both intuitively and conceptually. The five factors explained 48% of variance. Table 1 shows the factor loadings after rotation. Two items did not show sufficient factor loading (< 0.400) to any of the scales (item 20,22).
Regarding the representation of the scales, the items that clustered on factor 1 were on the stimulation of reflection and challenging the mentee suggesting that Scale 1 represented 'Mentor stimulation of reflection' (n = 9 items). In Scale 2, mentor behavior was represented, such as sharing of personal experiences, and being attentive to a good relationship, therefore Scale 2 was indicated as 'Mentor presence' (n = 7 items). In Scale 3, the items focused on personal issues of the mentee, such as motivation, proactive behavior and planning, which was summarized as 'Mentor addressing personal issues' (n = 9 items). In Scale 4, situations revolved around behavioral change through formulation and follow-up of learning goals, Scale 4 was summarized as 'Mentor stimulating conditions and perspectives for behavioral change' (n = 7 items). Scale 5 was relatively small (4 items) and addressed the use of qualities and the identity as a medical doctor, and was designated as 'Mentor addressing professional identity development'.
The overall Cronbach's α of the 38 item questionnaire was .90. The reliability of Scale 1-4 was sufficient, with Cronbach's α between .77 and .81 (Scale 1 α = .81, Scale 2 α = .77, Scale 3 α = .80, Scale 4 α = .79). In Scale 5, two items affected reliability (item 27 and 37). After deletion, reliability of this scale, Cronbach's α .64, was lower as compared to the other scales. Although the correlation between the 2 remaining items (19 and 26) was sufficient (.51), this scale, containing 2 items, has to be viewed with some caution. The responses on the open question on situations that were missed confirmed content validity, as analysis of the answers yielded few omissions. Mentioned were: a lack of items on the portfolio itself from the mentees (as the portfolio is the aggregation instrument, we choose to focus on the content), on a context-specific procedure in the mentoring process of this program (from mentees and mentors), and on the discussion of professionalism issues such as fraud and plagiarism (from mentors).

Critical incidents in the mentoring process according to mentees and mentors and level of alignment
In Table 2a, an overview of high scoring situations or incidents is shown (question 1). The mentee clearly experienced from their mentors a personal approach (Scale 2, Mentor presence), a coaching behaviour on the reflective activities (Scale 1, Mentor stimulation of reflection) and addressing personal issues (Scale 3, Mentor addressing personal issues). This was also seen in the mentor perception and then formulated as critical incident in either reflection, personal issues (Scale 1 and 3) or the mentee (not) sharing their thoughts (Scale 2, Mentee presence, in the perspective of the mentor). However, mentors also put emphasis on the support of behavioural change (Scale 4, Mentor stimulating conditions and perspectives for behavioural change), this was presented more often in the Top 10 of situations from mentors.
The support of behavioural change was resonated in the mentees' Top 10 of what was expected from the mentor (Table 2b). In the high scoring tasks or expectations of the mentor, Scale 4 was presented more often, as was Scale 5 (Mentor addressing professional identity development). The mentors also indicated that stimulating behavioural change was an important task (Scale 4), next to stimulating reflection (Scale 1) and addressing personal issues (Scale 3). The situations of Scale 2 (Mentee presence, in the perspective of the mentor) were not part of the Top 10 of the mentor, suggesting that mentors put less emphasis on taking responsibility for these situations.
Question 2 of the questionnaire was used to explore which situations were deemed important (question 2), but not experienced (question 1). A large difference between question 2 and question 1 was considered as a gap or a need of the mentee, or a potential obstacle for an effective meeting in the perspective of the mentor (Table 3). From the mentees' perspective, there was a gap regarding the stimulation of behavioural change (Scale 4) and attention for personal issues (Scale 3) was also presented more often in the mentee Top 10. From the mentors' perspective, critical situations concerning personal issues (Scale 3) were more often presented. In addition, reflective activities (Scale 1) were mentioned as a potential obstacle for an effective meeting. Item 37 (Mentee list: The mentor is sensitive to my (cultural) background (deleted from Scale 5)) was present in the mentees' and mentors' Top 10, indicating that this was a potential situation or critical incident to consider for a needs analysis.
The dynamic nature of the mentoring process in mentee groups in the different years of the program The mentoring process is expected to change over time when the mentees further develop and progress in their training. In Table 4a, it is shown that the mentees in different year groups experienced that more emphasis was given by the mentor on stimulation of reflection (Scale 1), the addressing of personal issues (Scale 3) and the stimulation of professional identity development as a medical doctor (Scale 5). Mentor presence (Scale 2) seemed constant, except that the mentor was more available for contact during the years. Importantly, the mentees in different year groups evaluated that these mentoring processes occurred with less guiding of the mentor (Situation 22), suggesting that the mentee was able to better self-direct learning over the years.
What the mentees of different year groups evaluated resonated partly with what was expected from the mentor during the program (Table 4b). It was clear that addressing professional identity development was deemed an important task of the mentor during the years (Scale 5). The noted tempering of coaching was expected (item 22), suggesting that the mentees needed less coaching. In addition, and this was less evident in what was evaluated (Table 4a), mentees in different year groups expected their mentor to be more personal (Scale 2, Mentor presence), such as sharing their thoughts and addressing their own experiences.
Comparing the evaluations of mentees in different year groups of what was expected of the mentor (Table 4b) with the apparent need or gap (Table 3) suggested that the discussion of confidential information was perceived as a gap (Scale 4, situation 5, Table 3) by the mentees, but the mentees perceived it less as the task of the mentor during the course of the program (Table 4b). This was considered an inconsistency and was a signal that this needed to be addressed in either the training of students or faculty development of mentors.

Discussion
In this study, a dual-purpose questionnaire was designed and validated that enabled the exploration of both the A. The situation was experienced in a mentor meeting (1 = never, 5 = at multiple occasions/ repeatedly), as an indication if the situation or critical incident was (actually) experienced. In brackets are the mean ± sem).
1 Situation 1 -The mentor exhibits professional behaviour (4.37 ± 0.07) Scale 2: Mentor presence Situation 10 -The mentee has difficulty translating a specific key or enabling competency (such as health advocate) into learning objectives (4.40 ± 0.13) (Scale 4) 2 Situation 23 -The mentor especially considers the mentor meeting ('not') as an assessment moment, rather than a moment of support (4.30 ± 0.09) > Reversed calculation given the negative verbalization  B. It is mentors' responsibility/ job to take action when this situation occurs (1 = disagree, not the mentors' responsibility, 3 = sometimes, depends on the situation, 5 = agree, this should always be discussed, the mentor should act). In brackets are the mean ± sem.     mentor and mentee perceptions of the mentoring process and its critical incidents. PCA showed that specific sets of questions addressed important elements in the mentoring process, such as self-direction of learning and reflection (Scale 1), guidance of behavioural change (Scale 4), addressing personal issues and professional identity development (Scale 3 and 5) and how the mentor and mentee presents oneself in the mentoring relationship (Scale 2) (Research question 1). Mentors and mentees perceived comparable situations as critical or important for an effective mentoring process, although in the mentors' perspective the items of Scale 4 (Mentor stimulating conditions and perspectives for behavioural change) had relatively higher scores, showing that there were nuances in what mentees and mentors find important or critical. In addition, the comparison of what was experienced (questionnaire question 1) and what was expected (questionnaire question 2) showed gaps in the mentoring practices in the context of this program, such as (lack of ) attention for cultural issues (mentee perspective), resonating with the mentor perception that cultural issues were considered challenging (Research question 2). By comparison of the mentee groups in the different years of the program, the dynamic or evolving nature of the mentor process became evident, mentees experienced more emphasis by the mentor on reflection (Scale 1), at a constant level of mentor presence (Scale 2) (Research question 3).
In the current study, as a proof of concept, the questionnaire was completed by the mentees and mentors of a medical program, to assess its suitability for a needs analysis and to determine whether use of the questionnaire would demonstrate gaps or misalignments in the mentoring program. As widely acknowledged, mentoring is an individualized, context-specific and organizational-dependent process [15,18] and there has been calls for evidence-based methods to study mentoring approaches [18]. However, there is also a need for the possibility to tailor methods given the variances in mentoring practice. In terms of evidence-based, the questionnaire was designed on the basis of qualitative data [12] and theory and research about mentoring. In addition, analysis showed that the scales of the questionnaire addressed important, well-established elements in mentoring processes, such as self-regulation of learning, coaching of behavioural change and personal development [1,2,5,11,15,27,29]. In particular, the scale of mentor presence resonates with the themes of teacher authenticity, for which learners emphasized expertise, passion, unicity and distance [35] and the theme of presence, for which teachers emphasize presence as self-awareness, as connection to students, and as connection to subject matter [36]. This is indeed an equilibrium that mentors need to maintain. Also, the theme of professional identity formation is essential in medical education, which was shown to be relational and social in nature, with mentors and experiential learning as important educational strategies [37]. The alignment of the scales with theory and research would make the questionnaire suitable to use in other contexts given that the goal of mentoring is self-direction of learning and there is a longitudinal element, a longer-lasting mentoring relationship. In our context, the open-ended question at the end of the questionnaire showed that mentees and mentors missed items on a context-specific procedure, i.e. a second mentor being present at 1 of the meetings, giving feedback on progress and use of the portfolio [12]. In other contexts, elements not addressed in the questionnaire could be monitored by either additional questions in a follow-up questionnaire or tailored methods such as short interviews to complement the need analysis of the questionnaire.
The questionnaire was specifically designed with the dual purpose to capture the perceptions and expectations of both the mentee and the mentor. The data and information can be used at multiple levels. Firstly, by the program or organisation to evaluate the mentoring system, e.g. does the mentoring system achieve the desired goal? Analysis of the results could lead to either tailored faculty development training, e.g. use of the critical incidents (role-played in video-vignettes) in the training of mentors, emphasis of certain information in guidelines, or a change in the procedures, in order to (further) improve the quality of the mentoring system. There are limited studies on the efficacy and usefulness of a formal evaluation process of the mentoring system [18,38]. Sng et al. [18] emphasized that there is a need to evaluate the development of mentoring processes and the impact on the mentor-mentee relationship. Standardized, validated measurement tools or questionnaires could be helpful for this. The second level is at the individual mentor, release of the results of the individual mentors' group of mentees (anonymized) can inform the mentor if there is a gap or discrepancy in 1 of the scales or if there is a need to work on or get extra training in certain mentoring competencies [30]. Fleming et al. [30] designed a measurement tool to assess mentor competencies in the context of research mentors, which were categorised as maintaining effective communication, aligning expectations, assessing understanding, addressing diversity, fostering independence, and promoting professional development. These competencies are likely to be more generic and also applicable in the educational field. With the exception of addressing diversity, these could be recognized in Scale 2, 3, 4 and 5 of the questionnaire. A third level is the data that can be evaluated in the different years of the program, applicable both at the level of the organisation and the individual mentor. As shown, the mentees valued some aspects more in the different years of the program, or needed less of certain mentoring functions. The role of the mentor clearly changed, students became more self-directed. Mentors need to be aware that this needs another level of guidance or emphasis on other mentor competencies. Mentor training or faculty development for mentors is a valuable tool. It is well known that mentor training leads to more confident mentors and improved mentoring skills also in the perception of the mentees [39,40]. For the program in the current study, analysis of the data of the questionnaire led to a dialogue in the mentor training about the topic of confidentiality of documents that are used in the portfolio, and how to address cultural diversity. In addition, video-vignettes of a mentor-meeting were made, in which the high score critical situations were discussed or the topic of the mentor-mentee dialogue. These vignettes are currently used in the training sessions.
The findings of this study should be interpreted given certain limitations. Firstly, this is an initial validation of the questionnaire. The sample size for a component analysis should have a ratio of 10 participants per item and that number (380) was not reached. Additional studies with larger number of respondents are needed to confirm these first results. Secondly, the components were only identified for the mentees; mentors could have a different representations of the dimensions evaluated in the questionnaire. Thirdly, this study was performed in one institute, and the questionnaire captured a number of context-specific elements, such as the progress test. In addition, the portfolio was assesses at the end of each year, which might have had an effect on the mentoring relationship. Although the mentors are not assessors in our setting and care is taken to strictly divide the mentoring and assessing role, the high-stake context of the portfolio assessment at the end of the year and the use of programmatic assessment are factors that may have affected the results of the questionnaire in this study. Therefore, the results may not be transferable to other settings and if the questionnaire is used in other settings, it should be adapted to the specific context. Nevertheless, the use of a longitudinal, portfolio-based mentoring system is becoming more common in medical education, so the results might be of interest to other programs. Fourthly, although the response rate was considered acceptable (> 60% [41]), the potential difference in non-responders versus responders should be considered. The non-responders were mentees or mentors that were not present at the plenary session where the questionnaire was distributed and collected. In this medical course, these sessions are usually attended by 60-70% of the population. As the questionnaire was anonymous, we could not follow-up on the non-responders. Although it is unlikely that these represent a certain sub-population, we cannot fully eliminate that this may have affected the results. Finally, further development is needed for the use of the questionnaire in an international, non-western context, to address cultural diversity.

Conclusions
In this study a dual purpose questionnaire was designed and validated to capture both the mentor and mentee perceptions of the mentoring process and its critical incidents. The questionnaire contained 5 scales that could be associated to important processes in the mentoring practice and mentor competencies (Research question 1). Analysis of the data of the questionnaire disclosed a number of gaps or misalignments in the mentors' and mentees' experiences and expectations in a medical program (Research question 2). Use of the questionnaire during the course of the program showed that the expectations of the mentees on the mentoring process changed (Research question 3), again disclosing the need to critically examine the mentoring system and attune the mentor training program. Given the individualized, context-specific, dynamic and organizational-dependent nature of mentoring practices, programs would benefit from a regular evaluation of mentoring practices, in order to facilitate organizational revisions of the mentoring system and further development of the mentoring competencies of individual mentors.
Abbreviations PCA: Principal Component Analysis; sem: Standard error of the mean