- Research article
- Open Access
- Open Peer Review
How empathic is your healthcare practitioner? A systematic review and meta-analysis of patient surveys
BMC Medical Educationvolume 17, Article number: 136 (2017)
A growing body of evidence suggests that healthcare practitioners who enhance how they express empathy can improve patient health, and reduce medico-legal risk. However we do not know how consistently healthcare practitioners express adequate empathy. In this study, we addressed this gap by investigating patient rankings of practitioner empathy.
We conducted a systematic review and meta-analysis of studies that asked patients to rate their practitioners’ empathy using the Consultation and Relational Empathy (CARE) measure. CARE is emerging as the most common and best-validated patient rating of practitioner empathy. We searched: MEDLINE, Embase, PsycINFO, Cinahl, Science & Social Science Citation Indexes, the Cochrane Library and PubMed from database inception to March 2016. We excluded studies that did not use the CARE measure. Two reviewers independently screened titles and extracted data on average CARE scores, demographic data for patients and practitioners, and type of healthcare practitioners.
Sixty-four independent studies within 51 publications had sufficient data to pool. The average CARE score was 40.48 (95% CI, 39.24 to 41.72). This rank s in the bottom 5th percentile in comparison with scores collected by CARE developers. Longer consultations (n = 13) scored 15% higher (42.60, 95% CI 40.66 to 44.54) than shorter (n = 9) consultations (34.93, 95% CI 32.63 to 37.24). Studies with mostly (>50%) female practitioners (n = 6) showed 16% higher empathy scores (42.77, 95% CI 38.98 to 46.56) than those with mostly (>50%) male (n = 6) practitioners (34.84, 95% CI 30.98 to 38.71). There were statistically significant (P = 0.032) differences between types of providers (allied health professionals, medical students, physicians, and traditional Chinese doctors). Allied Health Professionals (n = 6) scored the highest (45.29, 95% CI 41.38 to 49.20), and physicians (n = 39) scored the lowest (39.68, 95% CI 38.29 to 41.08). Patients in Australia, the USA, and the UK reported highest empathy ratings (>43 average CARE), with lowest scores (<35 average CARE scores) in Hong Kong.
Patient rankings of practitioner empathy are highly variable, with female practitioners expressing empathy to patients more effectively than male practitioners. The high variability of patient rating of practitioner empathy is likely to be associated with variable patient health outcomes. Limitations included frequent failure to report response rates introducing a risk of response bias. Future work is warranted to investigate ways to reduce the variability in practitioner empathy.
A growing number of randomized trials show that when healthcare practitioners are encouraged to enhance how they express empathy, this can reduce patient pain, [1, 2] lower patient anxiety,  increase patient satisfaction, [4, 5] improve medication adherence, [6, 7] and ameliorate other patient health outcomes. [8,9,10,11]. For example, Chassany’s  empathy training intervention for general practitioners (GPs) (n = 180) reduced pain in osteoarthritis patients (n = 842) by one point on a 10-point VAS (P < 0.0001). These modest benefits are comparable to many pharmaceutical interventions without the adverse events. Hence some authors have recently called for efforts to encourage empathic care .
Supporting the view that empathic care should be encouraged, the extent to which healthcare practitioners express empathy seems to be lacking in some cases, [13,14,15,16] and it may decline with time in practice . The increased burden of paperwork, which takes up a quarter of practitioner time,  may be a barrier to empathic care. However we do not know the prevalence of inadequate empathy. If adequate empathy is rare, then patients and practitioners would both likely benefit if practitioners reinforced how they display empathy. In this study, we aimed to address this gap by conducting a systematic review of patient ratings of practitioner empathy.
An obstacle to empathy research is that practitioner empathy is difficult to define theoretically [19, 20]. At the same time there is an emerging consensus that empathy can be operationalized as a healthcare practitioner’s ability to understand a patient’s point of view, express this understanding, and make a recommendation that reflects the shared understanding [21, 22]. More importantly for present purposes, while empathy is measured using different scales, [23, 24] only one patient-rating of practitioner empathy demonstrated evidence of reliability,  internal validity and consistency: CARE [25, 26]. From a patient health perspective, patient ratings of practitioner empathy are likely to be important. We therefore limited our review to studies that used the CARE measure.
Our primary objective was to measure the extent to which patients (of any type) report their healthcare practitioners (of any type) to be empathic. Our secondary objective was to compare differences in empathy ratings between different practitioner groups (male versus female, consultation times, different types of practitioners, and practitioners in different countries).
Protocol and registration
The protocol for this review was published in PROSPERO (record no. CRD42016037456). We made two changes to the protocol. In the protocol we proposed to analyze CARE scores before and after training, however there were insufficient studies to complete this analysis. We also had insufficient data to perform the proposed analyses comparing practitioners with 10 years or more experience with those who had less than 10 years experience. Neither of these changes was related to our main study aim.
We included any study where patients rated their practitioners’ empathy using the CARE measure. We included ratings of any practitioner including nurses, doctors, alternative practitioners, and medical students. We included studies in any language, provided that the translation of the CARE questionnaire was validated.
We excluded studies that used other measures of empathy, because only CARE has been validated. An added benefit of this approach is that it reduced heterogeneity. We excluded studies where practitioners were reported to have been trained in empathy prior to being rated by patients, since we were interested in pre-training empathy ratings. Where the publications included surveys of more than one group of practitioners the surveys were treated independently.
CARE asks patients to answer 10 questions about the consultation with their practitioner such as whether the practitioner: made the patient feel at ease, really listened and understood, showed compassion, and explained things clearly (see Additional file 1). Each question can be answered by ticking one of five options: poor, fair, good, very good, excellent, does not apply, with the lowest being given a score of ‘1’, and the highest a score of ‘5’. Hence, the maximum CARE score is 50. The developers of the CARE measure have produced normative values based on administration of their questionnaire . They found that the mean CARE score was 45.75, and that 5% of CARE scores fell above 48.32, and 5% fell below 40.72.
Information sources and search
We searched the following databases: MEDLINE (OvidSP) [1946–09/03/2016], Embase (OvidSP) [1974 to 2016 March 08], PsycINFO (OvidSP) [1967–09/03/2016], Cinahl (EBSCOHost), Science & Social Science Indexes (Web of Science, Thomson Reuters) [1945–09/03/2016], Cochrane Central Register of Controlled Trials [Issue 2 of 12, February 2016], Cochrance Database of Systematic Reviews [Issue 3 of 12, March 2016] and Database of Abstracts of Reviews of Effects [issue 2 of 4, April 2015] (via Cochrane Library, Wiley) and Pubmed (see Additional file 2 for search strategy). We also searched the Web of Science Core Collection, Scopus and Google Scholar for studies that have cited the CARE measure,  and any record that includes the full name of the measure (consultation and relational empathy). Additionally, we contacted authors of studies to ask whether they are aware of any additional studies.
Data collection, extraction, and management
After piloting the extraction sheet by two authors (JH, KM), two authors (LS, AU) independently screened all titles and abstracts and extracted data. Discrepancies were resolved with discussion by a third author (JH). We extracted data about: type of practitioner, percentage female practitioners, country, average CARE score, and individual CARE scores (where available).
We assessed risk of bias within studies by measuring response rates. It was not feasible to assess risk of bias across studies, for example by conducting a funnel plot since there was no reason to suspect higher (or lower) CARE scores varying with sample size. There was insufficient data to investigate risk of bias across studies.
Statistical analyses were performed using the program Comprehensive Meta Analysis . We provided the mean and 95% confidence interval of the CARE score. We contacted study authors via email to obtain missing data with respect to participants, outcomes, or summary data. Participant data were analysed as reported. We conducted preplanned subgroup analyses to assess the extent to which proportion of female practitioners, consultation duration, type of practitioner, and country played a role. To evaluate the predictive value of gender and consultation time with respect to CARE scores we performed a multivariable regression analysis, with gender and consultation time included as the independent variables, and CARE scores included as the dependent variable.
Sensitivity and subgroup analyses
We conducted four preplanned subgroup analyses.
Longer (>10 min) consultations compared with shorter (≤ 10 min) consultations. This was based on average consultation times in UK general practice .
Gender: average empathy ratings of mostly (>50%) female compared with average ratings of mostly (>50%) male practitioners.
When there were at least three studies within the same country, we conducted a subgroup analysis with those three countries, and compared it with the complement. We chose three studies because fewer than three makes meta-analysis problematic and increases the likelihood of basing conclusions on anomalous results.
Types of practitioners (physicians, medical students, alternative practitioners, etc.). If there were at least three studies that measured patient ratings of specific types of practitioners, we conducted a subgroup analysis of this group, and compared it with the complement.
Our search yielded 392 independent records, of which 69 studies met our inclusion criteria (see Supplemental Material). Of these, 64 independent study groups (within 51 publications) had sufficient data to be included in our meta-analysis (see Table 1, Fig. 1, Additional file 3). See Additional file 4 for excluded studies.
The 64 study groups were from 15 different countries: UK (n = 23), USA (n = 6), Hong Kong (n = 9), Germany (n = 7), Australia (n = 4), China (n = 6), Ethiopia (n = 2), South Korea (n = 2), and one study from each of Brazil, Croatia, France, India, and Japan. The types of practitioners included primary care physicians, practitioners of Traditional Chinese Medicine (TCM), medical students, allied health professionals, and other specialists.
The average CARE score for the 64 study groups was 40.48 (95% CI, 39.24 to 41.72) (see Table 2, Fig. 2). Twenty-two studies reported consultation times. Longer consultations (≥10 min; n = 13) scored higher (42.60, 95% CI 40.69 to 44.52) than shorter (<10 min; n = 9) consultations (34.93, 95% CI 32.66 to 37.21). This difference of 7.67 points (15%) between longer and shorter consultations was highly significant (P < 0.001). Twelve studies provided data on the gender of practitioners (Table 2). Studies with predominantly female practitioners (n = 6) showed higher empathy scores (42.77, 95% CI 38.98 to 46.56) than those with predominantly male practitioners (n = 6, 34.85, 95% CI 30.98 to 38.71). This difference of 7.92 points (16%) was statistically significant (P = 0.004).
Fifty-five study groups could be included in the pre-planned subgroup analysis by country (Table 2). Highest empathy scores were found in Australia (n = 4, 44.88, 95% CI 42.63 to 47.14), USA (n = 6, 44.56, 95% CI 42.71 to 46.40) and UK (n = 23, 43.07, 95% CI 42.11 to 44.04). Scores were lowest in Hong Kong (n = 9, 33.46, 95% CI 31.94 to 34.99). Scores in Germany (n = 7, 40.72, 95% CI 39.02 to 42.44) and China (n = 6, 40.61, 95% CI 38.68 to 42.55) were in-between. We added an exploratory analysis by country including all 64 study groups and found that scores in India (n = 1, 29.49, 95% CI 24.18 to 34.80) were lower than those in Hong Kong. Scores in the UK, USA and Australia were highest (See Additional file 5).
We found at least three studies each measured empathy in the following types of providers: physicians, medical students, allied health professionals, and practitioners of Traditional Chinese Medicine (Table 2). There was statistically significant heterogeneity between these (P = 0.032), with allied health professionals scoring the highest (n = 5, 45.29, 95% CI 41.38 to 49.20), and physicians scoring the lowest (n = 39, 39.68, 95% CI 38.29 to 41.08). We found no differences between primary care physicians, specialists, and complementary and alternative medicine (CAM) providers, (P = 0.386) (see Table 3).
A multivariable regression analysis was performed to analyze the predictive value of gender and consultation time with respect to CARE scores. Consultation duration was the only significant predictor for CARE scores (Table 4).
Risk of bias
The response rate was reported in 20 of the 53 studies (38%), with the average rate being high (69%, ranging from 21% to 100%). The uncertainty about the remaining response rates entails a risk of response bias.
We found that patient rating of practitioner empathy is highly variable, with some practitioners being reported to express empathy much less effectively to patients than others. Female practitioners, allied health professionals, those who spend more time with patients, and practitioners from Australia, the US, and the UK seem to display empathy more effectively than other practitioners. In addition, the average care score we identified was low in comparison with normative values, falling in the lowest 5% of CARE scores measured by the developers of the questionnaire . The highly variable scores we found are likely to be associated with variable patient outcomes [9,10,11, 30].
Strengths and limitations
This is the first systematic review to investigate the extent to which healthcare practitioners are empathic. Another strength is that it used measures of the only validated patient-rated measure of practitioner empathy. As such, it provides a good indication of the differences between perceived empathy across gender, disciplines, and countries.
There are also several potential limitations. First, our method for measuring the difference between female and male practitioners was likely to be an underestimate. If studies with majority female practitioners resulted in greater patient-rated empathy, it is reasonable to assume that if all the practitioners were female, the difference between male and female practitioners would have been greater. In the context of this observational research we do not know whether the additional time caused female practitioners to be more empathic, or whether female practitioners’ higher empathy caused them to spend more time with patients, or whether these two factors cannot be separated. Second, response bias [26, 31, 32] could have affected the results. Patients who know they are rating their practitioners may wish to please their practitioners,  for example by giving them higher scores than they otherwise would [31, 32]. The lack of response rate reporting in most of the studies makes the extent of this problem unclear. Furthermore, selection bias might have influenced the results: the CARE questionnaire could be delivered in areas where the empathy of the practitioners is believed to be anomalous (either particularly high or particularly low). Next, the comparison between countries could have been influenced by the number of studies per country. Specifically, some of the countries with low scores had very few studies (Croatia had 1, Ethiopia had 2, and India had 1). Moreover in spite of validation of CARE translations, patients in different countries may have divergent prior expectations and beliefs about what it means to be an empathic practitioner. Finally, the comparison with normative values (resulting in the average score we found being in the lowest 5%) is problematic. In spite of being relatively low, the average score is still above 40. Further work needs to be done to investigate the meaning of average CARE scores.
Implications for clinical practice and clinical research
The way different healthcare practitioners express empathy to patients is low (on average) in comparison with normative scores, and highly variable. Given the likely association between practitioner empathy and patient outcomes, further research is now warranted to investigate how these findings can be used to improve patient care. Future reports of the CARE questionnaire should include all the potentially relevant factors we have identified here, especially details about response rates, and also consultation duration, gender, experience of practitioners, and other demographic details of patient raters and practitioners.
Chassany O, Boureau F, Liard F, Bertin P, Serrie A, Ferran P, Keddad K, Jolivet-Landreau I, Marchand S. Effects of training on general practitioners' management of pain in osteoarthritis: a randomized multicenter study. J Rheumatol. 2006;33(9):1827–34.
Vangronsveld KL, Linton SJ. The effect of validating and invalidating communication on satisfaction, pain and affect in nurses suffering from low back pain during a semi-structured interview. Eur J Pain. 2012;16(2):239–46.
Fujimori M, Shirai Y, Asai M, Kubota K, Katsumata N, Uchitomi Y. Effect of communication skills training program for oncologists based on patient preferences for communication when receiving bad news: a randomized controlled trial. J Clin Oncol. 2014;32(20):2166–72.
Soltner C, Giquello JA, Monrigal-Martin C, Beydon L. Continuous care and empathic anaesthesiologist attitude in the preoperative period: impact on patient anxiety and satisfaction. Brit J Anaesth. 2011;106(5):680–6.
Little P, White P, Kelly J, Everitt H, Mercer S. Randomised controlled trial of a brief intervention targeting predominantly non-verbal communication in general practice consultations. Brit J Gen Pract. 2015;65(635):e351–6.
Kim SS, Kaplowitz S, Johnston MV. The effects of physician empathy on patient satisfaction and compliance. Eval Health Prof. 2004;27(3):237–51.
Attar HS, Chandramani S. Impact of physician empathy on migraine disability and migraineur compliance. Annals of Indian Academy of Neurology. 2012;15(Suppl 1):S89–94.
Di Blasi Z, Harkness E, Ernst E, Georgiou A, Kleijnen J. Influence of context effects on health outcomes: a systematic review. Lancet. 2001;357(9258):757–62.
Derksen F, Bensing J, Lagro-Janssen A. Effectiveness of empathy in general practice: a systematic review. Br J Gen Pract. 2013;63(606):e76–84.
Kelm Z, Womer J, Walter JK, Feudtner C. Interventions to cultivate physician empathy: a systematic review. BMC Med Educ. 2014;14:219.
Mistiaen P, van Osch M, van Vliet L, Howick J, Bishop FL, Di Blasi Z, Bensing J, van Dulmen S. The effect of patient-practitioner communication on pain: a systematic review. Eur J Pain. 2015;20(5):675–88.
Howick J, Rees S. Overthrowing barriers to empathy in healthcare: empathy in the age of the internet. J R Soc Med. 2017:141076817714443. http://journals.sagepub.com/doi/full/10.1177/0141076817714443.
Parker SM, Clayton JM, Hancock K, Walder S, Butow PN, Carrick S, Currow D, Ghersi D, Glare P, Hagerty R, et al. A systematic review of prognostic/end-of-life communication with adults in the advanced stages of a life-limiting illness: patient/caregiver preferences for the content, style, and timing of information. J Pain Symptom Manag. 2007;34(1):81–93.
Abraham A. Care and compassion: report of the health service ombudsman on ten investigations into NHS care of older people, fourth report if the health service commissioner for England; session 2010–2011. In. Edited by service PaH. London: The Stationary Office; 2011.
Francis R. Report of the mid Staffordshire foundation NHS trust public inquiry volumes 1–3, HC-898-I-III. In. London: The Stationary Office; 2013.
Davies HT, Mannion R. Will prescriptions for cultural change improve the NHS? BMJ. 2013;346:f1305.
Neumann M, Edelhauser F, Tauschel D, Fischer MR, Wirtz M, Woopen C, Haramati A, Scheffer C. Empathy decline and its reasons: a systematic review of studies with medical students and residents. Acad Med. 2011;86(8):996–1009.
Magasine GP. Quarter of GPs spend half their time on paperwork. In: GP Online: Haymarket Media Group Limited; 2012.
Aring CD. Sympathy and empathy. J Am Med Assoc. 1958;167(4):448–52.
Halpern J. From detached concern to empathy: humanizing medical practice. Oxford: Oxford University Press; 2011.
Larson EB, Yao X. Clinical empathy as emotional labor in the patient-physician relationship. JAMA. 2005;293(9):1100–6.
Decety J, Fotopoulou A. Why empathy has a beneficial impact on others in medicine: unifying theories. Front Behav Neurosci. 2014;8:457.
Hemmerdinger JM, Stoddart SDR, Lilford RJ. A systematic review of tests of empathy in medicine. BMC Medical Education. 2007;7.
Hojat M, Gonnella JS, Nasca TJ, Mangione S, Veloksi JJ, Magee M. The Jefferson scale of physician empathy: further psychometric data and differences by gender and specialty at item level. Acad Med. 2002;77(10 Suppl):S58–60.
Mercer SW, Maxwell M, Heaney D, Watt GC. The consultation and relational empathy (CARE) measure: development and preliminary validation and reliability of an empathy-based consultation process measure. Fam Pract. 2004;21(6):699–705.
Mercer SW, McConnachie A, Maxwell M, Heaney D, Watt GC. Relevance and practical use of the consultation and relational empathy (CARE) measure in general practice. Fam Pract. 2005;22(3):328–34.
CARE Measure. http://www.measuringimpact.org/s4-care-measure. Accessed 31 July 2017.
Borenstein M, Hedges LV, Higgins JPT. Comprehensive meta-analysis. In., 2.2.064 edn. Biostat: Englewood, NJ; 2014.
NHS general practitioner (GP) services. http://www.nhs.uk/NHSEngland/AboutNHSservices/doctors/Pages/gp-appointments.aspx. Accessed 31 July 2017.
Kelley JM, Kraft-Todd G, Schapira L, Kossowsky J, Riess H. The influence of the patient-clinician relationship on healthcare outcomes: A systematic review and meta-analysis of randomized controlled trials. PLoS One. 2014;9(4):e94207.
Worsley A, Baghurst KI, Leitch DR. Social desirability response bias and dietary inventory responses. Hum Nutr Appl Nutr. 1984;38(1):29–35.
Hróbjartsson A, Kaptchuk T, Miller FG. Placebo effect studies are susceptible to response bias and to other types of biases. J Clin Epidemiol. 2011;64(11):1223–9.
Allan LG, Siegel S. A signal detection theory analysis of the placebo effect. Eval Health Prof. 2002;25(4):410–20.
Bridget Johnson, Stewart Mercer, Vincent Chung, and Michelle Dossett shared their data with us when it was missing from published reports. Sir Muir Gray came up with the title for the paper. Claire Madigan provided some useful suggestions to improve the manuscript.
JH was supported by the Nuffield Department of Primary Care Health Sciences. KM received support from the Theophrastus Foundation and the Schweizer-Arau Foundation, Germany.
Availability of data and materials
All data available in manuscript, supplemental material, or by contacting authors.
Ethics approval and consent to participate
Not relevant (systematic review).
Consent for publication
Not relevant (systematic review).
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The CARE Measure Questionnaire © Stewart W Mercer 2004. Actual questionnaire used within studies to measure patient perception of practitioner empathy (permission obtained). (DOCX 108 kb)
Search Strategy. Search terms used to identify studies for electronic searches. (DOCX 12 kb)
Studies that used the CARE measure (starred (*) studies not included in meta-analysis). References to studies not included in meta-analysis because they did not meet the inclusion criteria. (DOCX 141 kb)
Reasons for excluding studies identified in the search that were excluded from meta-analysis (n = 23). Summary of justification for not including studies in meta-analysis. (DOCX 48 kb)
CARE scores by country (all 64 studies included). Additional subgroup analysis by country. (DOCX 16 kb)