Advertisement
Review paper|Articles in Press

Effectiveness of nontechnical skills educational interventions in the context of emergencies: A systematic review and meta-analysis

Open AccessPublished:February 27, 2023DOI:https://doi.org/10.1016/j.aucc.2023.01.007

      Abstract

      Introduction

      In recent years, the importance of training healthcare professionals in nontechnical skills using effective methodologies has been increasingly recognised as a means of preventing clinical errors in the practice of health care. The aim of this study was to evaluate the effectiveness of educational interventions on nontechnical skills in the emergency medical services and/or critical care unit settings.

      Methods

      A systematic search was carried out in the PubMed, SCOPUS, CINAHL, and Web of Science databases according to predetermined inclusion and exclusion criteria. After the initial search, 7952 records were selected after duplicates removed. Finally, a selection of 38 studies was included for quantitative analysis. Separate meta-analyses of standardised mean changes were carried out for each outcome measure assuming a random-effects model. Cochran's Q-statistic and I2 index were applied to verify study heterogeneity. Weighted analyses of variance and meta-regressions were conducted to test the influence of potential moderators and funnel plots using Duval and Tweedie's trim-and-fill method, and Egger's regression test were used to examine publication bias.

      Results

      All the variables analysed had a significant effect size, with the exception of situational awareness (d+ = −0.448; 95% confidence interval [CI] = −1.034, 0.139). The highest mean effect size was found for knowledge (d+ = −0.925; 95% CI = −1.177, −0.673), followed by the mean effect sizes for global nontechnical skills (d+ = −0.642; 95% CI = −0.849, −0.434), team nontechnical skills (d+ = −0.606; 95% CI = −0.949, −0.262), and leadership nontechnical skills (d+ = −0.571; 95% CI = −0.877, −0.264). Similar mean effect sizes were found for attitude (d+ = −0.406; 95% CI = −0.769, −0.044), self-efficacy (d+ = −0.469; 95% CI = −0.874, −0.064), and communication nontechnical skills (d+ = −0.458; 95% CI = −0.818, −0.099). Large heterogeneity among the standardised mean changes was found in the meta-analyses (I2 > 75% and p < .001), except for self-efficacy where I2 = 58.17%, and there was a nonstatistical result for Cochran's Q. This great variability is also reflected in the forest plots.

      Discussion

      The use of simulation interventions to train emergency and critical care healthcare professionals in nontechnical skills significantly improves levels of knowledge, attitude, self-efficacy, and nontechnical skills performance.

      Keywords

      1. Introduction

      Clinical errors arising from the practice of health care significantly affect patient safety, increasing mortality, morbidity, and the financial costs of healthcare systems.
      • de Montalvo-Jääskeläinen F.
      • Altisent-Trota R.
      • Bellver-Capella V.
      • Cadena-Serrano F.
      • de los Reyes-López M.
      • Gándara-del -Castillo A.
      • et al.
      Report of the Spanish Bioethics Committee on the ethical aspects of patient safety and, specifically, on the implementation of an effective system for reporting safety incidents and adverse events.
      One of the first reports published by the U.S. Institute of Medicine (1999) estimated that 1 million deaths per year were attributable to preventable clinical errors.
      • Kohn L.T.
      To err is human: building a safer health system.
      More recently, a report published by the Organization for Economic Co-operation and Development
      Organization for Economic Cooperation and Development
      The economics of patient safety from analysis to action.
      noted that more than one in 10 patients are harmed by clinical errors during their care, causing more than 3 million deaths per year at an economic cost of between $1 and 2 trillion annually.
      Organization for Economic Cooperation and Development
      The economics of patient safety from analysis to action.
      A study conducted by the Joint Commission
      The Joint Commission
      Sentinel event data: root causes by event type 2004-2015.
      from 2004 to 2015 identified that the three leading causes of these clinical errors were mainly human factors in general, followed by lack of communication and leadership. Both are embraced in the broad universe of human factors and linked to the professionals' ability to perform nontechnical skills. These nontechnical skills are defined as “cognitive, social, and personal resource skills that complement technical skills and contribute to safe and efficient task performance”
      • Flin Rhona
      • O'Connor Paul
      • Crichton Margaret
      Safety at the sharp end: a guide to non-technical skills.
      and can be classified into seven categories:
      • Leonard M.
      • Graham S.
      • Bonacum D.
      The human factor: the critical importance of effective teamwork and communication in providing safe care.
      skills related to situational awareness, decision-making, communication, teamwork, leadership, stress management, and coping with fatigue. Therefore, preventing clinical errors and delivering high-quality care outcomes requires not only high levels of technical knowledge, clinical competence, and good adaptability but also the acquisition and development of nontechnical skills.
      • Gordon M.
      • Darbyshire D.
      • Baker P.
      Non-technical skills training to enhance patient safety: a systematic review.
      ,
      • Myers J.A.
      • Powell D.M.C.
      • Psirides A.
      • Hathaway K.
      • Aldington S.
      • Haney M.F.
      Non-technical skills evaluation in the critical care air ambulance environment: introduction of an adapted rating instrument--an observational study.
      Training healthcare teams in these skills would therefore appear to be essential.
      Various training methodologies have been developed for teaching and learning nontechnical skills.
      • Jafri F.
      • Mirante D.
      • Ellsworth K.
      • Shulman J.
      • Dadario N.
      • Williams K.
      • et al.
      A microdebriefing crisis resource management program for simulated pediatric resuscitation in a community hospital: a feasibility study.
      These include the so-called Crisis Resource Management (CRM), developed in the aviation field in the late 1970s
      • Cooper G.E.
      • White M.D.
      • Lauber J.K.
      Resource management on the flight deck.
      and extrapolated in the 1990s to services handling crisis situations, such as emergency medical services and critical care units,
      • Gaba D.M.
      • Fish K.J.
      • Howard S.K.
      Crisis management in anesthesiology.
      among others.
      The CRM methodology effectively describes and leverages communication and other components of teamwork to improve performance and the safety of care and, ultimately, the quality of care.
      • Pizzi L.
      • Goldfarb N.I.
      • Nash D.B.
      Crew resource management and its applications in medicine.
      Through clinical simulation and the subsequent reflective process,
      • Nader M.
      • Tasdelen-Teker G.
      • DeStephens A.J.
      • Lampotang S.
      • Prelipcean I.
      • Smith R.D.
      • et al.
      Simulation use in outreach setting: a novel approach to building sustainability.
      ,
      • Langdalen H.
      • Abrahamsen E.B.
      • Sollid S.J.M.
      • Sørskår L.I.K.
      • Abrahamsen H.B.
      A comparative study on the frequency of simulation-based training and assessment of non-technical skills in the Norwegian ground ambulance services and helicopter emergency medical services.
      the CRM methodology offers a stepwise way for participants to overcome individual, team, and contextual pressures in a controlled environment.
      • Alinier G.
      Developing high-fidelity health care simulation scenarios: a guide for educators and professionals.
      The CRM training methodology has been widely used internationally and in the healthcare field for decades.
      • Gaba D.M.
      Crisis resource management and teamwork training in anaesthesia.
      To the best of our knowledge, two meta-analyses have specifically evaluated the effectiveness of CRM training.
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      ,
      • O'Connor P.
      • Campbell J.
      • Newon J.
      • Melton J.
      • Salas E.
      • Wilson K.A.
      Crew resource management training effectiveness: a meta-analysis and some critical needs.
      The analysis by O'Connor et al.
      • O'Connor P.
      • Campbell J.
      • Newon J.
      • Melton J.
      • Salas E.
      • Wilson K.A.
      Crew resource management training effectiveness: a meta-analysis and some critical needs.
      included 74 studies from different contexts where CRM had been applied, such as the commercial and military aviation industry, the nuclear industry, offshore oil production, commercial shipping, and health care. The authors concluded that the results were insufficient to demonstrate its effectiveness due to a lack of data and recommended a more rigorous evaluation of the methodology and greater access to data and resources. Meanwhile, O'Dea et al.
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      focused on the evaluation of CRM interventions and the training of critical care professionals in particular, finding them to be effective in the short term. Recent systematic reviews have been published which collect and report in depth many important aspects of CRM methodology
      • Buljac-Samardžić M.
      • Dekker-van Doorn C.M.
      • Maynard M.T.
      What do we really know about crew resource management in healthcare?: an umbrella review on crew resource management and its effectiveness.
      ,
      • Gross Benedict
      • Leonie Rusin
      • Jan Kiesewetter
      • Zottmann Jan M.
      • Fischer Martin R.
      • Stephan Prückner
      • et al.
      Crew resource management training in healthcare: a systematic review of intervention design, training conditions and evaluation.
      (or interventions with other methodologies that address nontechnical skills
      • Low X.M.
      • Horrigan D.
      • Brewster D.J.
      The effects of team-training in intensive care medicine: a narrative review.
      ,
      • Buljac-Samardzic M.
      • Doekhie Kirti D.
      • van Wijngaarden J.D.H.
      Interventions to improve team effectiveness within health care: a systematic review of the past decade.
      ). Such reviews discuss in depth the design and structure of the interventions, content, coaches, evaluation methods, and the effectiveness of the interventions, among other issues. However, since 2014, no further evidence has been found in the secondary literature on the effectiveness of educational interventions using the CRM methodology (or covering concepts related to those addressed in the CRM of nontechnical skills) at the meta-analytic level. Although the CRM methodology is widely used, there are also interventions in the literature from other theoretical frameworks, such as TeamSTEPPS (Team Strategies and Tools to Enhance Performance and Patient Safety) or SBAR (Situation-Background-Assessment-Recommendation), as well as high-fidelity simulation interventions designed exclusively and without following a specific theoretical framework with the aim to train nontechnical skills.
      • Ounounou E.
      • Aydin A.
      • Brunckhorst O.
      • Khan M.S.
      • Dasgupta P.
      • Ahmed K.
      Nontechnical skills in surgery: a systematic review of current training modalities.
      ,
      • Griffin C.
      • Aydın A.
      • Brunckhorst O.
      • Raison N.
      • Khan M.S.
      • Dasgupta P.
      • et al.
      Non-technical skills: a review of training and evaluation in urology.
      The present meta-analysis was therefore conducted to evaluate the effectiveness of educational interventions involving nontechnical skills (communication, leadership, teamwork, situational awareness, decision-making, fatigue, and/or stress management) and targeting healthcare professionals (nursing, medical, and/or emergency health technicians) in critical care units and emergency medical services, both in-hospital and out-of-hospital.

      2. Methods

      This systematic review and meta-analysis was conducted according to the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA).
      • Page M.J.
      • McKenzie J.E.
      • Bossuyt P.M.
      • Boutron I.
      • Hoffmann T.C.
      • Mulrow C.D.
      • et al.
      The PRISMA 2020 statement: an updated guideline for reporting systematic reviews.
      The work performed was registered on Open Science Framework (OSF) on March 3rd, 2022 (Registration DOI: 10.17605/OSF.IO/5QPGH).

      2.1 Study selection

      Studies were selected according to the following inclusion criteria: (i) An educational intervention aimed at healthcare professionals had been carried out; (ii) the intervention was aimed at medical professionals, nurses, emergency health technicians, and/or paramedics; (iii) the purpose of the intervention was the acquisition of nontechnical skills; (iv) the setting for the intervention was the emergency medical services (in-hospital/out-of-hospital) and/or critical care unit settings; (v) the articles presented the results in terms of a pre–post or postintervention comparison between groups (randomised controlled trial “[RCT]”, controlled clinical trial “[CCT]”, control trial, quasi-experimental).
      In addition, the exclusion criteria were as follows: (i) Studies that were not written in English or Spanish, (ii) the intervention was aimed at undergraduate students, and (iii) those that did not provide sufficient statistical data to calculate the effect size.

      2.2 Search strategies

      The search for studies was conducted in January 2021, using a variety of strategies in order to collect as many articles as possible. The search strategies were developed in conjunction with experts in bibliographic searches in the field of health sciences. In the first instance, a search was carried out in the MEDLINE/PubMed, CINAHL, Web of Science, and Scopus electronic databases. As part of the search strategy, the PubMed thesaurus was consulted, using the following Medical Subject Headings (MeSH) terms: “Crew Resource Management”, “simulation training”, “high fidelity simulation training”, “emergency medicine”, “emergency responders”, “emergency nursing”, “emergency medical technicians”, “emergencies”, “critical care”, and “emergency medical services”. The natural language search terms included in the title and/or abstract fields were: “Crew Resource Management”, “Crisis Resource Management”, “CRM”, “non-technical skills”, “simulation training”, “high fidelity simulation training”, “critical care”, “emergency”, “critical care nursing”, and “emergency medical technicians”. The search strategy used for each database is outlined in supplementary file 1.
      A manual search was also performed in journals aimed at the training of healthcare professionals with a specific focus on simulation methodology: “Emergencies”, “Simulation in Healthcare”, “International Emergency Nursing”, “Medical Education”, “Medical Teacher”, “Advances in Simulation”, “BMC Medicine”, and “Nurse Education Today”. No time restrictions were applied in any of these cases, with the aim of being as exhaustive as possible, compiling all existing interventions that meet the established criteria and analysing their effectiveness. Given that simulation methodology has a track record going back several decades and its effectiveness is unlikely to have been impacted by technological advances, no time restrictions were applied to the search. Lastly, the bibliographic references of the articles included were reviewed to find other studies that could be relevant to the meta-analysis but that had not been found using the aforementioned strategies. The process of evaluating the eligibility of the studies was carried out independently by two investigators (M.S.M. and S.E.). Discrepancies were resolved by a third investigator (M.J.C.M.) in order to reach consensus.

      2.3 Data extraction

      A coding protocol was developed to extract the study characteristics, classifying the extracted variables into three categories: substantive, methodological, and extrinsic.
      • Lipsey M.W.
      Identifying interesting variables and analysis opportunities.
      The following substantive variables were coded: (i) level of knowledge; (ii) attitude score; (iii) self-efficacy score; (iv) overall nontechnical skills score; (v) score for teamwork-related skills; (vi) leadership skills score; (vii) communication skills score; (viii) score for situational awareness skills; (ix) medical specialty (accident and emergency, helicopter emergency medical service, or critical care unit); (x) type of methodology (passive, interactive, mixed, or unspecified); (xi) theoretical model (CRM, other, unspecified); (xii) type of simulation (high fidelity, low fidelity, none), and (xiii) scenario objectives (CRM [design to train elements of the CRM or specifically mentioned], nontechnical skills, others [design to train skills other than nontechnical skills], unspecified).
      The following methodological variables were extracted: (i) study design; (ii) intervention comparison group; (iii) assessment period; (iv) sample size of the pretest experimental group; (v) sample size of the pretest control group; (vi) sample size of the posttest experimental group; (vii) sample size of the posttest control group; (viii) sample size of the follow-up experimental group; (ix) sample size of the follow-up control group; (x) randomisation of the groups; (xi) dropout rate; and (xii) the domains of the quality scale Medical Education Research Study Quality Instrument (MERSQI)
      • Reed D.A.
      • Beckman T.J.
      • Wright S.M.
      • Levine R.B.
      • Kern D.E.
      • Cook D.A.
      Predictive validity evidence for medical education research study quality instrument scores: quality of submissions to JGIM's Medical Education Special Issue.
      ,
      • Cook D.A.
      • Reed D.A.
      Appraising the quality of medical education research methods: the medical education research study quality instrument and the newcastle-ottawa scale-education.
      (see Assessment of Risk Bias of Selected Studies section). Finally, the extrinsic variables coded were as follows: (i) authors; (ii) year; (iii) educational background of the lead author of the study; and (iv) publication status (published or unpublished and whether it has an ISBN or ISSN).
      The coding and extraction of these variables was carried out independently by two researchers (M.S.M. and S.E.). Disagreements between the two researchers who coded these characteristics were resolved with a third researcher (M.J.C.M.).

      2.4 Assessment of risk of bias of selected studies

      The quality of the studies was measured using the MERSQI,
      • Reed D.A.
      • Beckman T.J.
      • Wright S.M.
      • Levine R.B.
      • Kern D.E.
      • Cook D.A.
      Predictive validity evidence for medical education research study quality instrument scores: quality of submissions to JGIM's Medical Education Special Issue.
      which was designed to measure the quality of experimental, quasi-experimental, and observational studies, specifically in medical education research. The tool includes 10 items clustered in six domains (“study design”; “sampling”; “data type”; “validity”; “analysis”; “outcome”) and a final overall score. These criteria form six domains, each with a maximum score of 3 and a minimum of 0 or 1, that sum to produce a total score that ranges from 5 to 18. It has generally higher interrater reliability (0.68–0.89) and good relationship with the scale Newcastle–Ottawa Scale-Education (rho = 0.60) for studies of simulation-based trained.
      • Cook D.A.
      • Reed D.A.
      Appraising the quality of medical education research methods: the medical education research study quality instrument and the newcastle-ottawa scale-education.
      The evaluation was performed in pairs, with each researcher independently evaluating the studies, and then discrepancies were resolved by consensus. All participating researchers (M.S.M., S.E., M.R.A., M.J.C.M., and R.J.S.) were previously trained in extracting and evaluating the methodological quality of studies and in the use of the evaluation instrument used.

      2.5 Computation of effect sizes

      For maximum comprehensiveness, this meta-analysis aimed to include studies with and without a comparison group. The unit of analysis was therefore the group, not the study, and the effect size was the standardised mean rate of change. This index was calculated as the pretest-posttest mean and this difference was divided by the pretest standard deviation: d=c(m)(y¯prey¯post)/Spre, with c(m) being a correction factor for small sample sizes.
      • Borenstein M.
      • Hedges L.V.
      • Higgins J.P.T.
      Introduction to meta-analysis.
      Negative d values indicated an improvement in the group from the pretest to the posttest. For standardisation, absolute values of d of around 0.2, 0.5, and 0.8 can be interpreted as small, moderate, and large magnitudes, respectively.
      • Cohen J.
      Statistical power analysis for the behavioral sciences.
      In order to calculate the standardised mean changes when the necessary information (means and standard deviations) was not reported in the study, the authors of the corresponding studies were contacted to request the necessary missing data. In the absence of a reply, effect sizes were calculated using different procedures depending on the information available in the study. For example, conversion equations from significance tests (e.g., t-test and Wilcoxon test) and sample size were used.
      • Borenstein M.
      • Hedges L.V.
      • Higgins J.P.T.
      Introduction to meta-analysis.
      When results were reported by means of sample size, median, range, and/or interquartile range, the means and standard deviations were estimated by different approximation methods.
      • Wan X.
      • Wang W.
      • Liu J.
      • Tong T.
      Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range.
      Finally, effect sizes were calculated separately for each construct evaluated: knowledge, attitude, self-efficacy, global nontechnical skills, teamwork, communication, leadership, and situational awareness.

      2.6 Statistical analyses

      Separate meta-analyses were carried out for the effect sizes of each outcome measure in at least three studies, by assuming a random-effects model. This model involves weighting each standardised mean change by its inverse variance, defined as the sum of the within-study variance and between-study variance, the latter being estimated by restricted maximum likelihood.
      • Cooper H.M.
      • Hedges L.V.
      • Valentine J.C.
      The handbook of research synthesis and meta-analysis.
      For each outcome analysed, a forest plot was constructed showing the individual effect sizes, and the mean effect size calculated with a 95% confidence interval (CI) using the method proposed by Hartung.
      • Hartung J.
      An alternative method for meta-analysis.
      To assess heterogeneity among the individual standardised mean changes, Cochran's heterogeneity Q statistic and the I2 index were calculated. If heterogeneity was found and the number of studies for the outcome was at least 10, an analysis of potential moderator variables was performed.
      • Aguinis H.
      • Gottfredson R.K.
      • Wright T.A.
      Best-practice recommendations for estimating interaction effects using meta-analysis.
      For this purpose, mixed-effects models were assumed applying the F statistic described by Knapp-Hartung for testing the significance of the moderator variable.
      • Viechtbauer W.
      • López-López J.A.
      • Sánchez-Meca J.
      • Marín-Martínez F.
      A comparison of procedures to test for moderators in mixed-effects meta-regression models.
      ,
      • Knapp G.
      • Hartung J.
      Improved tests for a random effects meta-regression with a single covariate.
      The QW and QE statistics were calculated to assess the model misspecification of the weighted analyses of variance (ANOVAs) and meta-regressions, respectively, together with an estimate of the percentage of variance accounted for by the moderator variable, R2.
      Finally, publication bias was tested by constructing funnel plots using Duval and Tweedie's trim-and-fill method
      • Duval S.
      • Tweedie R.
      Trim and fill: a simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis.
      and Egger's regression test.
      • Rothstein H.R.
      • Borenstein M.
      • Sutton A.J.
      Publication bias in meta-analysis.
      A statistically significant result for Egger's test (p < .10) was evidence of publication bias. We used p < .10 instead of the usual p < .05 because of the lower statistical power of Egger's test with such a small number of studies.
      • Egger M.
      • Smith G.D.
      • Schneider M.
      • Minder C.
      Bias in meta-analysis detected by a simple, graphical test.
      All statistical analyses were carried out with the metafor package in R version 3.2.3.
      • Viechtbauer W.
      Conducting meta-analyses in R with the metafor package.

      3. Results

      3.1 Characteristics of the included studies

      The screening and coding of studies was performed manually. During the screening process, two reviewers independently excluded 7118 studies based on the title and abstract (Fig. 1). After this first screening, 834 full-text studies were reviewed, with 794 studies that did not meet the inclusion criteria being excluded. We contacted the authors of 11 studies that failed to provide sufficient statistical data to calculate the effect size; only one response was obtained (k = 10 studies were discarded for this reason). Finally, two independent coders extracted data from 38 studies (supplementary file 2) in accordance with the study coding manual.
      Fig. 1
      Fig. 1PRISMA 2020 flow diagram elaborated by Haddaway et al.
      • Haddaway N.R.
      • Page M.J.
      • Pritchard C.C.
      • McGuinness L.A.
      PRISMA2020: an R package and Shiny app for producing PRISMA 2020-compliant flow diagrams, with interactivity for optimised digital transparency and Open Synthesis.
      Table 1 contains the main characteristics of the 38 studies included in the review. The total number of participants was 3144 (min. = 7; max. = 684), with an average of 82.74 participants per sample (SD = 124.83). The mean ages of the participants ranged from 21 to 43.43 years (M = 33.59; SD = 6.73), the standard deviations of their ages ranged from 5.70 to 10.75 years (M = 8.69; SD = 2.13), and the percentage of females ranged from 11.5 to 94.73 (M = 70.79; SD = 21.85).
      Table 1Descriptive features of 38 studies.
      ArticleStudy designOutcome measureMeasure toolAssessmentYearCountrySample sizeGender (% female)AgeDuration,

      days (hours)
      Medical specialtyHealth professionalTheoretical model interventionType of methodologyTypeScenarios objectives
      Abelsson et al.
      • de Montalvo-Jääskeläinen F.
      • Altisent-Trota R.
      • Bellver-Capella V.
      • Cadena-Serrano F.
      • de los Reyes-López M.
      • Gándara-del -Castillo A.
      • et al.
      Report of the Spanish Bioethics Committee on the ethical aspects of patient safety and, specifically, on the implementation of an effective system for reporting safety incidents and adverse events.
      Quasi-experimental two groups

      CG = half of the sessions
      1. Global NTS

      2. Communication

      3. Situational awareness
      1. GRS (external observation, validated)

      2. GRS communication dimension (validated)

      3. GRS situational awareness dimension (validated)
      Pretest

      Posttest (8 weeks)
      2017SwedenN = 63

      EG = 27

      CG = 36
      Total = 26/63 41.27%

      EG: 12/27 (44.44%)

      CG: 14/36 (38.89%)
      EG: M = 38 (SD = 9, range = 26–63)

      CC: M = 42 (SD = 9.4, range = 26–63)
      4 daysA&E, Catastrophe or HEMSNursingUnspecifyInteractive: SimulationHigh-fidelity simulationOther
      Anderson et al.
      • Kohn L.T.
      To err is human: building a safer health system.
      Quasi-experimental a single group1. Global NTS1. Behavioural performance markers (external observation, validated)Pretest

      Posttest (immediate)
      2006EEUUN = 98/9 (89%)M = 32 (range = 26–44)1 dayIntensive critical careNursingECMO Sim training programMixed: Lectures and simulationHigh-fidelity simulationNTS
      Armstrong et al.
      Organization for Economic Cooperation and Development
      The economics of patient safety from analysis to action.
      Quasi-experimental a single group1. Global NTS

      2. Communication

      3. Leadership

      4. Situational awareness
      1. Modified T_NOTECHS (external observation, validated)

      2. T_NOTECHS communication dimension

      3. T_NOTECHS leadership dimension

      4. T_NOTECHS situational awareness dimension
      Pretest

      Posttest (immediate)
      2020New ZealandN = 151 dayA&E, Catastrophe or HEMSNursingCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Batchelder et al.
      The Joint Commission
      Sentinel event data: root causes by event type 2004-2015.
      Quasi-experimental a single group1. Communication

      2. Leadership

      3. Situational awareness
      1. Questionnaire (external observation, no validated)

      2. Questionnaire (external observation, no validated)

      3. Questionnaire (external observation, no validated)
      Pretest

      Posttest (immediate)
      2009UKN = 1212 daysA&E, Catastrophe or HEMSMedicine and paramedicsCRMMixed:Lectures and simulationHigh-fidelity simulationCRM
      Brewster et al.
      • Flin Rhona
      • O'Connor Paul
      • Crichton Margaret
      Safety at the sharp end: a guide to non-technical skills.
      Quasi-experimental a single group1. Knowledge

      2. Team
      1. Questionnaire (self-administered, no validated)

      2. Item (self-administered, no validated)
      Pretest

      Posttest (immediate)

      Follow-up (4 months after)
      2017AustralianN = 99M = 43 (SD = 9.01)1 day (4 h)Intensive critical careNursing and medicineCRMMixed:Lectures and simulationHigh-fidelity simulationCRM
      Burton et al.
      • Leonard M.
      • Graham S.
      • Bonacum D.
      The human factor: the critical importance of effective teamwork and communication in providing safe care.
      Quasi-experimental a single group1. Knowledge

      2. Attitude

      3. Global NTS
      1. Cuestionario (no validated)

      2. Safety Attitude Questionnaire (validated)

      3. MPTHS (external observation, validated)
      Pretest

      Posttest (immediate)
      2011EEUUN = 1918/19 (94.73%)6 days (24 h)Intensive critical careNursing and medicineUnspecifyInteractive: SimulationHigh-fidelity simulationNTS
      Chamberland et al.
      • Gordon M.
      • Darbyshire D.
      • Baker P.
      Non-technical skills training to enhance patient safety: a systematic review.
      Experiment.

      CG another intervention without NTS
      1. Communication1. Item (external observation, validated)Pretest

      Posttest (immediate)

      Posttest (3 months after)
      2018CanadaN = 60

      EG = 30

      CG = 30
      24/60 (40%)M = 40 (SD = 9.3, range = 26–59)1 day (4 h)Intensive critical careNursing and medicineCRMInteractive: ParticipationHigh-fidelity simulationCRM
      Couloures et al.
      • Myers J.A.
      • Powell D.M.C.
      • Psirides A.
      • Hathaway K.
      • Aldington S.
      • Haney M.F.
      Non-technical skills evaluation in the critical care air ambulance environment: introduction of an adapted rating instrument--an observational study.
      Quasi-experimental a single group1. Team

      2. Communication
      1. SBAR Teamwork component (external observed, no validated)

      2. SBAR component (external observation, no validated)
      Pretest

      Posttest (immediate)
      2017EEUUN = 23Groups

       = 5
      4 day (4 h)A&E, Catastrophe or HEMSMedicine (residents)SBARMixed: Lectures and simulationHigh-fidelity simulationNTS
      Crimlisk et al.
      • Jafri F.
      • Mirante D.
      • Ellsworth K.
      • Shulman J.
      • Dadario N.
      • Williams K.
      • et al.
      A microdebriefing crisis resource management program for simulated pediatric resuscitation in a community hospital: a feasibility study.
      Quasi-experimental a single group1. Global NTS1. Team dynamics and confidence (self-administered, no validated)Pretest

      Posttest (immediate)
      2017EEUUN = 631 day (4 h)Mixed: A&E, Catastrophe or HEMS and Intensive critical careNursingEARTMixed: Lectures and simulationHigh-fidelity simulationNTS
      Delaney et al.
      • Cooper G.E.
      • White M.D.
      • Lauber J.K.
      Resource management on the flight deck.
      Quasi-experimental a single group1. Knowledge1. TeamSTEPPS (self-administered, no validated)Pretest

      Posttest (immediate)
      2015EEUUN = 8270/82 80.50%M = 27.61 yearMixed: A&E, Catastrophe or HEMS and Intensive critical careNursingTeamSTEPPSMixed: Lectures and simulationHigh-fidelity simulationNTS
      George et al.
      • Gaba D.M.
      • Fish K.J.
      • Howard S.K.
      Crisis management in anesthesiology.
      Quasi-experimental a single group1. Knowledge

      2. Team
      1. Examen online (no validated)

      2. Teamwork skills scale (self-administered, validated)
      Pretest

      Posttest (immediate)

      Posttest (1month after)
      2018EEUUN = 221 day (5 h)Intensive critical careNursing and MedicineUnspecifyInteractive: SimulationHigh-fidelity simulationNTS
      Ghazali et al.
      • Pizzi L.
      • Goldfarb N.I.
      • Nash D.B.
      Crew resource management and its applications in medicine.
      Experimental CG less sessions1. Global NTS

      2. Leadership
      1. Clinical Teamwork Management (external observation, validated)

      2. Behavioural assessment tool (external observation, validated)
      Pretest

      Posttest (immediate)
      2019FranceN = 48

      EG = 24

      CG = 24
      22/48

      45.83%
      9 days (12 horas)A&E, Catastrophe or HEMSNursing, Medicine and ambulance driverUnspecifyMixed: Lectures and simulationHigh-fidelity simulationNTS
      Ginsburg et al.
      • Nader M.
      • Tasdelen-Teker G.
      • DeStephens A.J.
      • Lampotang S.
      • Prelipcean I.
      • Smith R.D.
      • et al.
      Simulation use in outreach setting: a novel approach to building sustainability.
      Quasi-experimental two groups

      CG = no intervention and intensive critical care professionals.
      1. Team1. Teamwork climate survey (self-administered, validated)Pretest

      Posttest (immediate)

      Follow-up (4 months after baseline)
      2017CanadaN = 71

      EG = 42

      CG = 29
      EG: 35/42

      83%

      CG: (27/29, 93%)
      UnspecifyA&E, Catastrophe or HEMSNursing and MedicineUnspecifyMixed: Lectures and simulationLow-fidelity simulationNTS
      Haerkens et al.
      • Langdalen H.
      • Abrahamsen E.B.
      • Sollid S.J.M.
      • Sørskår L.I.K.
      • Abrahamsen H.B.
      A comparative study on the frequency of simulation-based training and assessment of non-technical skills in the Norwegian ground ambulance services and helicopter emergency medical services.
      Quasi-experimental a single group1. Attitude1. Safety attitudes questionnaire (SAQ) (TeamWork climate domain) (validated)Pretest

      Posttest (immediate)
      2015The NetherlandN = 2512 days (18 h)Intensive critical careNursing and medicineCRMMixed: Lectures and participative activitiesNone
      Hicks et al.
      • Alinier G.
      Developing high-fidelity health care simulation scenarios: a guide for educators and professionals.
      Quasi-experimental a single group1. Attitude

      2. Global NTS
      1. Human Factor Attitude Survey (self-administered, validated)

      2. Ottawa GRS (external observation, validated)
      Pretest

      Posttest (immediate)
      2012CanadaN = 141 dayA&E, Catastrophe or HEMSMedicine (residents)CRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Houzé-Cerfon et al.
      • Gaba D.M.
      Crisis resource management and teamwork training in anaesthesia.
      Experimental CG, other intervention.1. Global NTS

      2. Team

      3. Leadership
      1. TEAM Emergency Assessment Measure (external observation, validated)

      2. TEAM dimension

      3. Team leadership dimension
      Pretest

      Posttest (immediate)
      2020FranceN = 211

      EG = 105

      CG = 106
      Total: 145l/211 (68.72%)

      EG: 68/106 (54%)

      CG: 77/105 (73%)
      EG: range = 28-43

      CG: range = 26-45
      1 dayA&E, Catastrophe or HEMSNursing, medicine and other health professionalsCRMInteractive: SimulationHigh-fidelity simulationCRM
      Karageorge et al.
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      Quasi-experimental a single group1. Knowledge

      2. Self-efficacy
      1. Questionnaire Multiple-choice (self-administered, no validated)

      2. Clinical decision-making self-confidence scale (self-administered, validated)
      Pretest

      Posttest (immediate)
      2020EEUUN = 244 days (4 h)Intensive critical careNursingUnspecifyInteractive: SimulationHigh-fidelity simulationNTS
      Khobrani et al.
      • O'Connor P.
      • Campbell J.
      • Newon J.
      • Melton J.
      • Salas E.
      • Wilson K.A.
      Crew resource management training effectiveness: a meta-analysis and some critical needs.
      Quasi-experimental a single group1. Knowledge1. Questionnaire Multiple-choice (no validated)Pretest

      Posttest (immediate)
      2018EEUUN = 132 days (10 h)A&E, Catastrophe or HEMSMedicine (residents)Kern'sMixed: Lectures and simulationHigh-fidelity simulationNTS
      Maenhout et al.
      • Buljac-Samardžić M.
      • Dekker-van Doorn C.M.
      • Maynard M.T.
      What do we really know about crew resource management in healthcare?: an umbrella review on crew resource management and its effectiveness.
      Quasi-experimental a single group1. Self-efficacy

      2. Team

      3. Leadership
      1. Clinical decision-making (self-confidence scale, validated)

      2. Checklist (external observation, no validated)

      3. Clinical deterioration leadership ability scale (self-administered, validated)
      Pretest

      Posttest (immediate)
      2021BelgiumN = 7167/71 (94.37%)1 day (2 h)Intensive critical careNursingUnspecifyInteractive: SimulationHigh-fidelity simulationNTS
      Mah et al.
      • Gross Benedict
      • Leonie Rusin
      • Jan Kiesewetter
      • Zottmann Jan M.
      • Fischer Martin R.
      • Stephan Prückner
      • et al.
      Crew resource management training in healthcare: a systematic review of intervention design, training conditions and evaluation.
      Quasi-experimental a single group1. Knowledge1. Questionnaire (no validated)Pretest

      Posttest (immediate)
      2009EEUUN = 741 dayIntensive critical careNursing and medicineCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Mahramus et al.
      • Low X.M.
      • Horrigan D.
      • Brewster D.J.
      The effects of team-training in intensive care medicine: a narrative review.
      Quasi-experimental a single group1. Team1. Team tool (external observed, validated)Pretest

      Posttest (immediate)
      2016EEUUN = 7370%1 day (2 h)Intensive critical careNursing and medicineCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Matzke et al.
      • Buljac-Samardzic M.
      • Doekhie Kirti D.
      • van Wijngaarden J.D.H.
      Interventions to improve team effectiveness within health care: a systematic review of the past decade.
      Quasi-experimental a single group1. Global NTS

      2. Communication
      1. T-TPQ (self-administered, validated)

      2. T-TPQ communication dimension (self-administered, validated)
      Pretest

      Posttest (3 weeks after)
      2021EEUUN = 3428/34 (82.4%)M = 35 (SD = 10.75, range = 22–66)1 day (1 h)A&E, Catastrophe or HEMSNursing and patient care techniciansTeamSTEPPSMixed: Lectures and simulationHigh-fidelity simulation (real-life)NTS
      Meurling et al.
      • Ounounou E.
      • Aydin A.
      • Brunckhorst O.
      • Khan M.S.
      • Dasgupta P.
      • Ahmed K.
      Nontechnical skills in surgery: a systematic review of current training modalities.
      Quasi-experimental a single group1. Attitude

      2. Self-efficacy
      1. Safety attitudes questionnaire, (SAQ)

      (puntuación Teamwork climate) (validated)

      2. Self-efficacy Questionnaire (self-administered, validated)
      Pretest

      Posttest (immediate)
      2013SwedenN = 151119/151 (79%)Range = 20-622 days (8 h)Intensive critical careNursing and medicine and nurse assistantsUnspecifyInteractive: High-fidelity simulationHigh-fidelity simulationNTS
      The article by Morey et al.24 was not included in the meta-analytic analysis.
      Morey et al.
      • Griffin C.
      • Aydın A.
      • Brunckhorst O.
      • Raison N.
      • Khan M.S.
      • Dasgupta P.
      • et al.
      Non-technical skills: a review of training and evaluation in urology.
      Quasi-experimental two groups. CG no intervention.1. Attitudes

      2. Team
      1. Staff attitudes toward teamwork (self-administered, validated)

      2. BARS (external observation, validated)
      Pretest

      Posttest (4 months after)

      Posttest (8 months after)
      2002EEUUN = 1058

      EG = 684

      CG = 374
      A&E, Catastrophe or HEMSNursing, medicine and techniciansCRMInteractive: Participative activitiesNone
      Munroe et al.
      • Page M.J.
      • McKenzie J.E.
      • Bossuyt P.M.
      • Boutron I.
      • Hoffmann T.C.
      • Mulrow C.D.
      • et al.
      The PRISMA 2020 statement: an updated guideline for reporting systematic reviews.
      Quasi-experimental a single group1. Global NTS

      2. Communication

      3. Situational awareness
      1. ENNTS (external observation, validated)

      2. ENNTS communication dimensión

      3. ENNTS situational awareness dimensión
      Pretest

      Posttest (immediate)
      2016AustralianN = 3829/38 (76.30%)M = 29 (range = 21–49)1 day (4 h)A&E, Catastrophe or HEMSNursingHIRADMixed: Lectures and simulationHigh-fidelity simulationNTS
      Obenrader et al.
      • Lipsey M.W.
      Identifying interesting variables and analysis opportunities.
      Quasi-experimental a single group1. Team

      2. Communication
      1. T-TPQ dimensión (self-administered, validated)

      2. T-TPQ communication dimension
      Pretest

      Posttest (2 weeks after)

      Posttest (1 month after)
      2018EEUUN = 57A&E, Catastrophe or HEMSNursing and medicineTeamSTEPPSMixed: Lectures and simulationLow-fidelity simulationNTS
      Parsons et al.
      • Reed D.A.
      • Beckman T.J.
      • Wright S.M.
      • Levine R.B.
      • Kern D.E.
      • Cook D.A.
      Predictive validity evidence for medical education research study quality instrument scores: quality of submissions to JGIM's Medical Education Special Issue.
      Quasi-experimental a single group1. Global NTS

      2. Communication

      3. Leadership

      4. Situational awareness
      1. Ottawa GRS (external observation, validated)

      2. Ottawa GRS communication dimension

      3. Ottawa GRS leadership dimension

      4. Ottawa GRS situational awareness dimension
      Pretest

      Posttest (immediate)
      2018SingapurN = 144 day (5 h)A&E, Catastrophe or HEMSMedicine (residents)CRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Pascual et al.
      • Cook D.A.
      • Reed D.A.
      Appraising the quality of medical education research methods: the medical education research study quality instrument and the newcastle-ottawa scale-education.
      Quasi-experimental a single group1. Knowledge

      2. Global NTS
      1. Cuestionario (no validated)

      2. Team Leadership-Interpersonal skills (external observation, validated)
      Pretest

      Posttest (immediate)
      2011EEUUN = 128/12 (66.67%)M = 30.75 (range = 26–36)3 days (15 h)Intensive critical careNursingUnspecifyMixed: Lectures and simulationHigh-fidelity simulationNTS
      Patterson et al.
      • Borenstein M.
      • Hedges L.V.
      • Higgins J.P.T.
      Introduction to meta-analysis.
      Quasi-experimental a single group1. Knowledge

      2. Attitude
      1. Questionnaire (no validated)

      2. Safety attitudes questionnaire, SAQ (validated)
      Pretest

      Posttest (immediate)

      Follow-up (6 months after)
      2013EEUUN = 289222/289 (77%)2 days (12 h)A&E, Catastrophe or HEMSNursing, medicine and other health professionalsCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Pennington et al.
      • Cohen J.
      Statistical power analysis for the behavioral sciences.
      Quasi-experimental a single group1. Global NTS1. TEAM (external observation, validated)Pretest

      Posttest (2–4 weeks)
      2018International (several countries)N = 10 groups12 daysA&E, Catastrophe or HEMSNursing and medicineUnspecifyInteractiveNone
      Peters et al.
      • Wan X.
      • Wang W.
      • Liu J.
      • Tong T.
      Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range.
      Quasi-experimental a single group1. Knowledge

      2. Global NTS

      3. Team

      4. Communication

      5. Leadership
      1. TeamSTEPPS (no validated)

      2. TTPOT (external observation, validated)

      3. TTPOT team dimension

      4. TTPOT communication dimension

      5. TTPOT leadership dimension
      Pretest

      Posttest (immediate)
      2017EEUUN = 821 day (8 h)A&E, Catastrophe or HEMSNursingTeamSTEPPSMixed: Lectures and simulationLow-fidelity simulationNTS
      Pietsch et al.
      • Cooper H.M.
      • Hedges L.V.
      • Valentine J.C.
      The handbook of research synthesis and meta-analysis.
      Quasi-experimental a single group1. Global NTS1. Self-evaluation questionnaire (self-administered, no validated)Pretest

      Posttest (immediate)
      2016GermanyN = 401 day (1 h)A&E, Catastrophe or HEMSMedicine and paramedicsCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Rice et al.
      • Hartung J.
      An alternative method for meta-analysis.
      Quasi-experimental a single group1. Attitude

      2. Global NTS

      3. Team

      4. Communication

      5. Leadership
      1. T-TAQ (validated)

      2. TTPOT (self-administered, validated)

      3. TeamSTEPPS Perceptions Questionnaire (self-administered, validated)

      4. TTPOT communication dimension

      5. TTPOT leadership dimension
      Pretest

      Posttest (immediate)
      2016EEUUN = 76/7 (85.71%)M = 21Intensive critical careNursingTeamSTEPPSInteractive: SimulationHigh-fidelity simulationNTS
      Ryan et al.
      • Aguinis H.
      • Gottfredson R.K.
      • Wright T.A.
      Best-practice recommendations for estimating interaction effects using meta-analysis.
      Quasi-experimental a single group1. Communication

      2. Leadership
      1. Item (self-administered, no validated)

      2. Questionnaire (self-administered, no validated)
      Pretest

      Posttest (immediate)
      2018EEUUN = 71 day (4 h)Intensive critical careNursingCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Sauter et al.
      • Viechtbauer W.
      • López-López J.A.
      • Sánchez-Meca J.
      • Marín-Martínez F.
      A comparison of procedures to test for moderators in mixed-effects meta-regression models.
      Quasi-experimental a single group1. Knowledge1. Questionnaire of CRM principles (self-administered, no validated)Pretest

      Posttest (immediate)
      2016EEUUN = 5034/50 (68%)Range = 21-601 day (7 h)A&E, Catastrophe or HEMSNursing, medicine and other health professionalsCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Shapiro et al.
      • Knapp G.
      • Hartung J.
      Improved tests for a random effects meta-regression with a single covariate.
      Experimental CG other intervention.1. Team1. BARS (external observation, validated)Pretest

      Posttest (within 2 weeks of the training)
      2004EEUUN = 20

      EG = 2 groups

      CG = dos groups
      Range = 29-43A&E, Catastrophe or HEMSNursing, medicine and other health professionalsCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Turkelson et al.
      • Duval S.
      • Tweedie R.
      Trim and fill: a simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis.
      Quasi-experimental a single group1. Knowledge1. Multiple-choice questionnaire (no validated)Pretest

      Posttest (immediate)
      2017EEUUN = 3430/34 (91%)Range = 25-35Intensive critical careNursingCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Winkelmann et al.
      • Rothstein H.R.
      • Borenstein M.
      • Sutton A.J.
      Publication bias in meta-analysis.
      Quasi-experimental a single group1. Knowledge

      2. Communication
      1. Questionnaire CRM basics (no validated)

      2. Questionnaire (a self-administered, no validated)
      2016GermanyN = 263/26 (11.50%)M = 38.7 (SD = 5.70)2 daysA&E, Catastrophe or HEMSMedicineCRMMixed: Lectures and simulationHigh-fidelity simulationCRM
      Note: The articles included are cited in supplementary material Appendix 2).
      A&E, emergencies services; BARS, Behaviorally Anchored Rating Scale; CG, control group; CRM, critical resources management; EART, Emergency airway response team; ECMO Sim, Immersive simulation-based training program designed to teach skills to successfully manage life-threatening crises on extracorporeal membrane oxygenation; EG, experimental group; ENNTS, Emergency Nurses' Non-technical Skills; GRS, Global Rating Scale; HEMS, helicopter emergency medical service; HIRAD, History, Identify Red Flags, Assessment, Interventions, Diagnostics, reassessment and communication; M, average; MPTHS, Mayo High Performance Teamwork Scale; NTS, nontechnical skills; Ottawa GRS, Ottawa Crisis Resource Management Global Rating Scale; SD, standard deviation; SBAR, Situation, Background, Assessment, and Recommendation techniques to transfer the patient to a higher level of care; T_NOTECHS, Trauma Non-technical Skills teamwork scale; TeamSTEPPS, Team Strategies and Tools to Enhance Performance and Patient Safety; T-TPQ, Team- STEPPS® teamwork and perceptions questionnaire; TTPOT, Trauma Team Performance Observation Tool; TTAQ, TeamSTEPPS Teamwork Attitudes Questionnaire.
      a The article by Morey et al.
      • Griffin C.
      • Aydın A.
      • Brunckhorst O.
      • Raison N.
      • Khan M.S.
      • Dasgupta P.
      • et al.
      Non-technical skills: a review of training and evaluation in urology.
      was not included in the meta-analytic analysis.
      We analysed 21 articles (55.3%) aimed at emergency health professionals (including disaster or helicopter emergency medical service), 14 articles (39.5%) aimed at critical care professionals, and two (5.3%) articles aimed at both professionals. The predominant teaching methodology was the mixed methodology (passive and interactive) (n = 27; 71.1%). The predominant theoretical model was CRM (n = 18; 47.4%). The most used type of simulation was high fidelity (n = 32; 84.2%), and the objectives of the simulation scenarios were mainly geared towards nontechnical skills training (n = 18; 47.4%), followed by CRM (n = 16; 42.1%), unspecified (n = 3; 7.9%), and other (n = 1; 2.6%). Most studies (n = 32; 84.21%) performed immediate posttest measurements, except for five articles (13.16%) which evaluated the results with variability between 2 and 8 weeks after the intervention, considered as posttest. One article performed the first evaluation measure at 4 months after implementation and, finally, was not included in the statistical analysis. The year of publication of the studies found ranged from 2002 to 2021. In terms of the locations of the studies, four continents were identified: North America, 24 (63.2%); Asia, 1 (2.6%); Europe, 9 (23.7%); Oceania, 3 (7.9%), and International, 1 (2.6%). All studies were written in English.

      3.2 Assessment of risk bias of selected studies

      With respect to the methodological quality of the studies, the average MERSQI
      • Reed D.A.
      • Beckman T.J.
      • Wright S.M.
      • Levine R.B.
      • Kern D.E.
      • Cook D.A.
      Predictive validity evidence for medical education research study quality instrument scores: quality of submissions to JGIM's Medical Education Special Issue.
      score was 12.88 (SD = 2.21), with scores ranging from 8.5 to 16.5 (median = 14) (Table 2). Study designs included randomised experimental studies (n = 4; 10.52%) and two-group quasi-experimental designs with pretest and posttest (n = 3; 7.89%) and one-group quasi-experimental designs with pretest and posttest (31; 81.58%). These studies predominantly employed a single institution (n = 27, 71.1%), using objective data by external observer ratings (n = 26, 68.4%) and assessing knowledge or skills outcomes (n = 27, 71.1%). Most of the tools (n = 28, 73.7%) used to assess the outcomes had psychometric tests of internal consistency. All articles (100%) reported statistical inference and were appropriate for the design and types of data collected.
      Table 2Quality of studies measured by MERSQI.
      • Reed D.A.
      • Beckman T.J.
      • Wright S.M.
      • Levine R.B.
      • Kern D.E.
      • Cook D.A.
      Predictive validity evidence for medical education research study quality instrument scores: quality of submissions to JGIM's Medical Education Special Issue.
      ArticleStudy designSamplingType of dataValidityData analysisOutcomeTotal
      Abelsson et al.
      • de Montalvo-Jääskeläinen F.
      • Altisent-Trota R.
      • Bellver-Capella V.
      • Cadena-Serrano F.
      • de los Reyes-López M.
      • Gándara-del -Castillo A.
      • et al.
      Report of the Spanish Bioethics Committee on the ethical aspects of patient safety and, specifically, on the implementation of an effective system for reporting safety incidents and adverse events.
      22,53331,515
      Anderson et al.
      • Kohn L.T.
      To err is human: building a safer health system.
      1,523231,513
      Armstrong et al.
      Organization for Economic Cooperation and Development
      The economics of patient safety from analysis to action.
      1,523331,514
      Batchelder et al.
      The Joint Commission
      Sentinel event data: root causes by event type 2004-2015.
      1,523131,512
      Brewster et al.
      • Flin Rhona
      • O'Connor Paul
      • Crichton Margaret
      Safety at the sharp end: a guide to non-technical skills.
      1,521031,59
      Burton et al.
      • Leonard M.
      • Graham S.
      • Bonacum D.
      The human factor: the critical importance of effective teamwork and communication in providing safe care.
      1,52333214,5
      Chamberland et al.
      • Gordon M.
      • Darbyshire D.
      • Baker P.
      Non-technical skills training to enhance patient safety: a systematic review.
      32,53231,515
      Couloures et al.
      • Myers J.A.
      • Powell D.M.C.
      • Psirides A.
      • Hathaway K.
      • Aldington S.
      • Haney M.F.
      Non-technical skills evaluation in the critical care air ambulance environment: introduction of an adapted rating instrument--an observational study.
      1,513231,512
      Crimlisk et al.
      • Jafri F.
      • Mirante D.
      • Ellsworth K.
      • Shulman J.
      • Dadario N.
      • Williams K.
      • et al.
      A microdebriefing crisis resource management program for simulated pediatric resuscitation in a community hospital: a feasibility study.
      1,521131,510
      Delaney et al.
      • Cooper G.E.
      • White M.D.
      • Lauber J.K.
      Resource management on the flight deck.
      1,531331,513
      George et al.
      • Gaba D.M.
      • Fish K.J.
      • Howard S.K.
      Crisis management in anesthesiology.
      1,521231,511
      Ghazali et al.
      • Pizzi L.
      • Goldfarb N.I.
      • Nash D.B.
      Crew resource management and its applications in medicine.
      333331,516,5
      Ginsburg et al.
      • Nader M.
      • Tasdelen-Teker G.
      • DeStephens A.J.
      • Lampotang S.
      • Prelipcean I.
      • Smith R.D.
      • et al.
      Simulation use in outreach setting: a novel approach to building sustainability.
      221331,512,5
      Haerkens et al.
      • Langdalen H.
      • Abrahamsen E.B.
      • Sollid S.J.M.
      • Sørskår L.I.K.
      • Abrahamsen H.B.
      A comparative study on the frequency of simulation-based training and assessment of non-technical skills in the Norwegian ground ambulance services and helicopter emergency medical services.
      1,51,5333315
      Hicks et al.
      • Alinier G.
      Developing high-fidelity health care simulation scenarios: a guide for educators and professionals.
      1,52,53331,514,5
      Houzé-Cerfon et al.
      • Gaba D.M.
      Crisis resource management and teamwork training in anaesthesia.
      32,53331,516
      Karageorge et al.
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      1,521131,510
      Khobrani et al.
      • O'Connor P.
      • Campbell J.
      • Newon J.
      • Melton J.
      • Salas E.
      • Wilson K.A.
      Crew resource management training effectiveness: a meta-analysis and some critical needs.
      1,523331,514
      Maenhout et al.
      • Buljac-Samardžić M.
      • Dekker-van Doorn C.M.
      • Maynard M.T.
      What do we really know about crew resource management in healthcare?: an umbrella review on crew resource management and its effectiveness.
      1,523331,514
      Mah et al.
      • Gross Benedict
      • Leonie Rusin
      • Jan Kiesewetter
      • Zottmann Jan M.
      • Fischer Martin R.
      • Stephan Prückner
      • et al.
      Crew resource management training in healthcare: a systematic review of intervention design, training conditions and evaluation.
      1,523031,511
      Mahramus et al.
      • Low X.M.
      • Horrigan D.
      • Brewster D.J.
      The effects of team-training in intensive care medicine: a narrative review.
      1,523331,514
      Matzke et al.
      • Buljac-Samardzic M.
      • Doekhie Kirti D.
      • van Wijngaarden J.D.H.
      Interventions to improve team effectiveness within health care: a systematic review of the past decade.
      1,52123110,5
      Meurling et al.
      • Ounounou E.
      • Aydin A.
      • Brunckhorst O.
      • Khan M.S.
      • Dasgupta P.
      • Ahmed K.
      Nontechnical skills in surgery: a systematic review of current training modalities.
      1,51,5323314
      The article by Morey et al.24 was not included in the meta-analytic analysis.
      Morey et al.
      • Griffin C.
      • Aydın A.
      • Brunckhorst O.
      • Raison N.
      • Khan M.S.
      • Dasgupta P.
      • et al.
      Non-technical skills: a review of training and evaluation in urology.
      23323316
      Munroe et al.
      • Page M.J.
      • McKenzie J.E.
      • Bossuyt P.M.
      • Boutron I.
      • Hoffmann T.C.
      • Mulrow C.D.
      • et al.
      The PRISMA 2020 statement: an updated guideline for reporting systematic reviews.
      1,533231,514
      Obenrader et al.
      • Lipsey M.W.
      Identifying interesting variables and analysis opportunities.
      1,52133111,5
      Parsons et al.
      • Reed D.A.
      • Beckman T.J.
      • Wright S.M.
      • Levine R.B.
      • Kern D.E.
      • Cook D.A.
      Predictive validity evidence for medical education research study quality instrument scores: quality of submissions to JGIM's Medical Education Special Issue.
      1,523331,514
      Pascual et al.
      • Cook D.A.
      • Reed D.A.
      Appraising the quality of medical education research methods: the medical education research study quality instrument and the newcastle-ottawa scale-education.
      1,52,53131,512,5
      Patterson et al.
      • Borenstein M.
      • Hedges L.V.
      • Higgins J.P.T.
      Introduction to meta-analysis.
      1,51,53231,512,5
      Pennington et al.
      • Cohen J.
      Statistical power analysis for the behavioral sciences.
      1,533331,515
      Peters et al.
      • Wan X.
      • Wang W.
      • Liu J.
      • Tong T.
      Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range.
      1,52333315,5
      Pietsch et al.
      • Cooper H.M.
      • Hedges L.V.
      • Valentine J.C.
      The handbook of research synthesis and meta-analysis.
      1,5210318,5
      Rice et al.
      • Hartung J.
      An alternative method for meta-analysis.
      1,523331,514
      Ryan et al.
      • Aguinis H.
      • Gottfredson R.K.
      • Wright T.A.
      Best-practice recommendations for estimating interaction effects using meta-analysis.
      1,5210318,5
      Sauter et al.
      • Viechtbauer W.
      • López-López J.A.
      • Sánchez-Meca J.
      • Marín-Martínez F.
      A comparison of procedures to test for moderators in mixed-effects meta-regression models.
      1,52303312,5
      Shapiro et al.
      • Knapp G.
      • Hartung J.
      Improved tests for a random effects meta-regression with a single covariate.
      323231,514,5
      Turkelson et al.
      • Duval S.
      • Tweedie R.
      Trim and fill: a simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis.
      1,521231,511
      Winkelmann et al.
      • Rothstein H.R.
      • Borenstein M.
      • Sutton A.J.
      Publication bias in meta-analysis.
      1,52,510319
      TOTAL M (SD)1.87 (0.47)2.14 (0.45)2.37 (0.94)2.03 (1.10)3 (0)1.64 (0.57)12.88 (2.21)
      Note: All bibliographic references are in Appendix 2.
      a The article by Morey et al.
      • Griffin C.
      • Aydın A.
      • Brunckhorst O.
      • Raison N.
      • Khan M.S.
      • Dasgupta P.
      • et al.
      Non-technical skills: a review of training and evaluation in urology.
      was not included in the meta-analytic analysis.

      3.3 Mean effect size and heterogeneity

      Table 3 shows the results of the eight meta-analyses performed for each outcome measure, and the corresponding forest plots are shown in supplementary files 3 and 4. The mean effect sizes calculated for all eight outcomes were negative, indicating an improvement from pretest to posttest. All of them were statistically significant, with the exception of situational awareness (d+ = −0.448; 95% CI = −1.034, 0.139), most likely due to the low statistical power of such a small number of studies (n = 6).
      Table 3Mean effect sizes, 95% confidence, and heterogeneity statistics for the outcome variables.
      nd+95% CIQI2
      dldu
      Knowledge13−0.925−1.177−0.673120.515∗∗∗87.67%
      Attitude6−0.406−0.769−0.04424.880∗∗∗81.20%
      Self-efficacy3−0.469−0.874−0.0644.96558.17%
      Global NTS19−0.642−0.849−0.43469.956∗∗∗78.00%
      Team NTS13−0.606−0.949−0.262158.125∗∗∗92.48%
      Communication NTS14−0.458−0.818−0.099104.610∗∗∗87.94%
      Leadership NTS10−0.571−0.877−0.26440.468∗∗∗83.15%
      Awareness NTS6−0.448−1.0340.13924.225∗∗∗82.46%
      Note: NTS, nontechnical skills; n, number of studies; d+, Mean effect size; 95% CI, 95% confidence interval for d+; dl and du, lower and upper confidence limits; Q, Cochran's heterogeneity Q statistic; with k – 1 degrees of freedom; I2, Heterogeneity index; ∗∗∗p < .001.
      The highest mean effect size was found for knowledge (d+ = −0.925; 95% CI = −1.177, −0.673), followed by the mean effect sizes for global nontechnical skills (d+ = −0.642; 95% CI = −0.849, −0.434), team nontechnical skills (d+ = −0.606; 95% CI = −0.949, −0.262), and leadership nontechnical skills (d+ = −0.571; 95% CI = −0.877, −0.264). Similar mean effect sizes were found for attitude (d+ = −0.406; 95% CI = −0.769, −0.044), self-efficacy (d+ = −0.469; 95% CI = −0.874, −0.064), and communication nontechnical skills (d+ = −0.458; 95% CI = −0.818, −0.099). In all cases, the mean effect sizes were large to medium in magnitude.
      Large heterogeneity among the standardised mean changes was found in the meta-analyses (I2 > 75% and p < .001), with the exception of self-efficacy where I2 = 58.17%, and there was a nonstatistical result for Cochran's Q (see Table 3). This great variability is also reflected in the forest plots (see supplementary files 3 and 4).

      3.4 Analysis of publication bias

      With respect to the funnel plots using the trim-and-fill method, for most of the analysed outcomes, the effect sizes were imputed on the right-hand side of the funnel plots in order to achieve symmetry. For the knowledge measures specifically, three effect sizes were imputed (see supplementary file 5A), leading to a corrected effect size of dc = −0.801 (95% CI = −1.039, −0.564). For the self-efficacy and leadership nontechnical skills measures, two effect sizes were imputed (see supplementary files 5C and 6C, respectively), leading to corrected effect sizes of dc = −0.330 (95% CI = −0.548, −0.112) and dc = −0.475 (95% CI = −0.757, −0.193), respectively. For global nontechnical skills measures, four effect sizes were imputed (see supplementary file 5D), leading to a corrected effect size of dc = −0.556 (95% CI = −0.748, −0.364). For communication skills, six effect sizes were imputed (supplementary file 6B), leading to a corrected effect size of dc = −0.154 (95% CI = −0.482, 0.174). Finally, for situational awareness measures, one effect size was imputed (see supplementary file 6D), leading to a corrected effect size of dc = −0.326 (95% CI = −0.798, 0.147). However, Egger's test showed statistically nonsignificant results (p > .05) for all eight outcome measures, allowing us to rule out publication bias as a threat to the validity of the results of the meta-analyses.

      3.5 Analysis of moderator variables

      The great heterogeneity found among the standardised mean changes led to analysis of the moderator variables. This analysis was applied to those meta-analyses with at least 10 standardised mean changes, i.e., measures of knowledge, global, team, communication, and leadership skills. Supplementary files 7 to 12 show the results of the weighted ANOVAs and meta-regressions for the results analysed previously.
      Of the various potential moderator variables analysed, only simulation showed a statistically significant relationship with the effect sizes on communication nontechnical skills (p < .05). To be precise, the mean effect size obtained for high fidelity (d+ = −0.647) was statistically larger than that of low fidelity (d+ = 0.396), with 38.33% of variance accounted for (see supplementary file 10). Finally, the quality total score did not show a significant effect with any of the outcome measures analysed, i.e., knowledge, global, team, communication, and leadership skills (see supplementary file 12).

      4. Discussion

      4.1 Summary findings

      The present meta-analysis was conducted with a view to evaluating the effectiveness of educational interventions that include certain nontechnical skills and are targeted at healthcare professionals in critical care units and emergency departments, both in-hospital and out-of-hospital. Effects were observed on participants' knowledge, teamwork, global nontechnical skills, leadership, attitude, self-efficacy, and communication, assessed in the short term. There were not enough evaluations to perform a quantitative analysis of the long-term evaluations. O'Dea et al.
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      reflect how most interventions are evaluated through posttest measures and within the first 2 months, concluding that there is no evidence of the long-term impacts of the training.

      4.2 Comparison and contrast with previous studies

      Unlike previous meta-analyses, our study has focused on specific clinical units where CRM is particularly relevant, such as emergency and critical care units. In the past, the focus was on industrial and/or military contexts, such as the commercial and military aviation industry, air traffic control, the nuclear sector, or commercial shipping.
      • O'Connor P.
      • Campbell J.
      • Newon J.
      • Melton J.
      • Salas E.
      • Wilson K.A.
      Crew resource management training effectiveness: a meta-analysis and some critical needs.
      Various medical units were also studied,
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      such as the operating room, trauma, anaesthesiology, obstetrics, and neonatal, as well as emergency and critical care units. Furthermore, while the present meta-analysis sample is of healthcare professionals, including nurses, physicians, and emergency healthcare technicians, others include students and professionals from other disciplines
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      ,
      • O'Connor P.
      • Campbell J.
      • Newon J.
      • Melton J.
      • Salas E.
      • Wilson K.A.
      Crew resource management training effectiveness: a meta-analysis and some critical needs.
      with varying degrees of experience, expertise, and training. It is worth noting that this current paper addresses all the dimensions of nontechnical skills,
      • Flin Rhona
      • O'Connor Paul
      • Crichton Margaret
      Safety at the sharp end: a guide to non-technical skills.
      whereas some previous meta-analyses have only focused on specific constructs such as teamwork
      • Salas E.
      • Diaz Granados D.
      • Klein C.
      • Burke C.S.
      • Stagl K.C.
      • Goodwin G.F.
      • et al.
      Does team training improve team performance? A meta-analysis.
      ,
      • McEwan D.
      • Ruissen G.R.
      • Eys M.A.
      • Zumbo B.D.
      • Beauchamp M.R.
      The effectiveness of teamwork training on teamwork behaviors and team performance: a systematic review and meta-analysis of controlled interventions.
      or situational awareness.
      • Walshe N.
      • Crowley C.
      • OʼBrien S.
      • Browne J.
      • Hegarty J.
      Educational interventions to enhance situation awareness: a systematic review and meta-analysis.
      With regard to the descriptive characteristics of the studies analysed, in line with previous meta-analyses,
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      ,
      • O'Connor P.
      • Campbell J.
      • Newon J.
      • Melton J.
      • Salas E.
      • Wilson K.A.
      Crew resource management training effectiveness: a meta-analysis and some critical needs.
      ,
      • Salas E.
      • Diaz Granados D.
      • Klein C.
      • Burke C.S.
      • Stagl K.C.
      • Goodwin G.F.
      • et al.
      Does team training improve team performance? A meta-analysis.
      ,
      • McEwan D.
      • Ruissen G.R.
      • Eys M.A.
      • Zumbo B.D.
      • Beauchamp M.R.
      The effectiveness of teamwork training on teamwork behaviors and team performance: a systematic review and meta-analysis of controlled interventions.
      most were carried out in the United States. Some of the studies included do not provide theoretical training models to support their interventions, which was already evidenced in the research of Walshe et al.
      • Walshe N.
      • Crowley C.
      • OʼBrien S.
      • Browne J.
      • Hegarty J.
      Educational interventions to enhance situation awareness: a systematic review and meta-analysis.
      The predominant methodology employed was the mixed methodology,
      • Buljac-Samardžić M.
      • Dekker-van Doorn C.M.
      • Maynard M.T.
      What do we really know about crew resource management in healthcare?: an umbrella review on crew resource management and its effectiveness.
      i.e., whereby in addition to being trained by an instructor (passive methodology), healthcare professionals actively participate (interactive methodology) as part of their learning (e.g., training in nontechnical skills using high-fidelity simulation). As the literature indicates, this methodology is more effective than traditional methodologies where training focuses on the transmission of information by an expert in an organised and systematic way, combined with the repetitive practice of clinical procedures and skills.
      • Maestre J.M.
      • Szyld D.
      • del Moral I.
      • Ortiz G.
      • Rudolph J.W.
      The training of clinical experts: reflective practice.
      In line with previous meta-analyses,
      • Walshe N.
      • Crowley C.
      • OʼBrien S.
      • Browne J.
      • Hegarty J.
      Educational interventions to enhance situation awareness: a systematic review and meta-analysis.
      most of the simulations performed were high fidelity. According to Alconero-Camarero et al.,
      • Alconero-Camarero A.R.
      • Sarabia-Cobo C.M.
      • Catalán-Piris M.J.
      • González-Gómez S.
      • González-López J.R.
      Nursing students' satisfaction: a comparison between medium- and high-fidelity simulation training.
      when the sample consists of expert professionals, high-fidelity simulations are more effective. However, if the sample is composed of nonexperts, e.g., nursing or medical students, then medium-fidelity simulations would be more effective because of their level of knowledge and clinical experience.
      With regard to the results of the different interventions, significant effect size statistics were obtained for the knowledge, attitude, self-efficacy, global nontechnical skills, teamwork, communication, and leadership variables. In relation to the outcome variable “knowledge”, the effect size observed in our study was larger than that obtained by O'Connor et al.,
      • O'Connor P.
      • Campbell J.
      • Newon J.
      • Melton J.
      • Salas E.
      • Wilson K.A.
      Crew resource management training effectiveness: a meta-analysis and some critical needs.
      but smaller than that of the meta-analysis conducted by O'Dea et at.
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      The “attitude” variable also showed an intermediate effect size between that obtained by O'Dea et al.
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      and O'Connor et al.
      • O'Connor P.
      • Campbell J.
      • Newon J.
      • Melton J.
      • Salas E.
      • Wilson K.A.
      Crew resource management training effectiveness: a meta-analysis and some critical needs.
      It was not possible to compare the variable “self-efficacy” with previous meta-analyses. However, it is considered a very important variable for the evaluation of training interventions, including nontechnical skills.
      • Escribano S.
      • Rocío J.
      • Nereida C.
      • Juana P.
      • Maria José C.
      Spanish Linguistic validation of the self-efficacy questionnaire in communication skills.
      Self-efficacy refers to healthcare professionals' perception of their abilities in terms of communication skills
      • Chung H.
      • Oczkowski S.J.W.
      • Hanvey L.
      • Mbuagbaw L.
      • You J.J.
      Educational interventions to train healthcare professionals in end-of-life communication: a systematic review and meta-analysis.
      and is a key predictor of future behaviour.
      • Ajzen I.
      From intentions to actions: a theory of planned behavior In: anonymous Action control.
      A significant effect size was also observed for “global nontechnical skills” as well as for the specific nontechnical skills measured: teamwork, communication, and leadership. Consistent with these findings, the meta-analysis conducted by McEwan et al.
      • McEwan D.
      • Ruissen G.R.
      • Eys M.A.
      • Zumbo B.D.
      • Beauchamp M.R.
      The effectiveness of teamwork training on teamwork behaviors and team performance: a systematic review and meta-analysis of controlled interventions.
      evidenced a moderately large effect size for teamwork. Finally, it should be noted that, as found in previous meta-analyses with healthcare professionals,
      • Walshe N.
      • Crowley C.
      • OʼBrien S.
      • Browne J.
      • Hegarty J.
      Educational interventions to enhance situation awareness: a systematic review and meta-analysis.
      situational awareness did not show a statistically significant effect with this type of intervention.
      However, the high diversity in the tools used for assessing each of the variables should be considered, which could be influencing the heterogeneity found in this meta-analysis, although we have not reported. O'Dea et at.
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      have shown that this diversity is one of the main limitations in this field of research since a wide variety of instruments are used, sometimes not validated, and include different types of measures, such as self-administered and direct behavioural observations. Standardisation of assessment is recommended
      • O'Dea A.
      • O'Connor P.
      • Keogh I.
      A meta-analysis of the effectiveness of crew resource management training in acute care domains.
      ,
      • Gross Benedict
      • Leonie Rusin
      • Jan Kiesewetter
      • Zottmann Jan M.
      • Fischer Martin R.
      • Stephan Prückner
      • et al.
      Crew resource management training in healthcare: a systematic review of intervention design, training conditions and evaluation.
      and, on the other hand, to analyse whether the assessment instruments are moderating the results.
      In relation to the analysis of moderator variables, only the fidelity level of simulation showed effects on communication nontechnical skills between the team. The results show that interventions are more effective when trained with high-fidelity simulation than those that performed with low fidelity. In keeping with these results, the meta-analysis conducted by Kim et al.,
      • Kim J.
      • Park J.H.
      • Shin S.
      Effectiveness of simulation-based nursing education depending on fidelity: a meta-analysis.
      whose aim was to determine the effect size of simulation-based educational interventions in nursing and compare the results according to the fidelity level, identify high-fidelity simulation offers benefits over low-fidelity simulation in cognitive and affective learning outcomes. While the characterised results are of psychomotor type, it evidenced the medium-fidelity simulation (full-body manikins and can be controlled by an external) as more effective. Therefore, adapting the most effective simulation method to meet the proposed results and objectives could be key.
      • Kim J.
      • Park J.H.
      • Shin S.
      Effectiveness of simulation-based nursing education depending on fidelity: a meta-analysis.
      In addition, high-fidelity simulation has a high cost. So, it is convenient to assess and consider whether other levels of less expensive simulation can achieve similar results.
      • Norman G.
      • Dore K.
      • Grierson L.
      The minimal relationship between simulation fidelity and transfer of learning.
      In any case, it would be advisable to continue research to detect whether there are differences in learning between the different levels.

      4.3 Strengths and limitations

      This study has some limitations. Firstly, the search strategy may have excluded potential articles, even though various search methods were employed to minimise this possibility. Secondly, most of the studies found were quasi-experimental studies with uncontrolled pretest and posttest designs, thus limiting their internal validity. Randomised controlled trials are the most appropriate option, but they are more complicated to carry out, especially in emergency health contexts. Thirdly, the results regarding the heterogeneity should be interpreted cautiously since the I2 index presents some limitations. On the one hand, if the number of studies included in the meta-analysis integrates large sample sizes, the I2 will be larger in the absence the heterogeneity. On the other hand, with the I2 being a proportion, the amount of heterogeneity quantified not is absolute.
      • Borenstein M.
      • Higgins Julian P.T.
      • Hedges Larry V.
      • Rothstein Hannah R.
      Basics of meta-analysis: I2 is not an absolute measure of heterogeneity.
      ,
      • Borenstein M.
      In a Meta-Analysis, the I-squared statistic does not tell us how much the effect size varies.
      Finally, moderate to high levels of heterogeneity were found in the meta-analytic synthesis of all the variables, which influences the final effect of the interventions. Particularly noteworthy is the lack of standardisation in the evaluation of outcomes. A wide variety of instruments, at times not validated, have been used to measure the outcome variables.
      Despite these limitations, there are sufficient studies demonstrating that educational interventions, such as CRM, are teaching methodologies that improve nontechnical skills training in the context of emergency departments and critical care units. The use of training programs based on these methodologies to improve the performance of healthcare professionals could improve safety, team performance, and, ultimately, the quality of patient care.
      • Pizzi L.
      • Goldfarb N.I.
      • Nash D.B.
      Crew resource management and its applications in medicine.

      4.4 Implications for educators and clinical practice

      Specialised services such as emergency and intensive care require highly trained teams to minimise clinical errors and achieve high-quality care performance, and for this, the acquisition of nontechnical skills such as communication, leadership, and teamwork is critical. The results of this meta-analysis evidence the need for professionals to continue to update their skills throughout their careers, as well as the inclusion of high-fidelity simulation in the curricula of healthcare professionals, especially for the acquisition of the communication skills.

      4.5 Areas for future research

      A feasible option for further research therefore appears to be the exploration of effective methodologies for situational awareness training, as evidenced in this study. It would also be advisable to conduct more longitudinal, multicentre, and multispecialty studies with the aim of establishing, with adequate methodological quality, the long-term effects of training in these methodologies and the impact of their transfer to clinical practice. It is essential to know if the results obtained are maintained over time
      • Buljac-Samardžić M.
      • Dekker-van Doorn C.M.
      • Maynard M.T.
      What do we really know about crew resource management in healthcare?: an umbrella review on crew resource management and its effectiveness.
      and, if not, to know the need to implement reminder interventions and their temporality. All of this will lead to more excellent training for dealing with complex scenarios and, therefore, to improve positive health outcomes, quality of care, and patient safety.
      On the other hand, further studies are recommended to examine the efficacy of specific CRM programs versus other simulation programs in other conceptual frameworks aimed at training in nontechnical skills, as well as other factors related to the characteristics of the interventions that could determine their efficacy. The results of this meta-analysis have not been conclusive in this regard for any of the variables studied.

      5. Conclusions

      Learning and teaching nontechnical skills to healthcare professionals in emergency crisis settings using simulation-based methodologies leads to improvements in their knowledge levels, attitudes, and self-efficacy, as well as in the performance of nontechnical skills, both at a global level and in specific skills such as communication, leadership, and work. The situational awareness of the professionals, however, needs to improve.

      Funding

      This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

      CRediT authorship contribution statement

      María Sánchez-Marco: conceptualisation, methodology, formal analysis, writing -original draft, writing -reviewing & editing and funding acquisition. Silvia Escribano: conceptualisation, methodology, formal analysis, writing -original draft, writing -reviewing & editing and funding acquisition. María José Cabañero-Martínez: conceptualisation, methodology, resources, reviewing & editing, supervision and funding acquisition. María Rubio-Aparicio: reviewing & editing, methodology and formal analysis. Rocio Juliá-Sanchís: conceptualisation, methodology, writing -original draft and writing -reviewing & editing and funding acquisition.

      Conflict of interest

      The authors state that they have no conflicts of interest.

      Appendix A. Supplementary data

      References

        • de Montalvo-Jääskeläinen F.
        • Altisent-Trota R.
        • Bellver-Capella V.
        • Cadena-Serrano F.
        • de los Reyes-López M.
        • Gándara-del -Castillo A.
        • et al.
        Report of the Spanish Bioethics Committee on the ethical aspects of patient safety and, specifically, on the implementation of an effective system for reporting safety incidents and adverse events.
        2020
        • Kohn L.T.
        To err is human: building a safer health system.
        National Academies Press, 2000
        • Organization for Economic Cooperation and Development
        The economics of patient safety from analysis to action.
        2020
        • The Joint Commission
        Sentinel event data: root causes by event type 2004-2015.
        2016
        • Flin Rhona
        • O'Connor Paul
        • Crichton Margaret
        Safety at the sharp end: a guide to non-technical skills.
        CRC Press, 2017
        • Leonard M.
        • Graham S.
        • Bonacum D.
        The human factor: the critical importance of effective teamwork and communication in providing safe care.
        Qual Saf Health Care. 2004; 13: i85-i90https://doi.org/10.1136/qshc.2004.010033
        • Gordon M.
        • Darbyshire D.
        • Baker P.
        Non-technical skills training to enhance patient safety: a systematic review.
        Med Educ. 2012; 46: 1042-1054https://doi.org/10.1111/j.1365-2923.2012.04343.x
        • Myers J.A.
        • Powell D.M.C.
        • Psirides A.
        • Hathaway K.
        • Aldington S.
        • Haney M.F.
        Non-technical skills evaluation in the critical care air ambulance environment: introduction of an adapted rating instrument--an observational study.
        Scand J Trauma Resuscitation Emerg Med. 2016; 24: 24https://doi.org/10.1186/s13049-016-0216-5
        • Jafri F.
        • Mirante D.
        • Ellsworth K.
        • Shulman J.
        • Dadario N.
        • Williams K.
        • et al.
        A microdebriefing crisis resource management program for simulated pediatric resuscitation in a community hospital: a feasibility study.
        Simulat Healthc J Soc Med Simulat. 2020; 16: 163-169https://doi.org/10.1097/sih.0000000000000480
        • Cooper G.E.
        • White M.D.
        • Lauber J.K.
        Resource management on the flight deck.
        National Aeronautics and Space Administration, 1980
        • Gaba D.M.
        • Fish K.J.
        • Howard S.K.
        Crisis management in anesthesiology.
        Churchill Livingstone, New York [U.A.]1994
        • Pizzi L.
        • Goldfarb N.I.
        • Nash D.B.
        Crew resource management and its applications in medicine.
        in: Dhojanmia K.G. Duncan B.W. McDonald K.M. Wachter R.M. Making health care safer: a critical analysis of patient safety practices. Evidence report/techonology assessment No 43, AHRQ. vol. 44. 2001: 511-519
        • Nader M.
        • Tasdelen-Teker G.
        • DeStephens A.J.
        • Lampotang S.
        • Prelipcean I.
        • Smith R.D.
        • et al.
        Simulation use in outreach setting: a novel approach to building sustainability.
        Simulat Healthc J Soc Med Simulat. 2022; 17: e136-e140https://doi.org/10.1097/sih.0000000000000555
        • Langdalen H.
        • Abrahamsen E.B.
        • Sollid S.J.M.
        • Sørskår L.I.K.
        • Abrahamsen H.B.
        A comparative study on the frequency of simulation-based training and assessment of non-technical skills in the Norwegian ground ambulance services and helicopter emergency medical services.
        BMC Health Serv Res. 2018; 18: 509https://doi.org/10.1186/s12913-018-3325-1
        • Alinier G.
        Developing high-fidelity health care simulation scenarios: a guide for educators and professionals.
        Simulat Gaming. 2011; 42: 9-26https://doi.org/10.1177/1046878109355683
        • Gaba D.M.
        Crisis resource management and teamwork training in anaesthesia.
        Br J Anaesth. 2010; 105: 3-6https://doi.org/10.1093/bja/aeq124
        • O'Dea A.
        • O'Connor P.
        • Keogh I.
        A meta-analysis of the effectiveness of crew resource management training in acute care domains.
        Postgrad Med. 2014; 90: 699-708https://doi.org/10.1136/postgradmedj-2014-132800
        • O'Connor P.
        • Campbell J.
        • Newon J.
        • Melton J.
        • Salas E.
        • Wilson K.A.
        Crew resource management training effectiveness: a meta-analysis and some critical needs.
        Int J Aviat Psychol. 2008; 18: 353-368https://doi.org/10.1080/10508410802347044
        • Buljac-Samardžić M.
        • Dekker-van Doorn C.M.
        • Maynard M.T.
        What do we really know about crew resource management in healthcare?: an umbrella review on crew resource management and its effectiveness.
        J Patient Saf. 2021; 17: e929-e958https://doi.org/10.1097/PTS.0000000000000816
        • Gross Benedict
        • Leonie Rusin
        • Jan Kiesewetter
        • Zottmann Jan M.
        • Fischer Martin R.
        • Stephan Prückner
        • et al.
        Crew resource management training in healthcare: a systematic review of intervention design, training conditions and evaluation.
        BMJ Open. 2019; 9e025247https://doi.org/10.1136/bmjopen-2018-025247
        • Low X.M.
        • Horrigan D.
        • Brewster D.J.
        The effects of team-training in intensive care medicine: a narrative review.
        J Crit Care. 2018; 48: 283-289https://doi.org/10.1016/j.jcrc.2018.09.015
        • Buljac-Samardzic M.
        • Doekhie Kirti D.
        • van Wijngaarden J.D.H.
        Interventions to improve team effectiveness within health care: a systematic review of the past decade.
        Hum Resour Health. 2020; 18: 2https://doi.org/10.1186/s12960-019-0411-3
        • Ounounou E.
        • Aydin A.
        • Brunckhorst O.
        • Khan M.S.
        • Dasgupta P.
        • Ahmed K.
        Nontechnical skills in surgery: a systematic review of current training modalities.
        J Surg Res. 2019; 76: 14-24https://doi.org/10.1016/j.jsurg.2018.05.017
        • Griffin C.
        • Aydın A.
        • Brunckhorst O.
        • Raison N.
        • Khan M.S.
        • Dasgupta P.
        • et al.
        Non-technical skills: a review of training and evaluation in urology.
        World J Urol. 2019; 38: 1653-1661https://doi.org/10.1007/s00345-019-02920-6
        • Page M.J.
        • McKenzie J.E.
        • Bossuyt P.M.
        • Boutron I.
        • Hoffmann T.C.
        • Mulrow C.D.
        • et al.
        The PRISMA 2020 statement: an updated guideline for reporting systematic reviews.
        BMJ. 2021; 372: n71https://doi.org/10.1136/bmj.n71
        • Lipsey M.W.
        Identifying interesting variables and analysis opportunities.
        in: Cooper H. Hedges L.V. Valentine J.C. The handbook of research synthesis and meta-analysis. Rusell Sage Foundation, 2009: 147-158
        • Reed D.A.
        • Beckman T.J.
        • Wright S.M.
        • Levine R.B.
        • Kern D.E.
        • Cook D.A.
        Predictive validity evidence for medical education research study quality instrument scores: quality of submissions to JGIM's Medical Education Special Issue.
        J Gen Intern Med. 2008 Jul; 23: 903-907https://doi.org/10.1007/s11606-008-0664-3
        • Cook D.A.
        • Reed D.A.
        Appraising the quality of medical education research methods: the medical education research study quality instrument and the newcastle-ottawa scale-education.
        Acad Med. 2015; 90: 1067-1076https://doi.org/10.1097/ACM.0000000000000786
        • Borenstein M.
        • Hedges L.V.
        • Higgins J.P.T.
        Introduction to meta-analysis.
        Wiley, West Sussex, England2009 (p. xxix)
        • Cohen J.
        Statistical power analysis for the behavioral sciences.
        Erlbaum, Hillsdale, NJ [u.a.1988
        • Wan X.
        • Wang W.
        • Liu J.
        • Tong T.
        Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range.
        BMC Med Res Methodol. 2014; 14: 135https://doi.org/10.1186/1471-2288-14-135
        • Cooper H.M.
        • Hedges L.V.
        • Valentine J.C.
        The handbook of research synthesis and meta-analysis.
        Russell Sage Foundation, New York2019
        • Hartung J.
        An alternative method for meta-analysis.
        Biom J. 1999; 41: 901-916https://doi.org/10.1002/(SICI)1521-4036(199912)41:8%3C901::AID-BIMJ901%3E3.0.CO;2-W
        • Aguinis H.
        • Gottfredson R.K.
        • Wright T.A.
        Best-practice recommendations for estimating interaction effects using meta-analysis.
        J Organ Behav. 2011; 32: 1033-1043https://doi.org/10.1002/job.719
        • Viechtbauer W.
        • López-López J.A.
        • Sánchez-Meca J.
        • Marín-Martínez F.
        A comparison of procedures to test for moderators in mixed-effects meta-regression models.
        Psichol Methods. 2015; 20: 360-374https://doi.org/10.1037/met0000023
        • Knapp G.
        • Hartung J.
        Improved tests for a random effects meta-regression with a single covariate.
        Stat Med. 2003; 22: 2693-2710https://doi.org/10.1002/sim.1482
        • Duval S.
        • Tweedie R.
        Trim and fill: a simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis.
        Biometrics. 2000; 56: 455-463https://doi.org/10.1111/j.0006-341x.2000.00455.x
        • Rothstein H.R.
        • Borenstein M.
        • Sutton A.J.
        Publication bias in meta-analysis.
        Wiley, New York2006
        • Egger M.
        • Smith G.D.
        • Schneider M.
        • Minder C.
        Bias in meta-analysis detected by a simple, graphical test.
        BMJ. 1997; 315: 629-634https://doi.org/10.1136/bmj.315.7109.629
        • Viechtbauer W.
        Conducting meta-analyses in R with the metafor package.
        J Stat Software. 2010; 36: 1-48https://doi.org/10.18637/jss.v036.i03
        • Salas E.
        • Diaz Granados D.
        • Klein C.
        • Burke C.S.
        • Stagl K.C.
        • Goodwin G.F.
        • et al.
        Does team training improve team performance? A meta-analysis.
        Hum Factors. 2008; 50: 903-933https://doi.org/10.1518/001872008x375009
        • McEwan D.
        • Ruissen G.R.
        • Eys M.A.
        • Zumbo B.D.
        • Beauchamp M.R.
        The effectiveness of teamwork training on teamwork behaviors and team performance: a systematic review and meta-analysis of controlled interventions.
        PLoS One. 2017; 12e0169604https://doi.org/10.1371/journal.pone.0169604
        • Walshe N.
        • Crowley C.
        • OʼBrien S.
        • Browne J.
        • Hegarty J.
        Educational interventions to enhance situation awareness: a systematic review and meta-analysis.
        Simulat Healthc J Soc Med Simulat. 2019; 14: 398-408https://doi.org/10.1097/sih.0000000000000376
        • Maestre J.M.
        • Szyld D.
        • del Moral I.
        • Ortiz G.
        • Rudolph J.W.
        The training of clinical experts: reflective practice.
        Rev Clin Esp. 2013; 214: 216-220https://doi.org/10.1016/j.rce.2013.12.001
        • Alconero-Camarero A.R.
        • Sarabia-Cobo C.M.
        • Catalán-Piris M.J.
        • González-Gómez S.
        • González-López J.R.
        Nursing students' satisfaction: a comparison between medium- and high-fidelity simulation training.
        Int J Environ Res Publ Health. 2021; 18: 804https://doi.org/10.3390/ijerph18020804
        • Escribano S.
        • Rocío J.
        • Nereida C.
        • Juana P.
        • Maria José C.
        Spanish Linguistic validation of the self-efficacy questionnaire in communication skills.
        Contemp Nurse. 2021; (ahead-of-print:1-10)https://doi.org/10.1080/10376178.2021.2015415
        • Chung H.
        • Oczkowski S.J.W.
        • Hanvey L.
        • Mbuagbaw L.
        • You J.J.
        Educational interventions to train healthcare professionals in end-of-life communication: a systematic review and meta-analysis.
        BMC Med Educ. 2016; 16: 131https://doi.org/10.1186/s12909-016-0653-x
        • Ajzen I.
        From intentions to actions: a theory of planned behavior In: anonymous Action control.
        Springer, 1985: 11-39
        • Kim J.
        • Park J.H.
        • Shin S.
        Effectiveness of simulation-based nursing education depending on fidelity: a meta-analysis.
        BMC Med Educ. 2016 May 23; 16: 152https://doi.org/10.1186/s12909-016-0672-7
        • Norman G.
        • Dore K.
        • Grierson L.
        The minimal relationship between simulation fidelity and transfer of learning.
        Med Educ. 2012; 46: 636-647https://doi.org/10.1111/j.1365-2923.2012.04243.x
        • Borenstein M.
        • Higgins Julian P.T.
        • Hedges Larry V.
        • Rothstein Hannah R.
        Basics of meta-analysis: I2 is not an absolute measure of heterogeneity.
        Res Synth Methods. 2017; 8: 5-18https://doi.org/10.1002/jrsm.1230
        • Borenstein M.
        In a Meta-Analysis, the I-squared statistic does not tell us how much the effect size varies.
        J Clin Epidemiol. 2022; S0895–4356 (00244-X)
        • Haddaway N.R.
        • Page M.J.
        • Pritchard C.C.
        • McGuinness L.A.
        PRISMA2020: an R package and Shiny app for producing PRISMA 2020-compliant flow diagrams, with interactivity for optimised digital transparency and Open Synthesis.
        Campbell Syst Rev. 2022; 18e1230https://doi.org/10.1002/cl2.1230