Introduction

Hip fractures (HFs) are one of the most common injuries diagnosed in the emergency department. They are associated with high morbidity and mortality rates in the elderly [1,2,3,4]. To optimize care for elderly HF patients, several guidelines for care and management have been developed worldwide [5,6,7,8].

Also, around the world clinical audits have been started to further improve the quality of the provided HF care. In audits, quality indicators (QIs) are used to measure (outcomes of) care and to enable benchmarking. QIs are measurable aspects of care that reflect the quality of care [9, 10]. They are defined as “measurement tools, screens, or flags that are used as guide to monitor, evaluate, and improve the quality of patient care, clinical support services, and organization functions that affect patient outcomes” [10]. Three categories of QIs are distinguished: structure, process, and outcome indicators [11]. Structure indicators describe what is needed within a hospital or health care system to provide good care and reflect the setting of the provided care [12]. Process indicators provide information about appropriateness of the delivered care and can be measured on patient level [10]. Process indicators are often based on guidelines. QI categorized as an outcome reflect the end results of the provided care.

A good QI must meet four criteria: clinically relevant, scientifically acceptable, feasible, and usable [13, 14]. To be scientifically acceptable, a QI has to be reliable and valid [9]. To meet these criteria, a high-quality QI should undergo a well-described methodological development process [15].

The primary aim of this study was to identify quality indicators for HF care that are reported in the literature, ongoing HF audits, and national guidelines. The secondary aim was to compose a set of methodologically sound quality indicators for the evaluation of HF care in clinical practice.

Methods

This review was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) statement [16]. The study protocol was registered in PROSPERO, the international prospective database of systematic reviews (registration number CRD42016053425).

Search strategy

The search strategy was developed in collaboration with an experienced medical librarian of the Leiden University Medical Center, to identify all relevant publications in MedLine, Embase, Web of Science, Cochrane Library, Cinahl, and Google Scholar. The search strategy included “Hip fracture” and “QIs/benchmarking/audit/medical audit/outcome assessment/process assessment/quality assurance/performance measure” as Mesh and Tiab terms. The exact search strategy is presented in Appendices 14. Publications in English from 1990 up to 14 November 2016 were included.

Parallel to the literature search, an internet search for HF audits worldwide was performed. These websites and their annual reports were searched to identify the QIs used in these audits. In a second internet search, all national HF guidelines published in English were probed for QIs.

Study selection

The first author (S.V.) conducted the search and entered the articles identified in EndNote (EndNote X7; Thomson Reuters, Philadelphia, Pennsylvania). After removal of duplicates, the remaining publications were imported into the web-based software platform Covidence (www.covidence.com). Two authors (S.V. and D.V.) independently screened the titles and abstracts of the articles for relevance, based on the stated inclusion and exclusion criteria. In case of disagreement, a third member (M.W.) was consulted. The full text of articles found to be relevant on the basis of title and abstract was read by the reviewers who made the final selection following the same procedure. The reference lists of the included articles were screened for relevant studies that had been missed in the literature search.

The inclusion criteria were:

  • Studies describing (the development of) QIs/performance measures in HF care.

  • Studies describing the assessment of the quality of QIs/performance measures in HF care.

  • Systematic reviews, meta-analyses, randomized controlled trials, cross-sectional studies, cohort studies, case–control studies, and guidelines on this topic.

Articles were excluded if they described:

  • Non-HF care QIs.

  • QIs for HF patients below 18 years of age.

  • QIs for HF prevention or prehospital HF care.

  • Patient reported outcomes measures (PROMs) for HF care.

  • Meeting abstracts.

Data extraction

The definition and operationalization of the reported indicators were extracted from the selected articles. Instead of assessing the quality of the selected articles, the type and quality of the indicators were assessed. The Donabedian quality of care model [11] was used to categorize the QIs as structure, process, or outcome indicator.

All identified articles, audits, and guidelines were screened to obtain information about the quality of the QIs. The AIRE instrument (Appraisal of Indicators through Research and Evaluation) is an assessment tool for the methodological quality of QIs. In order to use the AIRE instrument, information on clinical relevancy, scientific acceptability, feasibility, and usability of the QIs has to be described [17]. If the articles did not provide the information needed for the application of the AIRE instrument, the construct validity of the QIs was assessed as the correlation of the structure and process QIs with one or more outcome measures [18]. Worthy of note is that the outcome measures that were used to judge the predictive value of the indicator are different from outcomes categorized as an outcome QI.

The set of QIs to be selected should be based on qualitative measures, preferably using the AIRE instrument or, if this was not possible, on the basis of their construct validity. Since not enough qualitative information was available, it was decided to use a quantitative measure for the QI selection. This selection criterion was that the QIs were described in at least two articles and were used in at least two audits or guidelines.

Results

Study selection

The literature search resulted in 1210 hits (Fig. 1). After removal of duplicates and meeting abstracts, 696 articles were available for assessment. Based on title and abstract, a total of 653 articles were excluded. After full-text screening of the remaining 43 articles, a further 29 articles were excluded. Two articles were included based on screening of the reference lists.

Fig. 1
figure 1

Flowchart of study selection

The 16 selected studies included 15 cohort studies (3 prospective and 12 retrospective) and 1 systematic review (Table 1). The cohort studies covered a total of 593,584 HF patients, and the study of Neuburger represents almost 80% of these patients.

Table 1 Overview of quality indicators for in-hospital hip fracture care, reported in studies, audits, or guidelines

Websites of ongoing hip fracture audits

Nine national HF audits were identified: the National Hip Fracture Database (United Kingdom minus Scotland), Scottish Hip Fracture Audit (Scotland), Australian and New Zealand Hip Fracture Registry (Australia/New Zealand), Danish Multidisciplinary Hip Fracture Registry (Denmark), Rikshöft (Sweden), the Dutch Hip Fracture Audit (The Netherlands), Irish Hip Fracture Audit (Ireland), Kaiser Permanente National Implant Registries (United States), and The Norwegian Hip Fracture Register (Norway). On the websites of the first seven audits, QIs were described. The QIs used in the United States were obtained by e-mail. No QIs were described in the Norwegian Hip Fracture Register [46, 47].

Hip fracture guidelines

Five hip fracture guidelines were probed for quality indicators. Two guidelines did not report on QIs: management of hip fractures in the elderly by the American Academy of Orthopedic Surgeons (AAOS) and management of hip fracture in older people by the Scottish Intercollegiate Guidelines Network (SIGN) [6, 7]. The National Institute for Health and Care Excellence (NICE) wrote the management of hip fracture in adults (CG 124); this guideline was the basis of two different standards with QIs: the Hip fracture in Adults: Quality Standard 16 and the British Orthopaedic Association Standards for Trauma [5, 43, 44]. The Australian and New Zealand Hip Fracture Registry has published an overall Hip Fracture Care Clinical Care Standard, which contains both the audits’ and the guidelines’ QIs [8, 37, 48]. In Canada, the national QIs were described in the National Hip Fracture Toolkit [45].

Identified quality indicators

In the included articles, audits, and guidelines 217 QIs were described (Table 1). Some of the reported QIs were similar, leaving 97 unique QIs. The unique QIs included 9 structure indicators (Table 2), 63 process indicators (Table 3), and 25 outcome indicators (Table 4). Sixty-five QIs were only described in one article or audit. The process indicator “time to surgery within a specific time frame” was described most frequent in 12 of 16 articles and in all audits and guidelines.

Table 2 Structure indicators for hip fracture care
Table 3 Process indicators for hip fracture care
Table 4 Outcomes categorized as quality indicators for hip fracture care

Quality of the QIs

Limited information was found in the articles, on the audit websites, and in the guidelines that could be used to assess the quality of the identified QIs regarding clinical relevancy, scientific acceptability, feasibility, and usability. In addition, the articles, audits, and guidelines used different definitions for the same QI. The AIRE Instrument could therefore not be applied.

Information on the construct validity was obtained for the structure and process QIs. In 11 of 16 articles, one audit and one guideline QIs were correlated with an outcome measure. In total, 30 different outcome measures to judge the predictive value of the indicators were used: mortality rate (in-hospital, within 1 month (crude and adjusted), and after 3, 6, and 12 months), readmission (after 1, 3, and 6 months), length of stay (postoperative length of stay on trauma ward, postoperative length of hospital stay, and overall hospital length of stay), reoperation rate, 30-day reoperation rate, functional outcome (FIM score after 2 and 6 months, Parker/KATZ-ADL score after 3 months, functional outcome after 1 and 5 years), discharge back home, place of residence (after discharge, after 30 days and after 3 months), return to pre-hip fracture level of mobility, complication rate, pressure ulcer occurrence, non-union of fracture, HFs sustained as an inpatient, case ascertainment, and surgery on day of admission. In six articles, QIs were correlated to one or more outcome measures. In five articles, only a set of QIs was correlated to outcome measures, and in five articles, no correlation was assessed.

One of nine structure indicators (presence of a post-discharge rehabilitation program) was reported to have a positive correlation with an outcome measure (30-day mortality, Table 2). Ten of the 63 process indicators were correlated with various outcome measures(Table 3): Hip Fracture Program during admission, time to surgery within a specified time, total hip replacement in defined conditions, extramedullary implants in AO classification types A1 and A2, IM nail with a subtrochanteric fracture, fracture prevention assessment, being mobilized within a specific time after surgery, systematic pain assessment, assessment of activities of daily living before fracture, and assessment of activities of daily living before discharge.

Selected set of quality indicators for a hip fracture audit

Information about the methodological quality of the HF QIs was lacking. Furthermore, the construct validity of the QIs was assessed for just 24 of the 72 structure and process QIs. For only 11 QIs a correlation with a limited number of outcome measures used to judge the predictive value of the indicator was found. It was therefore impossible to select a set of QIs based on qualitative criteria.

As an alternative, we applied quantitative criteria and selected QIs that were described in at least two articles and were used in at least two existing audits/guidelines. This produced the following set of nine QIs consisting of one structure indicator, six process indicators, and two outcome indicators.

  • Orthogeriatric management during admission (structure indicator, correlation with outcome not tested)

  • Time to surgery (process indicator, correlated with 1-year mortality)

  • Time to mobilization after surgery (process indicator, correlated with length of stay, 30-day readmission, and 30-day mortality)

  • Future fracture prevention assessment (process indicator, correlated with 30-day readmission and 30-day mortality rate)

  • Systematic pain assessment (process indicator, correlated with 30-day readmission and 30-day mortality rate)

  • Assessment of malnutrition (process indicator, no correlation with outcome found)

  • Prevention/assessment of pressure ulcer (process indicator, no correlation with outcome found)

  • Mortality rate (outcome indicator)

  • Return to the place of residence within a specific time frame (outcome indicator)

Discussion

This study is the first systematic review of the available literature, existing audits, and guidelines that summarizes existing QIs for HF care. A wide variety of QIs was found, covering different aspects and outcomes of HF care. No information on the clinical relevancy, scientific acceptability, feasibility, and usability of the QIs was found to assess the methodological quality.

Development of methodologically sound quality indicators

QIs differ from recommendations made in guidelines, as QIs must indicate the quality of delivered care [15]. Methodologically sound QIs should be developed in a systematic manner [49, 50]. For instance, Martin-Khan et al. [51] used a three-step development process to define a set of QIs for measuring the quality of care provided to elderly in the emergency department. Ideally, the QIs for HF care should have been developed in a similar manner, but this has not been described in the literature. It seems that the QIs described and used in the included articles and audits are obtained from guideline recommendations and applied without being systematically evaluated first. This might explain the wide variety of QIs that were found and the fact that 59 of the 97 QIs were described/used in only one article, audit, or guideline.

The clinimetric properties of the identified quality indicators

If QIs are properly developed and described, the clinical relevance, validity, reliability, feasibility, and usability can be assessed [49]. Thus, the methodological quality of QIs for several clinical conditions has been reviewed using the AIRE instrument [52,53,54,55,56]. For the identified QIs for HF care in our review, however, information about these parameters was missing and the AIRE instrument could not be applied.

Only information on the construct validity of some of the QIs could be found in the literature. A correlation with one or more outcome measures was studied for 24 of the 72 structure and process QIs, and reported present for 11 of these QIs. Future research should focus on the assessment of relevance, reliability, feasibility, and usability of the existing QIs through interviews, surveys, audits, or focus groups [50]. Assessing a set of QIs rather than individual QIs could be considered, as in three of the included articles a set of QIs was associated with an improvement in outcome measures whereas individual QIs were not [21, 33, 34].

Evaluation of the proposed quality indicator set

Since the methodological quality of the identified QIs could not be assessed, the proposed set of nine QIs was based on quantitative instead of qualitative criteria. The following discussion of each proposed QI is based on the available evidence presented in this systematic review, supplemented with information found in other available literature on these individual QIs.

Orthogeriatric management during admission (structure indicator)

This QI is described in four articles and three audits/guidelines. In the included articles, audits, and guidelines, this indicator was not evaluated against outcome measures to assess the construct validity. However, in other literature, evidence for this QI was found, as two reviews support the beneficial effects of orthogeriatric care models on mortality [57, 58]. This finding was confirmed in a recent prospective cohort study by Folbert et al. [59] that showed a significant decrease in the 1-year mortality rate from 35.1 to 23.2% after implementation of an integrated orthogeriatric treatment model. The available evidence suggests that this might be a promising QI.

Time to surgery (process indicator)

This QI is described in all the identified audits/guidelines and in 12 of 16 included articles. Various time frames for surgical delay (varying from 24 to 48 h) are used in the definition of this QI. Sund et al. [32] found a correlation between operative delay and a higher mortality rate; the other included articles found no correlation with the complication rate, place of residence after 3 months, functional status after 2 and 6 months, in-hospital mortality, and mortality after 3, 6, and 12 months [22, 23, 34]. The Hip fracture in Adults: Quality Standard 16 stated that delays in surgery are negatively associated with mortality and return to prefracture mobility [43].

In the literature, a debate is ongoing whether a specific time frame should be used and, if so, what this time frame should be (ranging from 24 to 48 h). Three systematic reviews stated that the timing of surgery is complex and that confounding might be present in all included articles [60,61,62]. Patients with delayed surgery have more comorbidities so it might be better to optimize them first. Based on evidence currently available, the time frame after which the risk of mortality increases is still unclear. The complication rate seems to increase with every delay in time to operation.

As suggested by Panesar et al. [63], the physical condition of weak patients should be optimized before surgery. In our opinion, the ideal time frame in the definition of this QI should be specified differently for fit patients (ASA I–II) and frail patients (ASA III–IV).

Time to mobilization after surgery (process indicator)

This QI was described in three articles and five audits/guidelines. For this QI, the time frame differed from 24 to 48 h after surgery. A correlation with better performance on six outcome measures (length of hospital stay, complication rate, return to pre-hip fracture level of mobility, 30-day readmission, return to original residence, and 30-day mortality) was described [21, 43], which renders this a promising QI. On the other hand, a review by Handoll et al. [64] concluded that there is insufficient evidence to substantiate the supposed effect of specific postoperative mobilization strategies.

Future fracture prevention assessment (process indicator)

In seven articles and in six audits/guidelines, future fracture prevention was described as a QI. Two types of fracture prevention were reported: (1) bone health assessment and treatment (if necessary) and (2) risk of falls assessment and future fall prevention. Some articles, audits, and guidelines consider this as one QI [20, 30, 31, 44] and others as two separate QIs [21, 24, 25, 33, 35, 36, 42, 65].A correlation between anti-osteoporotic medication and 30-day readmission was found by Kristensen et al. [21]; bone health assessment and treatment was not correlated with 30-day mortality rate and length of hospital stay. For prevention of future fall incidents, they found no correlation with 30-day mortality rate, 30-day readmission rate, and length of hospital stay. The study of Nielsen et al. [33] found a correlation between the initiation of anti-osteoporotic medication and a lower 30-day mortality rate.

We believe that the two types of fracture prevention (assessment and treatment of bone quality and fall prevention) can be taken together as one single QI, as they both have the same aim. It is important that the composite QI is described clearly and that the numerator and denominator are well defined. With this composite QI, it may be more likely that changes in quality of care due to preventive measures can be identified.

Systematic pain assessment (process indicator)

This indicator is described in two articles and three audits/guidelines. For this indicator, a correlation with lower 30-day readmission and 30-day mortality was described [21, 33]. The timing of pain assessment differed between the articles and audits/guidelines. Evidence for the timing and strategy of analgesia is also lacking in literature but is difficult to obtain with well-designed trials [66]. Recommendations in guidelines are therefore based on consensus rather than evidence [5].

Assessment of malnutrition (process indicator)

The assessment of the nutritional status is described as a QI in two articles and two audits/guidelines. Of the included articles and audits, only Nielsen et al. [33] correlated this indicator with an outcome measure. They found no correlation with the 30-day mortality rate, while the correlation with other outcome measures was not tested for this QI individually.

The review by Avenell et al. [67] showed that nutritional supplementation did not have an effect on the mortality of HF patients. There is low-quality evidence that oral nutritional supplementation started before or soon after surgery might prevent complications (pressure sore, infection, venous thrombosis, pulmonary embolism) and might shorten the length of hospital stay [68, 69].

Prevention/assessment of pressure ulcer (process indicator)

Two articles, two audits, and one guideline used this QI. However, the guideline combined the pressure sore assessment/treatment with the thromboembolism assessment/treatment in their QI [44]. The correlation with the outcome measures “time to surgery” and “30-day mortality” was not tested for the QI individually, but as part of a set including five other QIs [25].

As stated before, in the literature a longer waiting time to operation is associated with an increase in complications especially pressure ulcers [60,61,62]. In a prospective cohort study of 567 patients, the influence of pressure ulcers on the 6-month mortality rate was studied. Magny et al. found that having a pressure ulcer was associated with an increased 6-month mortality rate [70]. The occurrence of pressure ulcers was also used as outcome QI in two articles and one guideline [24, 28, 37].

Mortality rate (outcome indicator)

This QI was used in three articles and four audits/guidelines. The time frame for mortality varied between 30-day, 90-day, 120-day, and 1-year mortality. When comparing outcomes of care such as mortality between hospitals (benchmarking), differences in patient characteristics between the hospital populations should be accounted for in the analysis. This so-called casemix correction enables a fair comparison [71]. In the HF audit of England, Wales, and Northern Ireland, a casemix correction model has already been developed and is used in the evaluation of mortality [35]. This casemix correction model might also be suitable for other HF audits, but should be validated first in other settings.

Return to the place of residence within a specific time frame (outcome indicator)

This QI was described in two articles and four audits/guidelines. Whether HF patients can return to their original place of residence does not only depend on the in-hospital care, but also on the quality of the rehabilitation program. This QI may therefore provide insight into the overall quality of HF care. To obtain this information may be a logistical challenge, as the final place of residence may not be known at discharge.

Strengths

The broad spectrum of the identified QIs is in line with a recent scoping literature review of (potential) QIs for HF care conducted by Pitzul et al. [72]. As opposed to their review in which they grouped the QIs in a limited number of constructs, we evaluated the QIs individually and retrieved the available evidence for the methodological quality of the identified QIs. In addition, the search underlying the present review not only covered the available literature but also ongoing audits and HF guidelines. Our search for ongoing audits seems to be complete, as all the identified audits have also been described by Johansen et al. who recently published a HF audit overview [73].

In our review, we also recommend a set of QIs for future clinical research, including the most frequently mentioned and used indicators.

Limitations

Many QIs were identified, but their methodological quality could not be determined. Also, a clear definition was lacking for most of the existing QIs, or the definition differed between articles, audits, and guidelines. For this review, we therefore grouped the QIs that concern the same aspect of care. This makes it even more difficult to evaluate their methodological quality and to decide how these QIs can be defined best for the purpose of evaluating the quality of HF care. Due to these limitations, a set of QIs for use in clinical practice could not be selected on the basis of scientific evidence. As an alternative, we propose a set of nine QIs that are frequently described in the literature and are commonly used in clinical audits and guidelines. As this selection is based on quantitative criteria, we want to underline that the recommended set of quality indicators is only a suggestion. Their value as instruments for evaluating and improving HF care has yet to be ascertained. This set should therefore not be implemented as standard and should not prevent clinicians and policy-makers from using other QIs. The ultimate goal should be to define a standard set of evidence-based QIs that can be used for (inter)national benchmarking and for improving HF care based on best practices worldwide.

Conclusion

In conclusion, many HF structure/process/outcome QIs are available and being used in audits worldwide, but there is little evidence for their methodological quality and usability. The focus of future research should therefore be on assessing the methodological aspects of the existing QIs. In particular, further study of the predictive validity of QIs on outcomes that are meaningful to patients and those running health care systems is needed. As evidence-based QIs for HF care cannot be identified based on the available literature, we recommend to use the set of nine indicators described in this review as the basis for further clinical research. Should the development of additional or new QIs be required, this should be done through a systematic approach.