|
|
||||||||
Ann Thorac Surg 2001;72:1845-1848
© 2001 The Society of Thoracic Surgeons
a Providence Health System, Portland, Oregon, USA
* Address reprint requests to Dr Grunkemeier, 9155 SW Barnes Rd, #33, Portland, OR 97225, USA
e-mail: ggrunkemeier{at}providence.org
For several years, data-driven methodologies have been used in an attempt to improve performance in cardiac surgery programs. The article by Shahian and his colleagues [1] in this issue of The Annals provides a thoughtful and thorough comparison of "Report Cards" and continuous quality improvement (CQI) initiatives. Their conclusions are that CQI, including multi-disciplinary team site visits to identify and share processes and systems, has been proven to be effective. In contrast, "Report Cards" as currently implemented are not satisfactory and have potential to do harm. Their arguments are cogent and comprehensive. They claim that these cardiac surgery report cards are based on "sophisticated mathematical models" which engender "an exaggerated aura of scientific accuracy". Examining some of the deficiencies and limitations of the risk models used for cardiac surgery, from which the report card "Grades" are derived, provides support for their claim.
Curse of a binary outcome
The mathematical models in question are constructed using multivariable regression, which yields a formula which uses the risk factors for an individual patient to provide an estimate (expected value) of his outcome. Building such a model is not a fixed, reproducible exercise, and there are at least 9 reasons why different investigators with the same data set would produce different risk models [2]. For a continuous outcome (cost, length of stay, etc) such a model can exactly predict or at least come close to a patients observed value. But operative mortality is a binary outcome, and an ideal formula would result in a classification of alive or dead. Instead, logistic regression provides the expected mortality, the probability that the patient will be an operative death. This probability is always between 0% and 100%, so it will never match the observed mortality, which is either 0%, for survivors, or 100%, for deaths (Fig 1). The discrimination of such a model is measured by the C-index (area under the ROC curve). The C-index for the model in Figure 1 is 0.80. (Shahian notes that most cardiac surgery models have C-indices between 0.760.82.) A C-index of .50 indicates no discrimination and an index of 1.00 is perfect discrimination. So a value of 0.80 is only 60% of the way between worthless and perfect. Only in aggregate can we achieve agreement between observed and expected mortality. For example, if we have 20 patients with an expected mortality of 0.05 (5%) and one of them dies, we consider the model successful. But note that we do not say which of the 20 will die; this seems unsatisfactory, but that is the best we can do.
|
Another reason for the poor performance of CABG risk models may be the complex pathways of the outcome (death) compared to the relative few risk factors available to predict it. Most current models use a stepwise regression procedure to allow only statistically significant factors. In general the larger the sample size, the more risk factors can be found (an informal rule says the number of risk factors identified should not exceed the number of deaths in the training set divided by 10). Table 1 summarizes recently published multivariable risk models based on at least 2,000 patients [315]. There are 44 different risk factors cited (the original number was larger, but some similar categories were combined). The number of independent risk factors cited by any one paper varied from 5 to 29, with the largest numbers being found by the largest series.
|
Aggregating by provider
To grade a provider (doctor, clinic) the expected number of deaths (the sum of the expected mortality for all the patients in the group, based on the risk model) is compared to the observed number of deaths. Dividing by the number of patients yields the providers expected (E) and observed (O) mortality, respectively. To make comparisons, the Es are usually considered to be without error (!), and sampling error is attached to the Os by assuming, incorrectly, that they are statistically independent. This has the effect of identifying too many outliers (in either direction). Hierarchical models, as advocated by Shahian, extend the traditional risk model structure to compensate for this by reducing the overly optimistic precision of the estimates [20, pages 507 to 511]. Incorporating hospital or surgeon as another dimension of variability results in more realistic estimates of the provider effects, dampening or shrinking them towards the mean value for all providers.
Figure 2 illustrates the use of a heirarchical model, using data from nine hospitals. For comparison, a conventional logistic regression model was used to provide expected mortality for each hospital and the O/E ratios ranged from 0.62 to 1.62 (horizontal axis). Confidence intervals were computed for each hospital [20, page 488]. The 95% confidence intervals (solid horizontal lines) for 2 of the hospitals do not include the value 1 (vertical line), so they would be considered abnormal by this simplistic analysis, and the "significantly high" one would get a "flunking" grade. However, if 99.4% confidence intervals (dashed horizontal lines) are used instead to ensure protection against type I error (finding spurious significance due to multiple simultaneous comparisons), then all of the hospitals would get passing grades.
|
Conclusion
The current methods used to determine risk factors, construct risk models and compute variability result in anti-conservative comparisons of provider effects. The report by Shahian and his colleagues provides a masterful discussion of these and related issues. It deserves a careful reading.
Acknowledgments
We thank the Providence Health System hospitals for use of their cardiac surgery data from the following institutions: AlaskaProvidence Anchorage Medical Center; WashingtonProvidence Everett Medical Center, Providence Seattle Medical Center, Providence St. Peter Hospital (Olympia), Providence Yakima Medical Center; OregonProvidence Portland Medical Center, Providence St. Vincent Medical Center (Portland); CaliforniaProvidence St. Joseph Medical Center (Burbank), Providence Holy Cross Medical Center (Mission Hills).
References
This article has been cited by other articles:
![]() |
D. M. Shahian, S. M. O'Brien, G. Filardo, V. A. Ferraris, C. K. Haan, J. B. Rich, S.-L. T. Normand, E. R. DeLong, C. M. Shewan, R. S. Dokholyan, et al. The Society of Thoracic Surgeons 2008 cardiac surgery risk models: part 1--coronary artery bypass grafting surgery. Ann. Thorac. Surg., July 1, 2009; 88(1 Suppl): S2 - 22. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. A. Ferraris, F. H. Edwards, D. M. Shahian, and S. P. Ferraris Risk Stratification and Comorbidity Card. Surg. Adult, January 1, 2008; 3(2008): 199 - 246. [Full Text] |
||||
![]() |
P. E. Antunes, L. Eugenio, J. Ferrao de Oliveira, and M. J. Antunes Mortality risk prediction in coronary surgery: a locally developed model outperforms external risk models Interactive CardioVascular and Thoracic Surgery, August 1, 2007; 6(4): 437 - 441. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Epstein Do Cardiac Surgery Report Cards Reduce Mortality? Assessing the Evidence Med Care Res Rev, August 1, 2006; 63(4): 403 - 426. [Abstract] [PDF] |
||||
![]() |
D. M. Shahian, D. F. Torchiana, R. J. Shemin, J. D. Rawn, and S.-L. T. Normand Massachusetts Cardiac Surgery Report Card: Implications of Statistical Methodology Ann. Thorac. Surg., December 1, 2005; 80(6): 2106 - 2113. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. K. Nallamothu, K. A. Eagle, V. A. Ferraris, and R. M. Sade Should Coronary Artery Bypass Grafting Be Regionalized? Ann. Thorac. Surg., November 1, 2005; 80(5): 1572 - 1581. [Full Text] [PDF] |
||||
![]() |
G. L. Grunkemeier and A. P. Furnary Mandatory Database Participation: Risky Business? Ann. Thorac. Surg., September 1, 2005; 80(3): 799 - 801. [Full Text] [PDF] |
||||
![]() |
R. Jin, G. L. Grunkemeier, A. P. Furnary, J. R. Handy Jr, and for the Providence Health System Cardiovascular St Is Obesity a Risk Factor for Mortality in Coronary Artery Bypass Surgery? Circulation, June 28, 2005; 111(25): 3359 - 3365. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Shahian, E. H. Blackstone, F. H. Edwards, F. L. Grover, G. L. Grunkemeier, D. C. Naftel, S. A.M. Nashef, W. C. Nugent, and E. D. Peterson Cardiac Surgery Risk Models: A Position Article Ann. Thorac. Surg., November 1, 2004; 78(5): 1868 - 1877. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A M Nashef and R. J F Baskett Quality Assessment in Cardiac Surgery: Do Not Miss the Boat! Asian Cardiovasc Thorac Ann, September 1, 2004; 12(3): 187 - 189. [Full Text] [PDF] |
||||
![]() |
M. A. Herbert, S. L. Prince, J. L. Williams, M. J. Magee, and M. J. Mack Are unaudited records from an outcomes registry database accurate? Ann. Thorac. Surg., June 1, 2004; 77(6): 1960 - 1964. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. E. Cutlip, K. K. L. Ho, R. E. Kuntz, and D. S. Baim Risk assessment for percutaneous coronary intervention: our version of the weather report? J. Am. Coll. Cardiol., December 3, 2003; 42(11): 1896 - 1899. [Full Text] [PDF] |
||||
![]() |
D. M. Shahian and S.-L. T. Normand The volume-outcome relationship: from Luft to Leapfrog Ann. Thorac. Surg., March 1, 2003; 75(3): 1048 - 1058. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. A. Ferraris and S. P. Ferraris Risk Stratification and Comorbidity Card. Surg. Adult, January 1, 2003; 2(2003): 187 - 224. [Full Text] |
||||
![]() |
S. A.M. Nashef and F. Roques Risk Assessment, League Tables, and Report Cards: Where is True Quality Monitoring? Ann. Thorac. Surg., November 1, 2002; 74(5): 1748 - 1749. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ANN THORAC SURG | ASIAN CARDIOVASC THORAC ANN | EUR J CARDIOTHORAC SURG |
| J THORAC CARDIOVASC SURG | ICVTS | ALL CTSNet JOURNALS |