ATS
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Author home page(s):
David M. Shahian
David F. Torchiana
Richard J. Shemin
James D. Rawn
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Shahian, D. M.
Right arrow Articles by Normand, S.-L. T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shahian, D. M.
Right arrow Articles by Normand, S.-L. T.
Related Collections
Right arrow Professional affairs

Ann Thorac Surg 2005;80:2106-2113
© 2005 The Society of Thoracic Surgeons


Original article: Cardiovascular

Massachusetts Cardiac Surgery Report Card: Implications of Statistical Methodology

David M. Shahian, MD a , * , David F. Torchiana, MD b , Richard J. Shemin, MD c , James D. Rawn, MD d , Sharon-Lise T. Normand, PhD e

a Department of Surgery, Caritas St. Elizabeth's Medical Center, Boston, Massachusetts
b Department of Cardiac Surgery, Massachusetts General Hospital, Boston, Massachusetts
c Department of Cardiac Surgery, Boston University Medical Center, Boston, Massachusetts
d Division of Cardiac Surgery, Brigham and Women's Hospital, Boston, Massachusetts
e Department of Health Care Policy, Harvard Medical School, and Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts

Accepted for publication June 28, 2005.

* Address correspondence to Dr Shahian, Department of Surgery, Caritas St. Elizabeth's Medical Center, 736 Cambridge St, Boston, MA 02135 (Email: david.shahian{at}caritaschristi.org).


    Abstract
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Statistical Appendix
 Acknowledgments
 References
 
BACKGROUND: Choice of statistical methodology may significantly impact the results of provider profiling, including cardiac surgery report cards. Because of sample size and clustering issues, logistic regression may overestimate systematic interprovider variability, leading to false outlier classification. Theoretically, the use of hierarchical models should result in more accurate representation of provider performance.

METHODS: Extensively validated and audited data were available for all 4,603 isolated coronary artery bypass grafting procedures performed at 13 Massachusetts hospitals during 2002. To produce the official Massachusetts cardiac surgery report card, a 19-variable predictor set and a hierarchical generalized linear model were employed. For the current study, this same analysis was repeated with the 14 predictors used in the New York Cardiac Surgery Reporting System. Two additional analyses were conducted using each set of predictor variables and applying standard logistic regression. For each of the four combinations of predictors and models, the point estimates of risk-adjusted 30-day mortality, 95% confidence or probability intervals, and outlier status were determined for each hospital.

RESULTS: Overall unadjusted mortality for coronary bypass operations was 2.19%. For most hospitals, there was wide variability in the point estimates and confidence or probability intervals of risk-adjusted mortality depending on statistical model, but little variability relative to the choice of predictors. There were no hospital outliers using hierarchical models, but there was one outlier using logistic regression with either predictor set.

CONCLUSIONS: When used to compare provider performance, logistic regression increases the possibility of false outlier classification. The use of hierarchical models is recommended.


    Introduction
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Statistical Appendix
 Acknowledgments
 References
 
Cardiac surgery report cards have now been published by a number of states including Pennsylvania, New York, New Jersey, and Massachusetts [1–3]. Similar provider reports have been used for confidential, quality improvement initiatives by the Department of Veteran's Affairs [4], the Northern New England Cardiovascular Disease Study Group (NNE) [5, 6], and by The Society of Thoracic Surgeons (STS) [7]. In most such studies, risk-adjusted mortality rates (RAM) are calculated from logistic regression models, and the resulting estimates are presented with "accuracy" to the nearest one hundredth of a percentage point [1]. Although usually accompanied by 95% confidence intervals (CI), these point estimates of RAM are inevitably the major focus of consumer and media attention. Most observers do not realize that the point estimates of RAM, their associated CI, and the determination of outlier status are all highly dependent on a number of statistical decisions made during the study design, some of which represent personal preferences from among a number of reasonably credible alternatives. These include choice of a database instrument; selection of the predictor variables; management of missing data; details of model development; audit and validation; and the statistical methodology for calculating and comparing risk-adjusted mortality rates among providers [1, 8–12].

To investigate how the choice of a particular set of predictors and statistical methodology may affect the results of cardiac surgery report cards, we have utilized the highly audited data set that formed the basis of the Massachusetts 20002 cardiac surgery report card [13].


    Material and Methods
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Statistical Appendix
 Acknowledgments
 References
 
In 2000, the Massachusetts legislature established a Cardiac Care Quality Commission (MA CCQC) to monitor invasive cardiac outcomes in the state, and it specifically mandated publication of an annual report card beginning with procedures performed in 2002. The MA CCQC elected to utilize the STS National Cardiac Database (NCD) as the data collection instrument for this project. For the current study, we used the 2002 coronary artery bypass grafting surgery (CABG) cohort from this database, which was the basis for the first Massachusetts cardiac surgery report card.

There were 13 hospitals performing CABG in Massachusetts in 2002, of which two were new programs with a limited number of cases. There were 53 active cardiac surgeons in the Commonwealth with a median of 95 isolated CABG admissions. All of the programs in the state were major academic or tertiary centers, or were sponsored by such a center. Of 7,661 hospital admissions in which cardiac surgery was performed in Massachusetts in 2002, 4,604 involved solitary CABG procedures. One patient was lost to follow-up in Europe, leaving 4,603 patients available for analysis.

All cardiac surgery data were collected, cleansed, audited, validated, analyzed, and warehoused at the Massachusetts Data Analysis Center (Mass-DAC), which was based in the Department of Health Care Policy at Harvard Medical School. The Massachusetts Inpatient Acute Hospital Case Mix and Charge database and the Massachusetts Mortality Index were used to validate the data submissions. At regular intervals, each hospital received reports on invalid, missing, inconsistent, or out-of-usual range data, and they were given 30 days to justify or correct them. One hundred fourteen data quality reports were issued, with a mean of 8.8 per hospital (range, 5 to 12).

Fourteen informational meetings were held between August 15, 2002, and October 13, 2004, regarding the 2002 data. Separate meetings were held for data managers, chiefs of cardiac surgery, and members of the Mass-DAC Cardiac Advisory Board (composed of experts from other major database and quality monitoring organizations).

Two separate audits were conducted, the first in the spring of 2003 by MassPro, a Massachusetts quality improvement organization. This audit comprised approximately 500 cases, including all deaths (the endpoint in our analysis was 30 day all-cause mortality) plus a random sample of surviving patients from each hospital. After final data submission closeout and initial cleansing, a second audit was performed of selected data fields including cases that were initially coded as "other cardiac surgery," urgent and emergent procedures, and certain patient codes such as unstable angina, chronic obstructive pulmonary disease, or advanced New York Heart Association class. When necessary, hospitals were required to submit additional documentation including histories and physicals, progress notes, operative notes, intensive care unit flow charts, and discharge summaries. Of 1,820 charts initially identified for audit, 724 were selected for review by an adjudication committee consisting of three cardiac surgery members from three different institutions. Unanimous consent was required for each problematic entry. A total of 835 changes were made by this committee, more than half of which were in one institution that had systematically miscoded angina.

For the public report card, a 19-variable Mass-DAC predictor set was derived based on expert literature review. A random intercept, two-level hierarchical generalized linear model (HGLM) was chosen as the analytical technique [14] (see Statistical Appendix), and was estimated using BUGS software (Bayesian Inference Using Gibbs Sampling, version 0.60; MRC Biostatistics Unit, Cambridge, United Kingdom). The specific BUGS code for this study is available from the authors upon request. Exchangeability was assumed across all providers [15, 16]. Because hospital characteristics were excluded from the model, hospitals were compared globally, not just to similar hospitals [17, 18]. A standardized mortality incidence rate was determined for each provider, which is conceptually similar to the RAM used in report cards based upon logistic models. It is computationally quite different, however, and is determined by dividing the "smoothed," risk-adjusted, provider-specific estimate of mortality by the estimate of expected mortality obtained using the average intercept for all Massachusetts providers (see Statistical Appendix). This quantity, similar to the observed/expected ratio, is then multiplied by the state unadjusted mortality rate to obtain the standardized RAM. The credible 95% probability interval (PI) for each provider was then estimated and was compared with the state average mortality of 2.19%. As there is no simple closed-form solution for the hierarchical estimator of risk-adjusted mortality (and 95% PI), models were estimated using Gibbs sampling. After an initial burn-in of 2,000 iterations, parameter estimates were based on the subsequent 5,000 draws. Ninety-five percent PIs were obtained through identification of the 2.5th and 97.5th percentiles of the 5,000 RAMs. This was the methodology used to construct the first Massachusetts public report card, published in October 2004.

For the purposes of this paper, three additional analyses were then conducted using the exact same data set. We first recomputed the HGLM estimates using a "pseudo-New York" predictor set based on the 14 variables included in the New York Cardiac Surgery Reporting System (CSRS). Slight modifications were necessary because the definitions varied somewhat between the STS NCD and CSRS databases. Finally, two additional analyses were performed using both the Mass-DAC and CSRS predictor variables and applying standard logistic regression, the method used in the New York CSRS, the NNE, the STS NCD, and the Department of Veteran's Affairs [1, 2, 5, 8, 19]. Ninety-five percent CIs were calculated for the logistic models [20]. Therefore, in total, we calculated hospital-specific, risk-adjusted mortality rates and determined outlier status using four combinations of predictor variables and statistical methodologies.

The prior distribution for the between-hospital variance parameter ({tau}2) is the key determinant of the degree of shrinkage in our HGLM. We selected a gamma distribution, gamma (0.001, 0.001), with a mean of 1 and variance of 1,000 for the between-hospital precision parameter {tau}–2, the inverse of the between-hospital variance parameter. This "just proper" distribution has been used in previous CABG profiling studies and is often a reasonable choice, although it does favor low variance [16, 21]. We assessed the robustness of our conclusions by varying the values of the mean and variance of this gamma distribution.

It is also important that the hierarchical model employ estimates of between-hospital standard deviation that are uninfluenced by potential aberrant providers. Otherwise, the model might accommodate excessive variability and thus have reduced sensitivity to detect true outliers. Using an odds ratio median and range methodology [21], we assessed the reasonableness of our estimate of between-hospital standard deviation {tau} when using the gamma distribution.


    Results
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Statistical Appendix
 Acknowledgments
 References
 
The Mass-DAC and pseudo-NY predictors, their frequencies and odds ratios (95% CI) are presented in Tables 1 and 2. Go The point estimates of RAM, their associated 95% CI or PI, and their receiver operating characteristic curve areas are summarized in Table 3. Note that the discrimination of the four combinations of model and predictors was nearly identical. The risk-adjusted mortality rates are also presented graphically in Figure 1 in combination with their 95% CI or PI and the state average mortality of 2.19%. The horizontal axis scale is the same for each graph, thus facilitating visual comparison of the mortality point estimates, CIs, and PIs. Applying logistic regression techniques to either the Mass-DAC or NY predictor variables, hospital 13 was an outlier, indicated by the fact that the lower limit of the 95% CI exceeded the state average mortality of 2.19%. However, using HGLM there were no outliers. "Shrinkage" inherent in the hierarchical model resulted in narrower 95% PIs compared with the CIs for logistic regression, and this shrinkage also reduced the apparent interhospital variability [22]. Using HGLM, the observed zero mortality rates of two start-up programs with only a few cases are shrunk towards the overall state mean of 2.19%. Figure 2 demonstrates just the point estimates of risk-adjusted mortality for each hospital using each of the four combinations of predictor variables and statistical model. Substantial variability is demonstrated, most of which is attributable to the choice of analytical model rather than the predictors (Mass-DAC or NY).


View this table:
[in this window]
[in a new window]
 
Table 1. Massachusetts Data Analysis Center Predictors and Odds Ratios
 

View this table:
[in this window]
[in a new window]
 
Table 2. Pseudo-New York Predictors and Odds Ratios
 

View this table:
[in this window]
[in a new window]
 
Table 3. Summary of Observed and Risk-Adjusted Outcomes for 13 Massachusetts Providers
 


View larger version (18K):
[in this window]
[in a new window]
 
Fig 1. Standardized mortality incidence rates (RAM) with 95% confidence intervals (CI [logistic model]) or 95% probability intervals (hierarchical generalized linear model) for each of the four combinations of predictors and statistical models. (Mass-DAC = Massachusestts Data Analysis Center; NY = New York.)

 


View larger version (19K):
[in this window]
[in a new window]
 
Fig 2. Point estimates of risk-adjusted mortality (standardized mortality incidence rate [RAM]) for each hospital, using each of four combinations of predictors and statistical models. RAM% = standardized mortality incidence rate, or risk-adjusted mortality; solid circle = logistic Massachusetts Data Analysis Center (Mass-DAC); solid triangle = hierarchical generalized linear model (HGLM) Mass-DAC; gray circle = logistic New York; gray triangle = HGLM New York.

 
Using the gamma prior distribution (0.001, 0.001) and either the Massachusetts or New York data, our HGLM posterior estimate of the between-hospital standard deviation {tau} was 0.2. This corresponds to an odds ratio range (97.5% to 2.5% points of the odds ratio distribution) of 2.19, and a median odds ratio (median ratio of the maximum to minimum odds ratios in a random pair) of 1.24, both of which appear reasonable [21]. Sensitivity analysis demonstrated little difference in our results as we varied the values of the mean and variance of the gamma prior distribution chosen for the between-hospital precision parameter {tau}–2. With alternative priors (gamma [1.0E-2, 1.0E-2], gamma [1.0E-1, 1.0E-1]) for the between-hospital precision parameter {tau}-2, the risk-adjusted mortality of hospital 13 was most affected but the lower limit of its 95% PI still included the state mean (ie, not an outlier). Furthermore, the posterior estimates of between-hospital standard deviation using these different gamma priors were all less than 0.4.


    Comment
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Statistical Appendix
 Acknowledgments
 References
 
Choice of the optimal statistical methodology is crucial for accurate provider profiling. Because of the "accuracy" with which cardiac surgery risk-adjusted mortality rates have been presented in the literature, many observers are unaware of how dependent these results are upon numerous statistical decisions made during design of the study, including choice of statistical model. Although logistic regression is the most commonly employed statistical model to determine risk-adjusted mortality and to compare providers, there are significant concerns regarding this method [1, 8, 14–16, 23–25]. For example, logistic regression neglects clustering of observations among providers (such as heart failure patients at transplant centers). This intraclass correlation effectively reduces the number of independent observations from any provider and increases the standard error of the observations. This is one common cause of overdispersion in logistic models. Although other methods have been used to analyze clustered data (eg, generalized estimating equations, alternating logistic regression, median and interval odds ratios [26–32]), they do not provide a direct method for estimating risk-adjusted hospital-specific rates and their corresponding variances, which is the primary focus of cardiac surgery profiling. We prefer the hierarchical approach, which mimics the belief that there are unmeasured factors leading to systematic differences between true hospital mortality rates, and from which hospital-specific mortality estimates and their associated variances can be explicitly determined.

Likewise, in standard logistic regression, the estimates of observed mortality obtained from low volume providers are a much less accurate reflection of their true unobserved mortality. The problem of small sample size as it relates to provider profiling is well known and has received extensive attention in the literature [1, 14, 33, 34]. Statistical theory dating back to the work of Stein and James more than 50 years ago [35] suggests that better estimates from small samples can be obtained by "shrinking" the observed values, the degree of shrinkage being inversely proportional to the number of observations. The shrunken estimate represents a weighted average between the observed value for the sample and the grand mean for the entire population of similar samples. Such estimates also account for the phenomenon of "regression to the mean" and thus are better predictors of future performance [36].This is important, as there is typically a 1- to 2-year lag between data collection and publication of report cards ("the future is now").

If the effects of clustering and small sample size are ignored, the net result will be overestimation of systematic interprovider variability, underestimation of random interprovider variability, and an increased potential for the provider to be falsely classified as an outlier. This is further magnified by the problem of multiple comparisons, which can lead to the spurious identification of significant differences between providers [33, 36].

Increasingly, many statisticians believe that hierarchical generalized linear models of the type employed in Massachusetts, and recently introduced for use in the STS NCD, are a preferable approach. They specifically account for both clustering and small sample sizes, thus ensuring better overall precision in provider estimates, and through the shrinkage process they also mitigate the problem of multiple comparisons [33, 36]. Such models assume exchangeability among providers. That is, they explicitly acknowledge the heterogeneity among providers but assert that the providers' performances cannot be ordered a priori [16, 22]. Some might argue that this assumption is not justified because, on average, lower-volume programs have worse results [37]. However, this relationship is relatively weak for CABG surgery, and many lower volume programs including those in Massachusetts have better than average results [38–41]. It has also been argued that hierarchical models, which view the observed provider results as samples from a larger unobserved population, are not appropriate for report cards, where all state or regional data are available. However, although the goal is not to generalize to other unobserved providers, hierarchical models do provide better estimates of future results, which is really the most useful application of report cards for patients [15, 36, 42].

Several studies have reanalyzed available New York cardiac surgery outcome data and have demonstrated how the use of HGLM mitigates apparent (and misleading) interprovider variability and the likelihood of false outlier identification [15, 16, 43]. Localio and associates [33] analyzed Pennsylvania 1991-1992 CABG report card results, which were based on logistic regression, and compared these with results obtained with a hierarchical model. Use of a hierarchical model dramatically reduced false outlier identification while still providing adequate statistical power to detect true outliers, particularly when 2 years of data were used. DeLong and colleagues [42, 44] investigated the impact of different statistical methods for CABG provider profiling, including random effects models, and they also concluded that a mixed-effects model provided the most realistic assessment of provider performance, especially when there were small sample sizes. Austin and colleagues [45] compared the performance of logistic regression and hierarchical models to assess the care of patients with acute myocardial infarction. In general, random-effects (hierarchical) models had greater specificity and positive predictive value than fixed-effect logistic models, whereas the latter had greater sensitivity to detect outliers. However, when the volume of cases for each hospital in this simulation study was arbitrarily raised to 250, a reasonable number for hospital CABG volume, the sensitivity differences between the two models for detecting true outliers diminished. The negative predictive value of being classified a non-outlier was roughly similar for the two models.

Our results demonstrate the potential impact that choice of statistical methodology can make in the results of provider profiling. The "smoothing" effect of the hierarchical model results in shrinkage of individual hospital results toward the overall state mean, and a narrowing of the PI compared with the corresponding logistic regression CI [22]. Results of our prior distribution sensitivity analyses and the value of the random effects standard deviation were reasonable, and they did not suggest that our models produced excessive shrinkage.

We believe the hierarchical approach results in a more accurate estimate of a provider's unobserved true performance, both absolute and relative to their comparison institutions. In our study, logistic regression would have labeled one provider as an outlier, but this result was not confirmed with HGLM. There is always a tradeoff in modeling between sensitivity and specificity. Ultimately, it is a public policy decision as to whether increased sensitivity to detect true outliers or avoidance of false outlier identification is the more important consideration in state report cards [45]. We acknowledge that despite what we consider to be their theoretical superiority, the profiling results obtained with hierarchical models may not always differ significantly from those based upon logistic regression. However, by consistently applying the correctly specified model, the likelihood of accurate results in a variety of scenarios is increased.

The relative lack of variability we observed between results obtained using the Mass-DAC or NY predictor sets is not surprising. Most major cardiac risk models have similar predictors and there is a core set of critical variables that provide most of the predictive power [46, 47].

One incidental observation of interest emerged from this study. The Massachusetts report card demonstrated very low unadjusted CABG mortality, one of the lowest ever publicly reported. Massachusetts has a strict determination-of-need process and a highly academic environment, but before the 2002 report card, there was no mandatory data collection or public reporting of cardiac surgery outcomes. These results confirm the previous report of Ghali and associates [48] that was based on administrative data. They suggest that public report cards may have a valuable role in providing public accountability, but that they are not essential to the provision of the highest quality cardiac surgery care.

We conclude that extreme caution must be employed in calculating and interpreting risk-adjusted outcome results, and that the most appropriate statistical methodology must be employed. For provider profiling, we believe this is HGLM.


    Statistical Appendix
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Statistical Appendix
 Acknowledgments
 References
 
Standard Logistic Regression
For patient j at hospital i:

The logit (log-odds) of the probability of 30-day mortality for patient j with co-morbidity x ij (y ij = 1) is given by:


{2106.05011872.si1}

Estimate and from population data

Then, the probability that the j th patient treated at the i th hospital will die (30-day mortality) is given by:


{2106.05011872.si4}

Observed mortality rate at hospital i (n i coronary artery bypass procedures) =


{2106.05011872.si5}

Expected mortality rate at hospital i (n i coronary artery bypass procedures) =


{2106.05011872.si6}

Risk-adjusted mortality (RAM) = observed mortality rate divided by expected mortality rate, multiplied by the state unadjusted mortality rate. Confidence intervals for the RAM are calculated using a large-sample approximation.

Random Intercept Hierarchical Generalized Linear Model (HGLM)
For patient j at hospital i:

Given patient covariate x ij , fixed coefficient ß 1 and random intercept ß 0i which is normally distributed with mean µ and between-hospital variance {tau}2, then the logit (log-odds) of the probability of 30-day mortality for patient j (y ij = 1) is given by the following equation:


{2106.05011872.si7}

where


{2106.05011872.si8}

The corresponding probability of 30-day mortality for the j th patient treated at the i th hospital:


{2106.05011872.si9}

The smoothed, hospital-specific mortality rate for hospital i with n i coronary bypass procedures, adjusted for patient case-mix and incorporating shrinkage estimators, is given by:


{2106.05011872.si10}

The expected probability of death for patient j (if operated upon at an average Massachusetts hospital) is:


{2106.05011872.si11}

The corresponding expected mortality rate is:


{2106.05011872.si12}

The standardized mortality incidence rate (RAM) is the smoothed, adjusted, hospital-specific mortality rate divided by the expected mortality rate, multiplied by the state unadjusted mortality rate. Probability intervals are calculated via Monte Carlo simulation.


    Acknowledgments
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Statistical Appendix
 Acknowledgments
 References
 
We thank Ann Lovett, RN, MA, Program Manager of Mass-DAC, and Robert Wolf, MA, Biostatistician and Programmer/Analyst, for management and expert programming of the Mass-DAC data. Doctor Normand gratefully acknowledges funding from the Massachusetts Department of Public Health.


    References
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Statistical Appendix
 Acknowledgments
 References
 

  1. Shahian DM, Normand SL, Torchiana DF, et al. Cardiac surgery report cardscomprehensive review and statistical critique. Ann Thorac Surg 2001;72:2155-2168.[Abstract/Free Full Text]
  2. Hannan EL, Kilburn Jr H, O'Donnell JF, Lukacik G, Shields EP. Adult open heart surgery in New York State. An analysis of risk factors and hospital mortality rates JAMA 1990;264:2768-2774.[Abstract/Free Full Text]
  3. Hannan EL, Kilburn Jr H, Racz M, Shields E, Chassin MR. Improving the outcomes of coronary artery bypass surgery in New York State JAMA 1994;271:761-766.[Abstract/Free Full Text]
  4. Grover FL, Johnson RR, Shroyer AL, Marshall G, Hammermeister KE. The Veterans Affairs Continuous Improvement in Cardiac Surgery Study Ann Thorac Surg 1994;58:1845-1851.[Abstract]
  5. O'Connor GT, Plume SK, Olmstead EM, et al. The Northern New England Cardiovascular Disease Study Group A regional prospective study of in-hospital mortality associated with coronary artery bypass grafting JAMA 1991;266:803-809.[Abstract/Free Full Text]
  6. O'Connor GT, Plume SK, Olmstead EM, et al. The Northern New England Cardiovascular Disease Study Group A regional intervention to improve the hospital mortality associated with coronary artery bypass graft surgery JAMA 1996;275:841-846.[Abstract/Free Full Text]
  7. Ferguson Jr TB, Peterson ED, Coombs LP, et al. Use of continuous quality improvement to increase use of process measures in patients undergoing coronary artery bypass graft surgerya randomized controlled trial. JAMA 2003;290:49-56.[Abstract/Free Full Text]
  8. Shahian DM, Blackstone EH, Edwards FH, et al. Cardiac surgery risk modelsa position article. Ann Thorac Surg 2004;78:1868-1877.[Abstract/Free Full Text]
  9. Harrell Jr. FE. Regression modeling strategies with applications to linear models, logistic regression, and survival analysis. New York: Springer-Verlag; 2001.
  10. Harrell Jr FE, Lee KL, Mark DB. Multivariable prognostic modelsissues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med 1996;15:361-387.[Medline]
  11. Steyerberg EW, Eijkemans MJ, Harrell Jr FE, Habbema JD. Prognostic modelling with logistic regression analysisa comparison of selection and estimation methods in small data sets. Stat Med 2000;19:1059-1079.[Medline]
  12. Naftel DC. Do different investigators sometimes produce different multivariable equations from the same data? J Thorac Cardiovasc Surg 1994;107:1528-1529.[Free Full Text]
  13. Normand S-LT. 2005. Available at: www.massdac.org. Accessed June 15, 2005..
  14. Normand S-LT, Glickman ME, Gatsonis CA. Statistical methods for profiling providers of medical careissues and applications. J Am Stat Assoc 1997;92:803-814.
  15. Goldstein H, Spiegelhalter DJ. League tables and their limitationsstatistical issues in comparisons of institutional performance. J R Stat Soc (Series A) 1996;159:385-443.
  16. Marshall EC, Spiegelhalter DJ. Institutional performanceIn: Leyland AH, Goldstein H, editors. Multilevel modelling of health statistics. Chichester: John Wiley & Sons; 2001. pp. 127-142.
  17. Austin PC, Alter DA, Anderson GM, Tu JV. Impact of the choice of benchmark on the conclusions of hospital report cards Am Heart J 2004;148:1041-1046.[Medline]
  18. Romano PS. Peer group benchmarks are not appropriate for health care quality report cards Am Heart J 2004;148:921-923.[Medline]
  19. Grover FL, Shroyer AL, Hammermeister KE. Calculating risk and outcomethe Veterans Affairs database. Ann Thorac Surg 1996;62(Suppl):6-11.
  20. Hosmer DW, Lemeshow S. Confidence interval estimates of an index of quality performance based on logistic regression models Stat Med 1995;14:2161-2172.[Medline]
  21. Spiegelhalter DJ, Abrams KR, Myles JP. Prior distributions Bayesian approaches to clinical trials and health-care evaluation. Chichester: John Wiley & Sons; 2004. pp. 139-180.
  22. Spiegelhalter DJ, Abrams KR, Myles JP. An overview of the Bayesian approach Bayesian approaches to clinical trials and health-care evaluation. Chichester: John Wiley & Sons; 2004. pp. 49-120.
  23. McNeil BJ, Pedersen SH, Gatsonis C. Current issues in profiling quality of care Inquiry 1992;29:298-307.[Medline]
  24. Thomas N, Longford NT, Rolph JE. Empirical Bayes methods for estimating hospital-specific mortality rates Stat Med 1994;13:889-903.[Medline]
  25. Grunkemeier GL, Zerr KJ, Jin R. Cardiac surgery report cardsmaking the grade. Ann Thorac Surg 2001;72:1845-1848.[Free Full Text]
  26. Liang KY, Zeger SL. Regression analysis for correlated data Annu Rev Public Health 1993;14:43-68.[Medline]
  27. Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models Biometrika 1986;73:13-22.[Abstract/Free Full Text]
  28. Carey V, Zeger SL, Diggle P. Modelling multivariate binary data with alternating logistic regressions Biometrika 1993;80:517-526.[Abstract/Free Full Text]
  29. Preisser JS, Arcury TA, Quandt SA. Detecting patterns of occupational illness clustering with alternating logistic regressions applied to longitudinal data Am J Epidemiol 2003;158:495-501.[Abstract/Free Full Text]
  30. Preisser JS, Koch GG. Categorical data analysis in public health Annu Rev Public Health 1997;18:51-82.[Medline]
  31. Larsen K, Petersen JH, Budtz-Jorgensen E, Endahl L. Interpreting parameters in the logistic regression model with random effects Biometrics 2000;56:909-914.[Medline]
  32. Larsen K, Merlo J. Appropriate assessment of neighborhood effects on individual healthintegrating random and fixed effects in multilevel logistic regression. Am J Epidemiol 2005;161:81-88.[Abstract/Free Full Text]
  33. Localio AR, Hamory BH, Fisher AC, TenHave TR. The public release of hospital and physician mortality data in Pennsylvania. A case study Med Care 1997;35:272-286.[Medline]
  34. Dimick JB, Welch HG, Birkmeyer JD. Surgical mortality as an indicator of hospital qualitythe problem with small sample size. JAMA 2004;292:847-851.[Abstract/Free Full Text]
  35. Efron B, Morris C. Data analysis using Stein's estimator and its generalizations J Am Stat Assoc 1975;70:311-319.
  36. Spiegelhalter DJ, Abrams KR, Myles JP. Observational studies Bayesian approaches to clinical trials and health-care evaluation. Chichester: John Wiley & Sons; 2004. pp. 251-265.
  37. Hannan EL, Wu C, Ryan TJ, et al. Do hospitals and surgeons with higher coronary artery bypass graft surgery volumes still have lower risk-adjusted mortality rates? Circulation 2003;108:795-801.[Abstract/Free Full Text]
  38. Rathore SS, Epstein AJ, Volpp KG, Krumholz HM. Hospital coronary artery bypass graft surgery volume and patient mortality, 1998-2000 Ann Surg 2004;239:110-117.[Medline]
  39. Peterson ED, Coombs LP, DeLong ER, Haan CK, Ferguson TB. Procedural volume as a marker of quality for CABG surgery JAMA 2004;291:195-201.[Abstract/Free Full Text]
  40. Shahian DM. Improving cardiac surgery quality—volume, outcome, process? JAMA 2004;291:246-248.[Free Full Text]
  41. Shahian DM, Normand SL. The volume-outcome relationshipfrom Luft to Leapfrog. Ann Thorac Surg 2003;75:1048-1058.[Abstract/Free Full Text]
  42. DeLong ER, Peterson ED, DeLong DM, Muhlbaier LH, Hackett S, Mark DB. Comparing risk-adjustment methods for provider profiling Stat Med 1997;16:2645-2664.[Medline]
  43. Christiansen CL, Morris CN. Improving the statistical approach to health care provider profiling Ann Intern Med 1997;127:764-768.[Abstract/Free Full Text]
  44. Peterson ED, DeLong ER, Muhlbaier LH, et al. Challenges in comparing risk-adjusted bypass surgery mortality resultsresults from the Cooperative Cardiovascular Project. J Am Coll Cardiol 2000;36:2174-2184.[Abstract/Free Full Text]
  45. Austin PC, Alter DA, Tu JV. The use of fixed- and random-effects models for classifying hospitals as mortality outliersa Monte Carlo assessment. Med Decis Making 2003;23:526-539.[Abstract/Free Full Text]
  46. Tu JV, Sykora K, Naylor CD, Steering Committee of the Cardiac Care Network of Ontario Assessing the outcomes of coronary artery bypass graft surgeryhow many risk factors are enough?. J Am Coll Cardiol 1997;30:1317-1323.[Abstract]
  47. Jones RH, Hannan EL, Hammermeister KE, et al. The Working Group Panel on the Cooperative CABG Database Project Identification of preoperative variables needed for risk adjustment of short-term mortality after coronary artery bypass graft surgery J Am Coll Cardiol 1996;28:1478-1487.[Abstract]
  48. Ghali WA, Ash AS, Hall RE, Moskowitz MA. Statewide quality improvement initiatives and mortality after cardiac surgery JAMA 1997;277:379-382.[Abstract/Free Full Text]



This article has been cited by other articles:


Home page
JAMAHome page
D. A. Asch, S. Nicholson, S. Srinivas, J. Herrin, and A. J. Epstein
Evaluating Obstetrical Residency Programs Using Patient Outcomes
JAMA, September 23, 2009; 302(12): 1277 - 1283.
[Abstract] [Full Text] [PDF]


Home page
SEMIN CARDIOTHORAC VASC ANESTHHome page
A. A. Kramer and J. E. Zimmerman
Predicting Outcomes for Cardiac Surgery Patients After Intensive Care Unit Admission
Seminars in Cardiothoracic and Vascular Anesthesia, September 1, 2008; 12(3): 175 - 183.
[Abstract] [PDF]


Home page
J. Thorac. Cardiovasc. Surg.Home page
D. M. Shahian and S.-L. T. Normand
Low-volume coronary artery bypass surgery: Measuring and optimizing performance.
J. Thorac. Cardiovasc. Surg., June 1, 2008; 135(6): 1202 - 1209.
[Full Text] [PDF]


Home page
CirculationHome page
D. M. Shahian and S.-L. T. Normand
Comparison of "Risk-Adjusted" Hospital Outcomes
Circulation, April 15, 2008; 117(15): 1955 - 1963.
[Abstract] [Full Text] [PDF]


Home page
Card Surg AdultHome page
V. A. Ferraris, F. H. Edwards, D. M. Shahian, and S. P. Ferraris
Risk Stratification and Comorbidity
Card. Surg. Adult, January 1, 2008; 3(2008): 199 - 246.
[Full Text]


Home page
CirculationHome page
G. J. Stukenborg, D. P. Wagner, F. E. Harrell Jr, M. N. Oliver, S. W. Heim, A. L. Price, C. K. Han, A. M. D. Wolf, and A. F. Connors Jr
Which Hospitals Have Significantly Better or Worse Than Expected Mortality Rates for Acute Myocardial Infarction Patients?: Improved Risk Adjustment With Present-at-Admission Diagnoses
Circulation, December 18, 2007; 116(25): 2960 - 2968.
[Abstract] [Full Text] [PDF]


Home page
BMJHome page
S. Westaby, N. Archer, N. Manning, S. Adwani, C. Grebenik, O. Ormerod, R. Pillai, and N. Wilson
Comparison of hospital episode statistics and central cardiac audit database in public reporting of congenital heart surgery mortality
BMJ, October 13, 2007; 335(7623): 759 - 759.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
J. A. Suaya, D. S. Shepard, S.-L. T. Normand, P. A. Ades, J. Prottas, and W. B. Stason
Use of Cardiac Rehabilitation by Medicare Beneficiaries After Myocardial Infarction or Coronary Bypass Surgery
Circulation, October 9, 2007; 116(15): 1653 - 1662.
[Abstract] [Full Text] [PDF]


Home page
Arch SurgHome page
E. H. Livingston, A. C. Elliott, L. S. Hynan, and E. Engel
When Policy Meets Statistics: The Very Real Effect That Questionable Statistical Analysis Has on Limiting Health Care Access for Bariatric Surgery
Arch Surg, October 1, 2007; 142(10): 979 - 987.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
D. M. Shahian, T. Silverstein, A. F. Lovett, R. E. Wolf, and S.-L. T. Normand
Comparison of Clinical and Administrative Data Sources for Hospital Coronary Artery Bypass Graft Surgery Report Cards
Circulation, March 27, 2007; 115(12): 1518 - 1527.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
H. M. Krumholz, Y. Wang, J. A. Mattera, Y. Wang, L. F. Han, M. J. Ingber, S. Roman, and S.-L. T. Normand
An Administrative Claims Model Suitable for Profiling Hospital Performance Based on 30-Day Mortality Rates Among Patients With an Acute Myocardial Infarction
Circulation, April 4, 2006; 113(13): 1683 - 1692.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
H. M. Krumholz, Y. Wang, J. A. Mattera, Y. Wang, L. F. Han, M. J. Ingber, S. Roman, and S.-L. T. Normand
An Administrative Claims Model Suitable for Profiling Hospital Performance Based on 30-Day Mortality Rates Among Patients With Heart Failure
Circulation, April 4, 2006; 113(13): 1693 - 1701.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Author home page(s):
David M. Shahian
David F. Torchiana
Richard J. Shemin
James D. Rawn
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Shahian, D. M.
Right arrow Articles by Normand, S.-L. T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shahian, D. M.
Right arrow Articles by Normand, S.-L. T.
Related Collections
Right arrow Professional affairs


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
ANN THORAC SURG ASIAN CARDIOVASC THORAC ANN EUR J CARDIOTHORAC SURG
J THORAC CARDIOVASC SURG ICVTS ALL CTSNet JOURNALS