|
|
||||||||
Ann Thorac Surg 2004;78:1868-1877
© 2004 The Society of Thoracic Surgeons
a Lahey Clinic, Burlington, Massachusetts, USA
b Cleveland Clinic Foundation, Cleveland, Ohio, USA
c University of Florida, Jacksonville, Florida, USA
d University of Colorado HSC, Denver, Colorado, USA
e Providence Health System, Portland, Oregon, USA
f University of Alabama, Birmingham, Alabama, USA
g Papworth Hospital, Cambridge, United Kingdom
h Dartmouth-Hitchcock Medical Center, Lebanon, New Hampshire, USA
i Duke Clinical Research Institute, Durham, North Carolina, USA
* Address reprint requests to Dr Shahian, Department of Thoracic and Cardiovascular Surgery, Lahey Clinic, 41 Mall Rd, Burlington, MA 01805 (E-mail: david.m.shahian{at}lahey.org).
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
Although researchers have utilized risk models for many years, their broader applicability and critical importance were more fully appreciated after the 1986 release of unadjusted hospital outcome data by the Health Care Financing Administration ([HCFA], now named the Centers for Medicare and Medicaid Services [CMS]). Providers correctly argued that such data did not account for patient severity, and this led directly to the development of a number of high quality clinical databases and risk models, especially in cardiac surgery. Coronary artery bypass grafting (CABG) has been a particular focus of such research, not only because of the desire of cardiac surgeons to improve patient outcomes, but also because regulators and insurers have sought greater control over this high-profile, costly, and frequently performed procedure.
Some of the original cardiac surgery databases were voluntary, such as the Northern New England Cardiovascular Disease Study Group [9], and some were mandated by state or federal law (such as the NY [10], NJ [11], and Veterans Affairs Administration [12] databases). Soon after the HCFA release of unadjusted outcome data in 1986, The Society of Thoracic Surgeons (STS) established an Ad Hoc Committee on Risk Factors for Coronary Artery Bypass Surgery [13]. Subsequently, a committee under the direction of Dr Richard Clark began work on the development of The STS National Cardiac Database (STS NCD). This database was formally established in 1989 and the software was released to STS members in 1990 [1416]. During the subsequent 13 years, this has evolved to become one of the largest single specialty databases in the world, containing data on more than 2.4 million patients from 60% of United States cardiac programs.
In this review, we present some fundamental aspects of risk model development and validation, the current uses and limitations of such models in cardiac surgery (with special attention to CABG), and prospects for the future. A statistical Appendix is provided to explain less familiar terms and concepts.
| Data |
|---|
|
|
|---|
Core Variables
Griffith and associates [20] found that critical clinical variables known to be associated with mortality (eg, ejection fraction, emergency procedures) were not included in the first Pennsylvania CABG database, leading to problematic accuracy. Hannan and associates compared models based on the New York clinical cardiac surgery database (CSRS) with models derived from the New York administrative database (SPARCS) [21] and with the Health Care Financing Administration MEDPAR database [17]. In both instances, models derived from the clinical database were found to provide superior performance. Accuracy of the models based on administrative data were improved substantially by the addition of a few critical clinical variables (ejection fraction, reoperation, left main obstruction).
Tu and associates [4] have determined a limited set of six core variables (age, gender, acuity, reoperation, left main coronary obstruction, ejection fraction) beyond which they believe there is little incremental improvement in model performance. Similarly, Jones and associates [22] from the Cooperative CABG Database Project identified seven core predictor variables (age, gender, left ventricular function, acuity, left main disease, reoperation, number of diseased vessels). They found that acuity, reoperation, and age accounted for the majority of predictive information in most CABG databases. In the STS NCD, 78% of the explained variance from the entire 28 variable model is derived from the eight most important predictors (ie, age, surgical acuity, reoperative status, creatinine level, dialysis, shock, chronic lung disease, and ejection fraction).
All studies confirm that the gold standard for data is a specialty-specific, prospectively maintained clinical database such as the STS NCD. Such databases should contain, at the minimum, a core set of variables that has been demonstrated to be associated with outcome.
Definitions
Certain caveats regarding data apply generally to all regression models including those used for cardiac surgery. One of the most important is strict standardization of definitions, both for predictor variables and for endpoints. Even for a seemingly unambiguous endpoint like mortality, there are important statistical and policy implications of using (1) in-hospital mortality, regardless of when it occurs, (2) 30-day all-cause mortality, regardless of where it occurs, and (3) operative mortality, defined as either (1) or (2). A fixed time period is statistically preferable [1], although more difficult to obtain than in-hospital mortality from databases that are not payer-based. Osswald and colleagues [23, 24] have studied the implications of different definitions of "early" post-CABG mortality, especially in light of advancements in postoperative care. Particularly for higher risk patients in which the early postoperative phase may be prolonged, these investigators assert that the true early mortality will be underestimated unless a complete tally of deaths and their time of occurrence are compiled for the first 6 months after surgery.
Quality
Whenever practical, continuous data should be used as such to avoid the arbitrariness and loss of valuable information that occurs with categorization [25]. However there are some instances in which such categorization may be useful if the goal is to identify a case, such as patients with morbid obesity, renal insufficiency, or severe carotid stenosis. This is most applicable when the definition of this categorical state is well-defined and broadly accepted. Transformations of the measurement scale may be required for the values of the variables to be commensurate with the assumptions of the risk factor model being used (eg, so-called linearizing transformations). Data entry software should contain internal quality controls for out of range, inconsistent, contradictory, and missing data. Ideally, values for missing data should be substituted using multiple imputation techniques [26, 27]. Institutions should receive periodic reports regarding their data quality including any anomalies in their data recording compared with regional and national averages. All these features are included in the STS NCD. Particularly in situations where risk-adjusted outcomes are used to assess provider performance, there should be regular independent auditing of the data to assure accuracy and completeness.
| Model Development |
|---|
|
|
|---|
Three principal techniques have been utilized for the construction of cardiac surgery risk models. Bayesian models were used initially for the STS NCD because they are robust with regard to missing data, an important problem in the early database experience. As data completeness improved, logistic models were substituted in 1995 [31, 32]. Logistic regression models continue to be the most common statistical technique for cardiac surgery risk modeling, not only for the STS NCD but also for those developed in New York [33], the Veterans Affairs Administration [34], and the Northern New England Cardiovascular Disease Study Group [35]. Some groups have used simple, additive scores with weights derived from the logistic regression model [11, 36]. Comparative studies [37] have generally demonstrated that logistic models offer the best overall performance.
Some have suggested that the next major advance in model performance would come from the application of algorithmic models, sometimes called machine-learning techniques [38], of which artificial neural networks are an example [39]. These models permit complex, nonlinear information processing, thus avoiding one of the constraints of logistic models. However, two studies [40, 41], one using 80,606 patients from the STS NCD [40], failed to demonstrate any significant improvement over logistic or Bayesian models. Potential disadvantages of machine-learning techniques include the large amount of data required for model development, and the tendency of some methods to overfit the data and others to perform too conservatively. Furthermore, just as with traditional statistical models, algorithmic techniques generally model past data better than they predict future events.
Strategies for logistic regression model development include data reduction techniques and variable selection, interaction terms and transformations of variables where appropriate, imputation of missing data, verification of model assumptions, and model validation [8, 42, 43]. One fundamental and still controversial question is the number of variables to include in a risk model. An excessive number of covariates may lead to some predictors having statistical significance but not biological or clinical relevance, over-fitting of the model, numerical instability, and increased cost and difficulty of data collection [4, 8, 42, 43]. Harrell and colleagues [8, 42] have recommended that the number of covariates considered for inclusion in such models be less than one-tenth the number of end points observed in the data set (percent occurrence, x sample size, for early events), which typically requires some data reduction technique to decrease the number of candidate variables. Univariate screening of candidate covariates followed by forward or backward stepwise selection is commonly used for this purpose. However some statisticians including Harrell [8] have criticized this variable selection technique on theoretical grounds, and research continues on the relative merits of parsimonious models versus those that retain most or all potential predictors [29, 44, 45]. The use of machine-learning variable selection methods such as bagging (bootstrap aggregating) is also gaining popularity [46, 47].
Finally, most studies demonstrate that models have inferior performance when applied to patient groups other than the one from which they were developed. Ivanov and colleagues [48] found that institution or region-specific custom models performed best, followed by recalibrated models using the covariate set from a ready-made model, but with different weights. Applying unmodified off-the-shelf models to other groups of patients provided the least satisfactory performance. In contrast, Nashef and colleagues [49] recently reported that the additive EuroSCORE cardiac surgery risk-prediction model functioned well when applied to North American populations.
| Model Validity |
|---|
|
|
|---|
The predictive validity of a model is a measure of how well it performs on a data set other than the one from which it was developed. This test data may be internal or external. In the former, only the original data set is used. Numerous techniques are available to segment the original data including simple data splitting (the whole data set is randomly and only once split into development and validation or test subsets) and more sophisticated cross-validation techniques (eg, leave-one-out cross validation and K-fold cross validation) that use repeated resampling from the original data set [8, 50]. Harrell [8] has recommended using the entire data set for model development, then validating by using another technique such as bootstrapping. External validation uses a completely new data set to validate the model, perhaps originating from a different state, country, or hospital.
Regardless of whether internal or external test data are used, certain tests are commonly used to assess how well the model fits the data. Two of the most common are calibration (reliability) and discrimination (resolution). Calibration assesses the extent to which the model assigns appropriate risk to the population under consideration. It answers the question: in 100 patients with the same estimated mortality risk (R%) as mine, would the observed number of deaths equal R? Calibration may be measured by a number of techniques including the Pearson
2 statistic, the unweighted residual sum-of-squares, or the Murphy decomposition of the Brier mean probability score [43, 51]. The most commonly used measure of calibration is the degree of concordance between deciles of observed and expected risk, the Hosmer-Lemeshow test [43]. Most CABG risk models are well calibrated overall but may produce estimates that are too extreme in the lowest and especially the highest risk subsets of patients, particularly in small data sets [45, 52]. Shrinkage of regression coefficients may provide more accurate and realistic prediction for future patients [8, 45, 52, 53].
Discrimination is the more demanding test and measures the tradeoff between the specificity and sensitivity of the risk model at various probability cut points (ie, what probability do you require to assign the patient to a particular outcome category?). It asks the question: how well does the model separate patients who die from those who survive? This may be interpreted as the percentage of discordant (meaning one death and one survivor) patient-pairs for which the model predicts a higher mortality risk for the patient who actually dies [43, 54, 55]. This percentage is equal to the area under the receiver operating characteristic curve (ROC) [43, 55] and is called the c-index or c-statistic [8, 42]. Unfortunately, most CABG risk models have only moderate discrimination as measured by the c-index. This has significant implications for their use in individual patient counseling [54, 56]. Risk models may accurately predict that 3 of 100 patients with a given set of risk factors will die postoperatively, but they cannot identify which 3.
The similar and limited discrimination of most CABG risk models (VA, Northern New England Cardiovascular Disease Study Group, STS, NY) suggests that we still await the quantum improvement in risk prediction anticipated by Steen [39] more than a decade ago. The performance of current models is limited by as yet unknown predictors, difficulty in measuring or representing certain complex clinical states, random catastrophic events, such as a serious protamine reaction or sudden hemorrhage, and similar occurrences that are rare in the population but important in the individual patient [22, 40, 5759]. Because known patient characteristics will never explain all the variances in cardiac surgery outcomes, the performance of any risk model has inherent limitations.
| Uses of Risk Models |
|---|
|
|
|---|
A second use for risk models is the development of tools that aid in everyday patient management. These would include patient care algorithms or critical pathways scientifically based on risk-adjusted studies [58]. Risk models may be used to facilitate individual patient counseling or as a decision support tool for clinicians choosing between different interventions (eg, coronary artery bypass vs percutaneous angioplasty). Pocket cards (Northern New England Cardiovascular Disease Study Group) [70] or handheld computers [71] have been used to generate bedside risk estimates for individual patients. Obviously, in order to properly apply such methods, values for each risk factor in the model must be available. Because of the modest discrimination of most risk models, which limits the accuracy of individual patient prediction, it is recommended that such information not be presented to patients simply as probability point estimates. Rather, these estimates should be accompanied by confidence limits that demonstrate their uncertainty.
One of the most common uses of risk models is to compare provider performance. This is statistically challenging from the outset because of the low incidence of the binary outcome (operative mortality) as well as the highly variable and often small sample sizes from different providers. Typically, as in New York State, profiling has been achieved by aggregating the probabilities of death of each patient treated by a given provider based on the results of logistic modeling. This aggregate predicted mortality is used together with the observed provider mortality to construct a ratio of observed to expected mortalities (O/E). However, modern research in provider profiling has demonstrated potential deficiencies with this approach. Standard techniques may not accurately reflect the unobserved true mortality of low volume providers, and they do not adequately account for clustering (nonrandom allocation) of observations within providers (such as the prevalence of heart failure patients at transplant centers [58]). The net effect may be an underestimation of random interprovider variability, an overestimation of systematic interprovider variability, and an increased likelihood of falsely classifying a provider as an outlier. Hierarchical or multilevel models have been designed to address these concerns and some advocate their use for profiling whenever feasible [44, 7277].
Whatever method is used for provider profiling, identification of statistical outliers is only a starting point for further analysis [58, 78]. All risk models have inherent limitations, and the results obtained from different models and by different statisticians may vary [2830, 52, 74]. The results derived from any risk model should be regarded as one element to be considered in conjunction with other traditional indices of competent surgical care. The Society of Thoracic Surgeons firmly holds that those surgical programs identified as statistical outliers should not be arbitrarily labeled as substandard. The coding practices of outlying providers and each individual mortality should be carefully analyzed, and structural causes should be sought to explain any truly aberrant results [12, 7982].
Finally, risk-adjusted outcomes have also been used in northern New England [83] and Minnesota [84] as the basis for confidential continuous quality improvement activities, and in the Veterans Affairs Administration for both confidential monitoring of performance and continuous quality improvement [12, 80]. Here the main goal is not public accountability but rather provider-initiated determination of best practice, benchmarking, and regional or system-wide improvement. This has resulted in mortality reduction that appears comparable with that achieved using public report cards [85]. The Society of Thoracic has implemented process improvement initiatives based on analyses derived from the STS NCD [64].
| Limitations and Disadvantages of Risk Models |
|---|
|
|
|---|
From a health policy perspective, some argue that current risk models place too much emphasis on mortality as the sole endpoint, which may ultimately decrease access to surgery for those who might benefit most (high-risk case avoidance) [57, 72, 87, 88]. For similar reasons, such models may encourage gaming of the reporting system when used for provider profiling [72]. Emphasis on outcome endpoints such as mortality may deflect attention from important process and structural aspects of care. Finally, challenges common to all cardiac surgery databases include their cost (especially at a time when revenues from heart surgery are declining) and privacy issues related to the HIPAA.
| Current Status and Future Direction |
|---|
|
|
|---|
Although it is already the dominant cardiac surgical database in the world, it is the goal of the STS to achieve 100% participation of all cardiac surgery providers in the United States. Along with increased efforts to validate the accuracy of the submitted data, this will eliminate any remaining concerns about the voluntary nature of the database, although generally there has been a close correlation between the results obtained from the voluntary databases (STS NCD, Northern New England Cardiovascular Disease Study Group) and mandatory databases (VA, NY) [78, 89]. Recent studies matching STS NCD results with those from the Centers for Medicare and Medicaid Services claims data suggest substantial agreement in both completeness and observed mortality. The STS NCD should become the dominant source of accurate information for the federal and state governments regarding outcome and reimbursement issues, and there is also the opportunity to partner with the industry in assessing new technologies.
A systematic review of the STS NCD was begun in 1997 [16, 78]. The STS Definitions Committee worked with the American College of Cardiology to eliminate unimportant variables and to develop a new data dictionary. The original 512 fields were reduced to 217 core fields and 255 extended fields. Discussions continue among representatives of the major cardiac surgery and cardiology databases to resolve inconsistent data definitions.
The Duke Clinical Research Institute became the data analysis and warehouse center for the STS NCD beginning in 1998. In addition to their sophisticated resources for data cleaning and verification, the Duke Clinical Research Institute provides detailed semiannual reports comparing individual programs with their region and the overall STS national data set. Regular national meetings of hospital data managers have substantially enhanced uniformity of coding practices and completeness.
From models that focused primarily on CABG mortality, the STS NCD has evolved to a family of related risk models for prediction of outcomes after valve replacement (with or without CABG) [90, 91], congenital heart surgery [92], and general thoracic surgery. There is an increased awareness of the importance of endpoints other than operative mortality, including perioperative morbidity and length of stay [15, 16, 78, 93, 94]. Postoperative complications and readmissions, which are delayed complications, appear to measure different and complementary aspects of care compared with hospital mortality [95, 96], and not all measures of outcome are equally indicative of program quality. Future emphasis will also include documentation of long-term follow-up data, functional status and quality of life measures, and the relationship of clinical factors to hospital costs. Finally, there will be an increasing emphasis on process measures (eg, internal mammary artery and ß blocker use) to assess and improve provider performance [58, 64, 78]. This approach may offer the greatest potential to enhance the results of all cardiac surgery programs and to reduce interprovider variability. It also diminishes our reliance on sophisticated, yet still imperfect, methods of risk-adjustment.
Cardiac surgery remains at the forefront of risk model development and clinical quality monitoring. With advancements in statistical methodology, expanding enrollment in major databases such as the STS NCD and European databases, and with the firm commitment of cardiac surgeons, our profession will maintain its leadership in this vital area of health care.
| Appendix |
|---|
Bootstrap Technique
Efron introduced the bootstrap technique in 1979 as a general method for estimating standard errors and confidence intervals of any test statistic [50]. This method relies on repeated bootstrap samples, each of which involves re-sampling with replacement from the original data set. All samples will have the same number of observations as the original, but in each bootstrap sample any of the original observations may be included once, more than once, or not at all. The standard deviation of the bootstrap replicates is taken as the standard error of the data. The bootstrap technique has found many other applications, including its use in risk factor identification [47]. In this technique, known as bagging or bootstrap aggregation, analyses are performed on all bootstrap data sets and the results are aggregated.
Brier Mean Probability Score
The Brier mean probability score was a formula described by Brier in 1950 to assess the predictive accuracy of weather forecasting, but it is equally applicable to any situation with dichotomous outcomes such as postoperative mortality [29]. The Brier score varies between 0 and 1, with smaller values representing better performance and a score of 0 indicating perfect prediction. The Brier score may be decomposed into components that assess different aspects of predictive accuracy, including calibration and resolution.
Hierarchical Models
Many real-life situations are inherently multilevel or hierarchical [7375, 77, 97, 98]. In medical studies, patients are grouped by physicians and physicians are grouped within hospitals. In a two-level model, the first or lower level (eg, patient) is sometimes referred to as the micro level, and the second or higher level (eg, surgeon or hospital) is the macro level, group, context, or cluster.
Often there is interest in understanding the interaction between the several levels of such a hierarchy, or in evaluating macro-level performance based on micro-level data. For example, in provider profiling, quality at the group level (physician or hospital) has often been assessed through the use of aggregated, risk-adjusted patient outcomes. However, such traditional approaches ignore an important aspect of multilevel structure, namely that subjects within a group or cluster are more similar to each other than they are to subjects in other clusters. For example, patients treated at a heart failure center are more similar to each other than they are to patients at an institution without this special interest. Such clustering may also exist when longitudinal studies are conducted using repeated observations from the same subjects. In either situation, intraclass correlation or clustering effectively reduces the amount of independent data available, increases the standard errors of estimates, and if not accounted for may exaggerate the significance of performance differences between groups.
Through the use of hierarchical models (also called random effects models, mixed models, or random coefficient regression models), the amount of random and systematic variation between groups can be more accurately partitioned, more appropriate and conservative estimates of between-group variability are obtained, and the impact of specific macro-level characteristics can be better assessed. Furthermore, these models improve the estimates from groups with relatively smaller numbers of observations by shrinking their results closer to the mean of the remaining groups.
Jackknife
From a data set with n data points, n-jackknife samples are derived, each of which has exactly one observation deleted. From these, estimates of biases and standard errors can be obtained. The jackknife was the predecessor of the bootstrap. It is computationally simpler and may be thought of as an approximation to the bootstrap [50].
K-Fold Cross Validation
K-fold cross validation is similar to leave-one-out cross validation. The original data are randomly divided into k groups of equal or roughly equal size. A model is developed from k-1 groups and its accuracy is assessed using the excluded group. The process is repeated k times and the predictive accuracy is averaged [8, 50].
Leave-One-Out Cross Validation
For a data set with n observations, training is conducted on n different subsets of data, each of which has one data point left out. Each excluded observation is predicted by the model obtained from the remaining data points, and the average predictive accuracy of the models is determined [8, 50].
Logistic Regression Model
The logistic regression model is a type of multivariable model in which, unlike standard regression analysis, the dependent or outcome variable is qualitative [43, 99]. In medical applications the most common example is binary logistic regression to estimate the probability of a dichotomous outcome such as mortality. Given a particular combination of predictor variables, the probability of event occurrence is constrained by the logistic equation to the range of 0 to 1. Rearrangement of the logistic equation demonstrates that the linear combination of predictors and their regression coefficients is equal to the logit or log odds (the logarithm of the odds of the outcome). Exponentiation of the logit to the base e yields the odds of the outcome.
Odds and Odds Ratio
The odds of an event are the probability of its occurrence divided by the probability that it will not occur. The odds ratio is the ratio of one odds to another [8, 43, 99]. In the case of a binary logistic regression model for mortality, the odds ratio for a given predictor variable would be the ratio of the mortality odds in the presence of the predictor divided by the odds of mortality in its absence. This is conveniently obtained by exponentiation of the regression coefficient of the predictor variable to the base e. Medical outcome studies typically cite the odds ratios for a number of significant predictor variables along with their 95% confidence intervals.
Overfitting
Overfitting implies an excessively complex model with too many parameters to be estimated from the available data [8]. The resulting model will appear to have an extremely good fit to the training set. However it will generalize poorly to subsequent test samples and have limited ability to predict future events.
Propensity Score
The propensity score is a method to account for selection factors in nonrandomized, observational studies [6, 7]. A logistic regression model is developed using treatment received as the outcome variable. Using this regression equation, the propensity of each patient to receive the particular treatment is calculated. By grouping patients based on their propensity scores (often stratified by quintiles), it is usually possible to obtain treatment and nontreatment groups that are well matched with regard to most predictor variables, although unbalanced with regard to number of subjects. At this point, further comparative analyses between treatment and nontreatment groups can be performed.
Receiver Operating Characteristic (ROC) Curve
The receiver operating characteristic curve was developed in the early days of radar imaging to help separate signals from noise [8, 43, 55]. During the past two decades this technique has been used as a visual method of assessing two naturally competing characteristics of any diagnostic test, sensitivity, and specificity. The receiver operating characteristic curve depicts the specificity and sensitivity of the diagnostic test or predictive model at various cut points (the probability that a researcher requires to assign a patient to the event category). The area under the receiver operating characteristic curve is also called the c-index or c-statistic. It may also be thought of as the percentage of all possible discordant patient pairs (one patient has an event and the other does not) for which the model predicts a higher event probability for the patient who actually has the event. An receiver operating characteristic curve area of 0.5 is no better than a coin toss, whereas an area of 1.0 is perfect discrimination.
Transformations
Transformations are changes in the scale of measurement of a variable during model development. Common reasons for doing this include (1) variance stabilization, (2) linearization, (3) normalization, (4) to simplify handling of the data, and (5) to enable more appropriate presentation of the results [100].
Underfitting
Underfitting is the failure to include one or more important predictor variables, which may lead to poor predictive accuracy.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
D. M. Shahian, S. M. O'Brien, G. Filardo, V. A. Ferraris, C. K. Haan, J. B. Rich, S.-L. T. Normand, E. R. DeLong, C. M. Shewan, R. S. Dokholyan, et al. The Society of Thoracic Surgeons 2008 Cardiac Surgery Risk Models: Part 1--Coronary Artery Bypass Grafting Surgery Ann. Thorac. Surg., July 1, 2009; 88(1_Supplement): S2 - S22. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Shahian, S. M. O'Brien, G. Filardo, V. A. Ferraris, C. K. Haan, J. B. Rich, S.-L. T. Normand, E. R. DeLong, C. M. Shewan, R. S. Dokholyan, et al. The Society of Thoracic Surgeons 2008 Cardiac Surgery Risk Models: Part 3--Valve Plus Coronary Artery Bypass Grafting Surgery Ann. Thorac. Surg., July 1, 2009; 88(1_Supplement): S43 - S62. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Brunelli, R. G. Berrisford, G. Rocco, G. Varela, and on behalf of the European Society of Thoracic Surg The European Thoracic Database project: composite performance score to measure quality of care after major lung resection Eur. J. Cardiothorac. Surg., May 1, 2009; 35(5): 769 - 774. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Filardo, C. Hamilton, R. F. Hebeler Jr, B. Hamman, and P. Grayburn New-Onset Postoperative Atrial Fibrillation After Isolated Coronary Artery Bypass Graft Surgery and Long-Term Survival Circ Cardiovasc Qual Outcomes, May 1, 2009; 2(3): 164 - 169. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Bonaros, D. Hennerbichler, G. Friedrich, A. Kocher, O. Pachinger, G. Laufer, and J. Bonatti Increased mortality and perioperative complications in patients with previous elective percutaneous coronary interventions undergoing coronary artery bypass surgery J. Thorac. Cardiovasc. Surg., April 1, 2009; 137(4): 846 - 852. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Shahian and S.-L. T. Normand Comparison of "Risk-Adjusted" Hospital Outcomes Circulation, April 15, 2008; 117(15): 1955 - 1963. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. van Gameren, A. P. Kappetein, E. W. Steyerberg, A. C. Venema, E. A.J. Berenschot, E. L. Hannan, A. J.J.C. Bogers, and J. J.M. Takkenberg Do We Need Separate Risk Stratification Models for Hospital Mortality After Heart Valve Surgery? Ann. Thorac. Surg., March 1, 2008; 85(3): 921 - 930. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. A. Ferraris, F. H. Edwards, D. M. Shahian, and S. P. Ferraris Risk Stratification and Comorbidity Card. Surg. Adult, January 1, 2008; 3(2008): 199 - 246. [Full Text] |
||||
![]() |
G. Filardo, C. Hamilton, B. Hamman, and P. Grayburn Obesity and Stroke After Cardiac Surgery: The Impact of Grouping Body Mass Index Ann. Thorac. Surg., September 1, 2007; 84(3): 720 - 722. [Full Text] [PDF] |
||||
![]() |
C. A. Mestres, M. A. Castro, E. Bernabeu, M. Josa, R. Cartana, J. L. Pomar, J. M. Miro, J. Mulet, and the Hospital Clinico Endocarditis Study Group Preoperative risk stratification in infective endocarditis. Does the EuroSCORE model work? Preliminary results Eur. J. Cardiothorac. Surg., August 1, 2007; 32(2): 281 - 285. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Shahian, F. H. Edwards, V. A. Ferraris, C. K. Haan, J. B. Rich, S.-L. T. Normand, E. R. DeLong, S. M. O'Brien, C. M. Shewan, R. S. Dokholyan, et al. Quality Measurement in Adult Cardiac Surgery: Part 1--Conceptual Framework and Measure Selection Ann. Thorac. Surg., April 1, 2007; 83(4_Supplement): S3 - S12. [Full Text] [PDF] |
||||
![]() |
D. M. Shahian, T. Silverstein, A. F. Lovett, R. E. Wolf, and S.-L. T. Normand Comparison of Clinical and Administrative Data Sources for Hospital Coronary Artery Bypass Graft Surgery Report Cards Circulation, March 27, 2007; 115(12): 1518 - 1527. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Bergsland, E. Kabil, E. Mujanovic, I. Terzic, J. Roislien, J. L. Svennevig, and E. Fosse Training of Cardiac Surgeons for Bosnia and Herzegovina: Outcomes in Coronary Bypass Grafting Surgery Ann. Thorac. Surg., February 1, 2007; 83(2): 462 - 467. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Brunelli, N. J. Morgan-Hughes, M. Refai, M. Salati, A. Sabbatini, and G. Rocco Risk-adjusted morbidity and mortality models to compare the performance of two units after major lung resections J. Thorac. Cardiovasc. Surg., January 1, 2007; 133(1): 88 - 96. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. W. Steyerberg, B. A. Neville, L. B. Koppert, V. E.P.P. Lemmens, H. W. Tilanus, J.-W. W. Coebergh, J. C. Weeks, and C. C. Earle Surgical Mortality in Patients With Esophageal Cancer: Development and Validation of a Simple Risk Score J. Clin. Oncol., September 10, 2006; 24(26): 4277 - 4284. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. D. Berrizbeitia Invited commentary Ann. Thorac. Surg., June 1, 2006; 81(6): 2088 - 2088. [Full Text] [PDF] |
||||
![]() |
H. M. Krumholz, R. G. Brindis, J. E. Brush, D. J. Cohen, A. J. Epstein, K. Furie, G. Howard, E. D. Peterson, S. S. Rathore, S. C. Smith Jr, et al. Standards for Statistical Models Used for Public Reporting of Health Outcomes: An American Heart Association Scientific Statement From the Quality of Care and Outcomes Research Interdisciplinary Writing Group: Cosponsored by the Council on Epidemiology and Prevention and the Stroke Council Endorsed by the American College of Cardiology Foundation Circulation, January 24, 2006; 113(3): 456 - 462. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Shahian, D. F. Torchiana, R. J. Shemin, J. D. Rawn, and S.-L. T. Normand Massachusetts Cardiac Surgery Report Card: Implications of Statistical Methodology Ann. Thorac. Surg., December 1, 2005; 80(6): 2106 - 2113. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ANN THORAC SURG | ASIAN CARDIOVASC THORAC ANN | EUR J CARDIOTHORAC SURG |
| J THORAC CARDIOVASC SURG | ICVTS | ALL CTSNet JOURNALS |