ATS
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Ann Thorac Surg 2010;89:139-146. doi:10.1016/j.athoracsur.2009.08.058
© 2010 The Society of Thoracic Surgeons

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Author home page(s):
Karl F. Welke
Tara Karamlou
Ross M. Ungerleider
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Welke, K. F.
Right arrow Articles by Diggs, B. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Welke, K. F.
Right arrow Articles by Diggs, B. S.
Related Collections
Right arrow Congenital - cyanotic
Right arrow Professional affairs


J. Maxwell Chamberlain Memorial Paper for Congenital Heart Surgery

Mortality Rate Is Not a Valid Indicator of Quality Differences Between Pediatric Cardiac Surgical Programs

Karl F. Welke, MD, MSa,*, Tara Karamlou, MD, MSb, Ross M. Ungerleider, MD, MBAc, Brian S. Diggs, PhDa

a Division of Cardiothoracic Surgery, Department of Surgery, Oregon Health and Science University, Portland, Oregon
b Division of Cardiac Surgery, Department of Surgery, University of Michigan School of Medicine, Ann Arbor, Michigan
c Division of Pediatric Cardiothoracic Surgery, Department of Surgery, Case Western Reserve University, Cleveland, Ohio

Accepted for publication August 20, 2009.

* Address correspondence to Dr Welke, Division of Cardiothoracic Surgery L353, Oregon Health and Science University, 3181 SW Sam Jackson Park Rd, Portland, OR 97239-3098 (Email: welkek{at}ohsu.edu).

Presented at the Forty-fifth Annual Meeting of The Society of Thoracic Surgeons, San Francisco, CA, Jan 26–28, 2009. Winner of the J. Maxwell Chamberlain Memorial Award for Congenital Heart Surgery.


    Abstract
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Discussion
 References
 
Background: In order to detect statistically relevant differences in mortality rates, it is essential to have adequate sample sizes and event rates. Our hypothesis is that case volumes and mortality rates present in pediatric cardiac surgery are too low to allow the use of mortality to differentiate between hospitals.

Methods: Pediatric cardiac surgical operations performed at U.S. hospitals were identified in the Nationwide Inpatient Sample (NIS) Database 2000 to 2005 (21,709 operations from 161 hospitals). Hospital annual surgical volumes and in-hospital mortality rates for Risk Adjustment for Congenital Heart Surgery, version 1 (RACHS-1) categories and select individual operations were calculated. The actual case volumes were compared with thresholds necessary to detect a doubling and a 5 percentage point increase in the mortality rate.

Results: No hospital had a sufficient annual case volume to determine a doubling of or 5 percentage point increase in mortality for any individual operation and a minority (0% to 5.6%) had sufficient volume to detect these differences for specific RACHS-1 categories. Minimum hospital case volumes needed to detect a doubling of mortality from a benchmark ranged from 11 for RACHS-1 category 5 to 2,935 for RACHS-1 category 1. Minimum case volumes necessary to detect a 5 percentage point difference in mortality between two hospitals ranged from 173 for RACHS-1 category 1 to 1,483 for RACHS-1 category 5. Five hundred twenty-five patients were needed to detect a doubling of overall hospital mortality rate compared with another hospital. Only 1.6% (n = 4) of hospitals met this minimum caseload.

Conclusions: Pediatric cardiac surgery operations are either performed too infrequently or have mortality rates that are too low to allow valid hospital quality comparisons to be based on mortality.


    Introduction
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Discussion
 References
 
In-hospital mortality is the most frequently cited measure of quality for pediatric cardiac surgery programs. Several states report mortality rates after adult cardiac surgery and some are beginning to report mortality rates after pediatric cardiac surgery as well [1]. The Leapfrog Group [2] and the National Quality Forum [3] have endorsed surgical mortality as a marker of quality. The Agency for Healthcare Research and Quality [4] lists mortality as a quality indicator for both adult and pediatric cardiac surgery. The Society of Thoracic Surgeons includes hospital level mortality rate comparisons in their National Congenital Cardiac Surgery Database Report [5]. In addition, individual hospitals are publicizing their mortality rates and comparing them to other institutions and national means.

While adult cardiac surgery was the initial focus of public reporting efforts, pediatric cardiac surgery is increasingly being included. However, there are important differences between the two specialties that make comparisons of pediatric cardiac surgery mortality rates problematic. In order to detect statistically relevant differences, adequate sample sizes and event rates are essential [6]. Pediatric cardiac surgical operations are performed relatively infrequently at individual hospitals and in general the mortality rates are low. The combination of small sample sizes and low event rates limits the statistical power of a comparison of an individual hospital's mortality rate to either a mean or benchmark or to that of another hospital. As a result, important differences may not be identified; a type II error.

Our hypothesis for this investigation is that pediatric cardiac surgical case volumes and mortality rates are too low to allow the use of mortality to differentiate between hospitals. We will obtain actual hospital annual case volumes and national mean mortality rates from a national administrative database and test our hypothesis using both 1-tailed and 2-tailed tests.


    Material and Methods
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Discussion
 References
 
This study was designed as a retrospective cohort analysis. The Oregon Health and Science University Institutional Review Board approved this study. Due to the nature of the study, patient consent was not required. We obtained data from the Nationwide Inpatient Sample (NIS) [7]. The NIS is the largest all-payer inpatient care database in the United States. The database is a stratified, cross-sectional sample that includes approximately 20% of all community (non-federal) hospital discharges in the United States. The NIS data are available from 1988 to 2005, over which time the number of states in the NIS has grown from 8 to 37. In 2005, the database contained discharge data on approximately 8 million hospital stays at 1,054 hospitals in 37 states. The sampling frame for the 2005 NIS is a sample of hospitals that comprises approximately 90% of all hospital discharges in the United States. To ensure the representative nature of the database, the NIS is stratified by geographic region, urban versus rural location, teaching status, hospital ownership, and hospital bed size. The NIS is managed under the Healthcare Cost and Utilization Project of the Agency for Healthcare Research and Quality. For this study we combined data from the 2000 through 2005 NIS databases.

Congenital cardiac surgical procedures performed on patients under 18 years of age were identified by International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) diagnosis and procedure codes. In order for a patient to be included in this study the procedure code had to match to a plausible diagnostic code. Operations were categorized by the Risk Adjustment for Congenital Heart Surgery, version 1 (RACHS-1-1) method [8]. This risk stratification system groups the varied congenital cardiac surgical case mix into six categories based on similar expected short-term mortality rates. Category 1 has the lowest risk of death and category 6 the highest. Category 1 contains atrial septal defect repair, patent ductus arteriosis closure on patients over 30 days of age, and aortic coarctation repair on patients over 30 days of age. Category 2 contains operations such as ventricular septal defect closure, pulmonary valve replacement, total repair of tetralogy of Fallot, and Glenn shunt. Category 3 includes operations such as aortic valve replacement, Fontan procedure, and arterial switch. Category 4 includes complex neonatal surgery such as repair of transposition of the great arteries with ventricular septal defect and repair of truncus arteriosis. Category 5 includes repair of neonatal Epstein's anomaly and repair of truncus interrupted arch. Category 6 includes the Norwood procedure and the Damus-Kaye-Stansel procedure. The RACHS-1 method is a widely used risk stratification methodology for congenital heart surgery [8–12]. The methodology has been validated and is included in the Society of Thoracic Surgeons Congenital Heart Surgery Database reports [5]. Selected individual operations were also identified by ICD-9-CM codes.

Mortality was defined as in-hospital mortality as indicated by the discharge disposition. Hospital volume was defined as the number of operations performed in a year. If a hospital appeared in the NIS in 2 or more years, each entry was treated separately. Hospital volume was not averaged over the 6 year period because all hospitals were not sampled in all years and it is possible that personnel and systems changes occurred over this time that impacted volume and quality of care.

The actual annual hospital case volume for each RACHS-1 category and selected individual operation was compared with thresholds necessary to detect a statistically significant (as measured by 95% confidence intervals) doubling and a 5 percentage point increase in the mortality rate. Both 1-tailed and 2-tailed t tests were used. A 1-tailed test is appropriate when the hypothesis states the direction of the difference or relationship. For example, a 1-tailed test would be appropriate for testing whether or not a hospital's mortality rate was significantly worse than a mean or benchmark. A 2-tailed test is used when the direction of the difference or relationship is not specified. For example, a 2-tailed test would be used to determine whether or not the mortality rate at one hospital was significantly different, either higher or lower than the mortality rate of another hospital. Analyses were performed using SAS version 9.1 (SAS, Inc, Cary, NC).


    Results
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Discussion
 References
 
We identified 21,709 operations in the NIS from 2000 to 2005 that could be placed in RACHS-1 categories. These operations took place at 161 hospitals giving a total of 252 hospital-year combinations (Table 1). The median hospital case volume was 61 and the overall national mortality rate was 4.2%. The median hospital case volume for designated children's hospitals was 100.


View this table:
[in this window]
[in a new window]

 
Table 1 Characteristics of Pediatric Cardiac Surgical Procedures in the United States 2000 to 2005
 
We first performed a series of 1-tailed tests as would be used to determine if the performance of a hospital was significantly worse than a mean or benchmark. No hospital had the minimum case volumes necessary to detect a doubling of mortality or a 5 percentage point increase in mortality for a list of chosen operations (Fig 1). After operations were aggregated into RACHS-1 categories, a minority of hospitals met the volumes needed to detect a doubling of mortality or a 5 percentage point increase in mortality for any individual RACHS-1 category (Fig 2). Minimum case volumes necessary to detect a 5 percentage point increase in mortality ranged from 71 for RACHS-1 category 1 to 588 for RACHS-1 category 5. A median volume hospital would need 6 years to reach the RACHS-1 category 1 volume threshold and 120 years to accumulate the 481 cases needed to reach the threshold for RACHS-1 category 6. Results were similar with regard to the surgical case volumes needed to detect a doubling of mortality. Minimum hospital case volumes needed to detect a doubling of mortality ranged from 11 for RACHS-1 category 5 to 2,935 for RACHS-1 category 1 (Table 2). Of note, a median volume hospital would have to have at least a 24% mortality rate for RACHS-1 category 1 and at least a 90% mortality rate for RACHS-1 category 6 for it to be differentiated from a hospital with the national mean mortality rate. If all RACHS categorized cases were aggregated, 167 operations would be needed to detect a 5 percentage point difference from the national mean mortality rate of 4.2%. Fifteen percent of hospitals met or exceeded this threshold. A median volume hospital, 61 operations per year, would have to have a mortality rate of 15% to be statistically different from the national mean mortality rate. A hospital with an annual volume equal to the designated children's hospitals median volume of 100 operations per year would have to have a mortality rate of 12% to be statistically different from the national mean. Similarly, in order to detect a doubling of mortality rate for all RACHS-1 category patients, a total of 220 patients were required and only 7.9% (n = 20) of hospitals met this minimum caseload (Fig 3). The time needed for a median volume hospital to accrue this case volume was 4 years.


Figure 1
View larger version (26K):
[in this window]
[in a new window]

 
Fig 1. Actual hospital case volumes and threshold volumes necessary to detect a doubling of or 5 percentage point increase in the mortality rate: selected operations. The height of each box ranges from the 25th to the 75th percentile of volume (for U.S. hospitals), the whiskers are the 10th and 90th percentiles, and the median volume is depicted as a horizontal line. The width of each box represents the 95% confidence intervals of the mortality rate. As predicted mortality decreases, the volume for detecting a doubling increases. Operations with higher mortality require higher volume to detect a 5 percentage point difference in mortality. Threshold volumes for a 1-tailed test, such as comparison of a hospital to a mean or benchmark, are depicted as dashed lines. Threshold volumes for a 2-tailed test, such as comparison of two hospitals, are depicted as solid lines. (ASD = atrial septal defect closure; VSD = ventricular septal defect closure.)

 

Figure 2
View larger version (28K):
[in this window]
[in a new window]

 
Fig 2. Actual hospital case volumes and threshold volumes necessary to detect a doubling of or 5 percentage point increase in the mortality rate: RACHS-1 cases by category. The height of each box ranges from the 25th to the 75th percentile of volume (for U.S. hospitals), the whiskers are the 10th and 90th percentiles, and the median volume is depicted as a horizontal line. The width of each box represents the 95% confidence intervals of the mortality rate. As predicted mortality decreases, the volume for detecting a doubling increases. Operations with higher mortality require higher volume to detect a 5 percentage point difference in mortality. Threshold volumes for a 1-tailed test, such as comparison of a hospital to a mean or benchmark, are depicted as dashed lines. Threshold volumes for a 2-tailed test, such as comparison of two hospitals, are depicted as solid lines. (RACHS-1 = Risk Adjustment for Congenital Heart Surgery, version 1.)

 

View this table:
[in this window]
[in a new window]

 
Table 2 Mortality Rates Needed to Detect Differences at Current Median Case Volumes and Volume Thresholds Needed to Detect Differences at Current Median Mortality Rates: 1-Tailed Test
 

Figure 3
View larger version (22K):
[in this window]
[in a new window]

 
Fig 3. Actual hospital case volumes and threshold volumes necessary to detect a doubling of or 5 percentage point increase in the mortality rate: hospital total RACHS-1 case volume. The height of each box ranges from the 25th to the 75th percentile of volume (for U.S. hospitals), the whiskers are the 10th and 90th percentiles, and the median volume is depicted as a horizontal line. The width of each box represents the 95% confidence intervals of the mortality rate. As predicted mortality decreases, the volume for detecting a doubling increases. Operations with higher mortality require higher volume to detect a 5 percentage point difference in mortality. Threshold volumes for a 1-tailed test, such as comparison of a hospital to a mean or benchmark, are depicted as dashed lines. Threshold volumes for a 2-tailed test, such as comparison of two hospitals are depicted as solid lines. (RACHS-1 = Risk Adjustment for Congenital Heart Surgery, version 1.)

 
We then performed a series of 2-tailed tests as would be used to determine whether the performance of one hospital was either better or worse than a second hospital. Again, no hospital had the minimum annual case volumes necessary to detect a doubling of mortality or a 5 percentage point increase in mortality for a list of chosen operations (Fig 1). Similarly, no hospital met the volumes needed to detect doubling of mortality or a 5 percentage point increase in mortality for any individual RACHS-1 category (Fig 2). Minimum case volumes necessary to detect a 5 percentage point increase in mortality ranged from 173 for RACHS-1 category 1 to 1,483 for RACHS-1 category 5 (Table 3). Minimum hospital case volumes needed to detect a doubling of mortality ranged from 30 for RACHS-1 category 5 to 6,889 for RACHS-1 category 1. When all RACHS-1 categorized operations were grouped together, 329 operations were necessary to detect a 5 percentage point difference between two hospitals. 3.6% of hospitals reached or exceeded this threshold. Similarly, in order to detect a doubling of mortality rate between hospitals for all RACHS-1 categorized operations combined, a total of 525 patients were required. Only 1.6% (n = 4) of hospitals met this minimum caseload (Fig 3). A median volume hospital would need 9 years to accumulate this case volume (Table 2).


View this table:
[in this window]
[in a new window]

 
Table 3 Mortality Rates Needed to Detect Differences at Current Median Case Volumes and Volume Thresholds Needed to Detect Differences at Current Median Mortality Rates: Two-Tailed Test
 

    Comment
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Discussion
 References
 
The ability to differentiate between hospitals based on mortality rates depends on both the mortality rate of the procedure of interest and the frequency with which the procedure is performed. The minimum number of cases needed to detect a doubling of mortality is inversely related to the mortality rate of the procedure. The minimum number of cases needed to detect a 5 percentage point difference in mortality increases as the mortality rate increases. Pediatric cardiac surgery operations, either individually or in aggregate, are either performed too infrequently or have mortality rates that are too low to allow valid hospital quality comparisons to be based on mortality. Not only does the comparison of mortality rates fail to differentiate high performers, but it also may erroneously reassure poor performers by showing them to be no different than the mean.

The numbers of individual operations performed annually were too small for quality comparisons. Even when operations were aggregated into groups (RACHS-1 categories) or totaled for each hospital, the sample sizes were still too small. Because mortality rates for the majority of pediatric cardiac surgical operations are low, grouping dilutes the influence of more infrequent, higher mortality operations such as the Norwood procedure and results in the need for a sample size more similar to a low complexity operation. There are additional problems with grouping operations. When dissimilar operations are grouped together, information is lost. Although there are some system level similarities that affect patients who undergo different operations, there are factors specific to each operation that are crucial to achieving a good outcome. By grouping vastly different operations together, the ability to investigate operation specific factors is lost. Larger sample sizes can be achieved by collecting data over longer periods of time. However, over time not only do personnel and systems at hospitals change, but treatments change as well. As a result, the utility of the collected data is reduced.

Our thresholds of a doubling of or 5 percentage point increase in mortality were chosen as targets that most individuals would agree represent clinically significant quality differences between programs. As one often wishes to detect finer gradations of difference, these thresholds may seem excessive. However, as the disparity in mortality rates one wishes to detect is reduced, the volume of cases needed to detect a difference increases. In order to detect a 1.5 times or 2 percentage point difference in mortality, case volumes would have to be even larger than the thresholds we used for a doubling or 5 percentage point increase. Given our findings, such comparisons cannot be made with current annual hospital volumes.

Another way of addressing the problem of small sample size is to investigate outcomes that occur more frequently than mortality, such as postoperative complications. At present, this is complex for several reasons. Unlike a relatively objective outcome, such as mortality, many complications are more subjective. Collection of complications data, either by administrative coders or clinical personnel, is time consuming and errors are likely to occur. In addition, definitions of complications are less standard than for mortality and collected data are more difficult to validate. Although more common than mortality, most complications still occur infrequently and may be associated with only one or a subset of operations. If complications are grouped into composite scores, or a surrogate for complications such as length of stay is used, the ability to link the resulting data to specific clinical issues or processes of care is reduced. Outcomes such as quality of life and neurologic status are experienced by all patients, alleviating the problem with small event rates. However, at present, due in part to cost and practicality, such data are not widely available.

Rather than a focus on outcomes, an alternative approach to determining quality is to look at structural components and process measures [13]. In order for the chosen measures to be valuable they must impact outcome and be modifiable. It is of no use to scrutinize a process measure that does not impact outcome, nor is it useful to consider a structural measure that cannot be changed. Unfortunately, the numbers of structural components and process measures demonstrated to influence outcome in congenital cardiac surgery are small and those that are known are often procedure specific.

We chose to use the RACHS-1 method to classify operations for this investigation. The RACHS-1 was developed to compare the mortality for groups of patients undergoing congenital cardiac surgery. The RACHS-1 method is a widely used risk stratification methodology for congenital heart surgery [8–12]. The methodology does not allow for conditions that may be risk factors for specific operations. However, for this analysis, RACHS-1 was used for categorization only, not risk adjustment. While risk adjustment is essential for accurate comparison of mortality rates, it is secondary to sample size. If there are too few cases available for a valid comparison to be made, the risk adjustment methodology employed is irrelevant.

Our analysis has several limitations. First, although the NIS is the largest all-payer inpatient care database in the United States, it is a sample and not a complete database of all hospital discharges. As a result, although it is designed to be representative of national practice, there is the possibility for error. However, the NIS is the largest collection of real data that can currently be used for addressing the present question. Second, the NIS is an administrative database. Administrative data were designed for claims data collection and billing, not heath care research, and can be limited by erroneous coding of congenital diagnoses [14, 15]. As a result, we may have miscategorized some cases. However, given the large sample size, random miscoding would be unlikely to significantly influence our findings. We reduced error from miscoding of data in this study by only including patient records in which the procedure code matched to a plausible diagnostic code.

Notwithstanding these limitations, our study was conducted using a large, national dataset. This gave us adequate power to generate current, stable mortality rates. Because the NIS is a sample of all hospitals performing congenital cardiac surgery in the United States, rather than a select sample that has chosen to participate in a voluntary database, the mortality rates and hospital case volumes are likely an unbiased representation of the overall practice in the country.

Presently available data and current statistical methodology limit the use of mortality as a metric of quality in pediatric cardiac surgery. The structure of the United States health care system, with a large number of low volume pediatric cardiac centers, contributes to the difficulty in assessing quality [16]. The majority of hospital mortality rates are clustered, or regressed, around a mean value. Adequate sample size is therefore critical to allow discrimination; both identification of hospitals performing exceptionally well and those performing poorly. Regionalization of pediatric cardiac care or regional collaboration among centers not only might improve outcomes by concentrating experience, but also would facilitate quality assessment by increasing surgical volumes [17]. Performance metrics other than mortality may provide a more meaningful measurement of quality. The reliance on mortality alone to represent quality of pediatric cardiac surgery misleads health care providers and hospitals who may incorrectly assume they are providing quality care and patients and their families who are falsely reassured.


    Discussion
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Discussion
 References
 
DR DAVID M. SHAHIAN (Boston, MA): I would like to congratulate the authors for an important and timely study and Dr Welke for his excellent presentation. Although based upon administrative data, the average mortality rates that were used for these power calculations are quite consistent with those in the STS [Society of Thoracic Surgeons] congenital database, and I generally agree with both the methodology and the inferences of this study. So we have a problem. If traditional mortality profiling is underpowered to detect performance differences, then how do we responsibly measure and assure quality in congenital heart surgery?

First, I would not completely abandon the concept of mortality measurement, as there are many adjuncts and refinements that may enhance its utility. For example, hierarchical models are increasingly used in outcomes profiling to provide better estimates of true performance when sample sizes are small. Two graphical methods have also been utilized to supplement traditional mortality reports. Sequential, real-time mortality monitoring using CUSUM [cumulative sum] plots may provide early warning of deteriorating performance, and congenital heart surgery was a noteworthy early application of this method. Funnel plots graph mortality against procedural volumes. Their superimposed funnel-shaped confidence intervals focus attention on the increased random variability associated with small sample size.

Effective performance monitoring should also encompass other measures in addition to mortality. The Institute of Medicine advocates more comprehensive multidimensional measures of quality, and this principle motivated development of the STS CABG [coronary artery bypass grafting] composite score, consisting of 11 NQF [National Quality Forum]-endorsed measures. In congenital heart surgery, I have listed a few of the many potential outcomes measures that might be utilized in addition to mortality, as well as several structure and process measures. Such measures could be used individually, or preferably aggregated into a composite together with mortality. In addition to providing a more comprehensive perspective on quality, such composites effectively increase the number of endpoints and thus statistical power.

Notwithstanding the incremental value of all these approaches, there is no escaping the fact that mortality will always remain a central component of surgical quality and assurance, which brings me to my question for Dr Welke.

Karl, in studies last year you demonstrated that many hospitals in the U.S. perform very low volumes of congenital procedures, and that, in contrast to CABG surgery, there appears to be a significant volume outcome association, at the very least for the more complex cases. Today you have shown that it would be almost impossible to effectively monitor mortality for low and very low volume programs. Viewed in aggregate, do your findings over the last several years suggest a role for volume thresholds or regionalization in congenital heart surgery, especially for more complex cases? Thank you.

DR WELKE: Thank you, Dr Shahian, for your kind comments and your question. I agree with everything that you have said. The issue of volume is a touchy one. In the papers that we wrote last year, we did see a relationship between volume and mortality when appropriately adjusted for risk. An important caveat to the thresholds in those papers is that although in aggregate large volume hospitals performed better than small volume hospitals, there were many small volume hospitals that performed equally well, at least equally well given the limits of our current data and methodology, relative to large volume hospitals. So it is difficult to use the thresholds that we had in our papers as absolute cutoffs.

What is noteworthy from those papers, however, is the number of very small volume hospitals doing congenital heart surgery. When we looked at the Nationwide Inpatient Sample, for instance, 188 out of the 307 hospitals performed less than 20 cases per year. Even in the Society of Thoracic Surgeons Congenital Heart Surgery Database, 15 of the 48 hospitals performed under 150 cases per year. So there are a lot of very small volume hospitals out there.

The aggregation of those cases into larger centers may have an effect on mortality. This was looked a couple of years ago using California administrative data from 1995 to 1997. Theoretically, 24% of congenital cardiac surgical deaths could be eliminated by moving cases from small volume to large volume hospitals. So consolidation and regionalization may be attractive not only from an economy of scale perspective, but also to reduce mortality. What the actual volume threshold should be, however, is still to be determined.

DR PETER MCKEOWN (Pikeville, KY): I want to compliment Dr Welke on a very pertinent and a very well presented paper. The difficulty you presented with mortality is the fact that it is a discrete variable. It is an all or none phenomenon. And so I guess I have got a couple of questions. Number one is, did you look at the observed-to-expected (O/E) outcomes such as used in the STS and NSQIP [National Surgical Quality Improvement Program] databases and how did that influence the data?

The second thing that you brought out very nicely is the need for us as societies to find ways to benchmark and share, particularly now that in many cases we are being judged by claims data, which is not the same as outcome data or performance data. So that is the second thing.

And then the third thing is, how can we measure competency as a measure of performance and quality and did you think about that in your review of the data? Thank you very much.

DR WELKE: Thank you for your comments. With regard to your first question, we did not look at observed-to-expected ratios. In fact, these numbers are not adjusted. There is no need in this case. When making mortality comparisons, one must first have an adequate number of patients in the sample. Then risk adjustment can be addressed. The former is necessary before the latter is a concern. Our analysis addresses sample size.

With regard to measurement of competency, there is going to be board certification in congenital heart surgery very soon. Individuals are currently going through fellowship training in preparation for that examination and currently practicing surgeons will have the opportunity to become certified as well. So there will be a measurement of surgeon competency, at least at the certification level, from the American Board of Thoracic Surgery. That will be an important innovation. I am interested to see the changes in the practice of congenital cardiac surgery across the country over the next 10 to 15 years that result from the designation of a group of certified surgeons.

DR MCKEOWN: And then the other part was how do we share the benchmark data, how do we get better by sharing competencies or outcomes as the New England group has done?

DR WELKE: Our specialty is different than adult cardiac surgery in that there are a lot fewer of us and our centers are more widely dispersed. In some ways that facilitates collaboration because, in general, we have fewer competition issues with programs across town or in the state. What we need to do is to take the regional quality improvement model and move it to a national quality improvement model. That may be led by this Society, it may be led by the Congenital Heart Surgeons Society, but that would be a real addition to what we do.

DR DOUGLAS E. WOOD (Seattle, WA): I have one additional question related to your answers to both discussants. Dr Shahian and you discussed the aspect of regionalization of care, which is an automatic conclusion of this type of report, but you have also pointed out the wide geographic separation of congenital centers. Although it seems logical that there would be decreased mortality with regionalization of care, can you comment on the unintended consequences of regionalization, more specifically, the potential lack of access to care that you have just alluded to in the wider geographic distribution of congenital centers?

DR WELKE: Thank you, Dr. Wood. That is an excellent point. There is always going to be a compromise as to where you draw the line. It is easy in a large city to say from a patient care perspective, "we are going to reduce the number of centers." It is more difficult where you and I live, in the Northwest, to say there will only be congenital cardiac care in Seattle and Denver, for instance. Interestingly, in the California study that I mentioned earlier, if all the patients at hospitals that did under 70 cases per year were moved to hospitals that did over 170 cases per year, the average added travel distance was 12.7 miles. So there are a lot of areas where we could regionalize without an increased distance component. However, any regionalization plan should take into account and minimize the longer travel distances and related negative impacts, such as family stress and potential delays in treatments, which patients and families in more sparsely populated areas might experience.


    References
 Top
 Abstract
 Introduction
 Material and Methods
 Results
 Comment
 Discussion
 References
 

  1. Pediatric Congenital Cardiac Surgery in New York State 2002–2005Albany, NY: New York State Department of Health; 2007.
  2. The Leapfrog Grouphttp://www.leapfroggroup.org 2007Accessed January 28, 2009.
  3. National Quality Forumhttp://www.qualityforum.org 2007Accessed January 28, 2009.
  4. Agency for Healthcare Research and Qualityhttp://www.ahrq.gov 2007Accessed January 28, 2009.
  5. The Society of Thoracic Surgeons Congenital Heart Surgery Database Executive Summary, fall 2008 harvesthttp://www.sts.org/documents/pdf/ndb/Fall2008STSCONG-ALLPatientsSUMMARY.pdf 2007Accessed January 28, 2009.
  6. Dimick JB, Welch HG, Birkmeyer JD. Surgical mortality as an indicator of hospital quality: the problem with small sample size JAMA 2004;292:847-851.[Abstract/Free Full Text]
  7. HCUP Nationwide Inpatient Sample (NIS). Healthcare Cost and Utilization Project (HCUP). 2000–2005. Agency for Healthcare Research and Quality, Rockville,, MDwww.hcup-us.ahrq.gov/nisoverview.jsp 2004Accessed January 28, 2009.
  8. Jenkins KJ, Gauvreau K. Center-specific differences in mortality: preliminary analyses using the Risk Adjustment in Congenital Heart Surgery (RACHS-1) method J Thorac Cardiovasc Surg 2002;124:97-104.[Abstract/Free Full Text]
  9. Boethig D, Jenkins KJ, Hecker H, Thies WR, Breymann T. The RACHS-1 risk categories reflect mortality and length of hospital stay in a large German pediatric cardiac surgery population Eur J Cardiothorac Surg 2004;26:12-17.[Abstract/Free Full Text]
  10. Kang N, Cole T, Tsang V, Elliot M, de Leval M. Risk stratification in paediatric open-heart surgery Eur J Cardiothorac Surg 2004;26:3-11.[Abstract/Free Full Text]
  11. Welke KF, Shen I, Ungerleider RM. Current assessment of mortality rates in congenital cardiac surgery Ann Thorac Surg 2006;82:164-171.[Abstract/Free Full Text]
  12. Al-Radi OO, Harrell Jr FE, Caldarone CA, et al. Case complexity scores in congenital heart surgery: a comparative validation study of the Aristotle Basic Complexity score and the Risk Adjustment in Congenital Heart Surgery (RACHS-1) system J Thorac Cardiovasc Surg 2007;133:865-875.[Abstract/Free Full Text]
  13. Donabedian A. Evaluating the quality of medical care Milbank Mem Fund Q 1966;8344:166-206.
  14. Cronk CE, Malloy ME, Pelech AN, et al. Completeness of state administrative databases for surveillance of congenital heart disease Birth Defects Res A Clin Mol Teratol 2003;67:597-603.[Medline]
  15. Frohnert BK, Lussky RC, Alms MA, et al. Validity of hospital discharge data for identifying infants with cardiac defects J Perinatol 2005;25:737-742.[Medline]
  16. Welke KF, Diggs BS, Karamlou T, Ungerleider RM. The relationship between hospital surgical case volumes and mortality rates in pediatric cardiac surgery: a national sample 1988–2005 Ann Thorac Surg 2008;86:889-896.[Abstract/Free Full Text]
  17. Chang RK, Kiltzner TS. Can regionalization decrease the number of deaths for children who undergo cardiac surgery?. A theoretical analysis. Pediatrics 2002;109:173-181.[Abstract/Free Full Text]



This article has been cited by other articles:


Home page
J. Thorac. Cardiovasc. Surg.Home page
M. Nathan, J. M. Karamichalis, H. Liu, P. del Nido, F. Pigula, R. Thiagarajan, and E. A. Bacha
Intraoperative adverse events can be compensated by technical performance in neonates and infants after cardiac surgery: A prospective study
J. Thorac. Cardiovasc. Surg., November 1, 2011; 142(5): 1098 - 1107.e5.
[Abstract] [Full Text] [PDF]


Home page
J. Thorac. Cardiovasc. Surg.Home page
M. E. Oster, M. J. Strickland, and W. T. Mahle
Impact of prior hospital mortality versus surgical volume on mortality following surgery for congenital heart disease
J. Thorac. Cardiovasc. Surg., October 1, 2011; 142(4): 882 - 886.
[Abstract] [Full Text] [PDF]


Home page
World Journal for Pediatric and Congenital Heart SurgeryHome page
K. F. Welke
Interpreting Congenital Heart Disease Outcomes: What Do Available Metrics Really Tell Us?
World Journal for Pediatric and Congenital Heart Surgery, July 1, 2010; 1(2): 194 - 198.
[Abstract] [Full Text] [PDF]


Home page
Ann. Thorac. Surg.Home page
D. Chu and F. G. Bakaeen
Reply
Ann. Thorac. Surg., July 1, 2010; 90(1): 358 - 359.
[Full Text] [PDF]


Home page
Eur J Cardiothorac SurgHome page
W. G. Williams
Congenital heart disease: interrelation between German diagnoses-related groups system and Aristotle complexity score
Eur J Cardiothorac Surg, June 1, 2010; 37(6): 1276 - 1277.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Author home page(s):
Karl F. Welke
Tara Karamlou
Ross M. Ungerleider
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Welke, K. F.
Right arrow Articles by Diggs, B. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Welke, K. F.
Right arrow Articles by Diggs, B. S.
Related Collections
Right arrow Congenital - cyanotic
Right arrow Professional affairs


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
ANN THORAC SURG ASIAN CARDIOVASC THORAC ANN EUR J CARDIOTHORAC SURG
J THORAC CARDIOVASC SURG ICVTS ALL CTSNet JOURNALS