ATS
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Warner, B. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Warner, B. A.
Related Collections
Right arrowRelated Articles

Ann Thorac Surg 1997;63:1529-1530
© 1997 The Society of Thoracic Surgeons


Editorial

Thoughts and Considerations on Modeling Coronary Bypass Surgery Risk

Bradley A. Warner, PhD

United States Air Force Academy, USAF Academy, Colorado

Neural network models are becoming increasingly common in medical research as evidenced by the recent surge of articles in the medical literature discussing or using neural network models [13]. The attraction of neural networks seems to stem from the potential for improved predictive performance, the loose biological relationship upon which neural network models were originally founded, and the impressive predictive performance reported in many articles. Lippmann and Shahian's article [4] introduces the use of neural network models in coronary artery bypass grafting (CABG) mortality prediction. They do an excellent job addressing concerns and potential limitations of neural network models as well as comparing neural network models with existing methods.

The history of neural network models began with studies that attempted to model the brain. The idea was to have many simple computational units acting in a highly interconnected manner, similar to the brain. However, neural network models presently used in prediction problems have little biological equivalence to the brain. The neural network models typically used for prediction consist of layers of simple computational units. These simple units are often called nodes or neurons (again hinting at a biological motivation). The nodes in a layer receive as input a simple weighted sum of outputs from nodes in previous layers. Each node applies a functional transformation to this weighted input and outputs this result to the nodes in the subsequent layer(s). The weights (also known as synaptic weights) are adjusted through a training procedure (the most common procedure is known as backpropagation) to achieve maximum predictive performance. Once training stops, the weights remain fixed and prediction is accomplished by passing input data through the completed model. The output is then used as the predicted value (for a more comprehensive discussion of neural networks see Lippmann [5]).

See also 1531 and 1635.

The first layer of a neural network model is the input layer and the last layer is the output layer. The layers in between are known as hidden layers. It is the hidden layers that give neural network models their ability to model complex functional relationships. With a sufficient number of hidden units, a neural network can approximate any continuous function [6]. Without the hidden layers, a neural network can be thought of as a traditional regression model. Therefore, the difference between the traditional regression models and neural networks is, on a simplified level, the hidden units. In a traditional regression model, there is an implied additive, or linear, structure between predictor variables. The inclusion of hidden units in a neural network model remove this linearity restriction, enabling the modeling of more complex relationships. In practice, the hidden layers are using the data to find nonlinearities, interactions, and nonlinear interactions in the predictor set. In other words, the hidden units of the neural network models enable the effects of the predictors on the outcome to be nonlinear and also to depend on the levels of the other predictors. It is possible to build regression models that include interaction and nonlinear terms; however, unless there is some prior knowledge these relationships exist, this process is a time-consuming trial and error routine. With a large number of predictor variables, it is impractical to manually search for these relationships. For example, with just 13 predictor variables, 78 different regression models would have to be constructed just to compare all single two-way interactions. Additional regression models are necessary to investigate models with more than one interaction term, nonlinear terms, or interactions of nonlinear terms. The neural network models automate this process to some extent by using the data to find these relationships. From a practical standpoint, a moderate number of predictors makes it impractical to build regression models with the equivalent complexity of neural network models.

Other modeling methods exist that are able to achieve the modeling complexity of neural network models (such as projection pursuit regression and multivariate adaptive regression splines [7, 8]), but their use and discussion is absent from the medical literature. This is in part due to the readily available software for neural network models, the relatively simple conceptual presentation of the model, which includes in some part its crude biological motivation, and the abundance of medical literature reporting success in applying neural network models in prediction problems. It is this last point where Lippmann and Shahian's article makes an important contribution. Although neural network models have the potential to improve predictive performance, it is not a guarantee they will. Lippmann and Shahian illustrate this point for operative mortality prediction in CABG patients by comparing a neural network model and logistic regression model. They establish this result through a careful and thorough process of model building and model comparison. Their results are important because unrealistic expectations of improved predictive performance are established when the only articles published demonstrate a superior performance of neural network models over more conventional models.

A final point to consider is the following: Why is it that neural network models do not improve the predictive performance as compared with the more conventional models for this application? Remember, a neural network model will improve predictive performance over a logistic regression model only if significant interactions, nonlinearities, or nonlinear interactions exist. The lack of superior performance from the neural network model may indicate these relationships do not exist for this problem. Conversely, these relationships could exist, but the data do not contain sufficient information for the neural network model to uncover them. Examining the predictors used to model CABG operative mortality reveals many of the predictors are binary, they take on only two values. With only two values, the neural network model will not be able to learn the effects of the predictor on the outcome outside of these two values. The neural network has been restricted to a simpler model than is possible if the predictor variables take on a range of values. The implication of Lippmann and Shahian's results is that improved prediction of CABG operative mortality is not possible for the current set of risk factors. If improved prediction is possible, it will come from predictor variables that contain more information in use with models that can exploit this information. This means risk factors that are not coded as binary variables or the inclusion of other predictors of operative mortality besides risk variables; for example, variables that measure the processes and structures of care such as total operating room time or surgeon experience. This does not imply improved predictive performance is possible, but only that the potential for increase exists in the appropriate combination of predictors and models.

Footnotes

Address reprint requests to Dr Warner, HQ USAFA/DFMS, 2354 Fairchild Dr, Suite 6D2A, USAF Academy, CO 80840.

References

  1. Kok MR, Boon ME. Consequences of neural network technology for cervical screening: increase in diagnostic consistency and positive scores. Cancer 1996;78:112–7.[Medline]
  2. Brigatti L, Hoffman D, Caprioli J. Neural networks to identify glaucoma with structural and functional measurements. Am J Ophthalmol 1996;121:511–21.[Medline]
  3. Dybowski R, Weller P, Chang R, Gant V. Prediction of outcome in critically ill patients using artificial neural network synthesized by genetic algorithm. Lancet 1996;347:1146–50.[Medline]
  4. Lippmann RP, Shahian DM. Coronary artery bypass risk prediction using neural networks. Ann Thorac Surg 1997;63:1635–43.[Abstract/Free Full Text]
  5. Lippmann RP. An introduction to computing with neural nets. IEEE ASSP Magazine 1987;4:4–22.
  6. Cybenko G. Approximation by superpositions of a sigmodial function. Mathematics of Control, Signals, and Systems 1989;2:303–14.
  7. Friedman JH, Stuetzle RJ. Projection pursuit regression. J Am Stat Assoc 1981;76:817–23.
  8. Friedman JH. Multivariate adaptive regression splines. Ann Stat 1991;19:1–141.

Related Articles

Cardiothoracic Databases: Where Are We Headed?
Frederick L. Grover
Ann. Thorac. Surg. 1997 63: 1531-1532. [Extract] [Full Text]

Coronary Artery Bypass Risk Prediction Using Neural Networks
Richard P. Lippmann and David M. Shahian
Ann. Thorac. Surg. 1997 63: 1635-1643. [Abstract] [Full Text]



This article has been cited by other articles:


Home page
Ann. Thorac. Surg.Home page
D. M. Shahian, E. H. Blackstone, F. H. Edwards, F. L. Grover, G. L. Grunkemeier, D. C. Naftel, S. A.M. Nashef, W. C. Nugent, and E. D. Peterson
Cardiac Surgery Risk Models: A Position Article
Ann. Thorac. Surg., November 1, 2004; 78(5): 1868 - 1877.
[Abstract] [Full Text] [PDF]


Home page
Ann. Thorac. Surg.Home page
D. M. Shahian, S.-L. Normand, D. F. Torchiana, S. M. Lewis, J. O. Pastore, R. E. Kuntz, and P. I. Dreyer
Cardiac surgery report cards: comprehensive review and statistical critique
Ann. Thorac. Surg., December 1, 2001; 72(6): 2155 - 2168.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Warner, B. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Warner, B. A.
Related Collections
Right arrowRelated Articles


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
ANN THORAC SURG ASIAN CARDIOVASC THORAC ANN EUR J CARDIOTHORAC SURG
J THORAC CARDIOVASC SURG ICVTS ALL CTSNet JOURNALS