European Heart Journal (1996) 17, 1181-1191 Early diagnosis of acute myocardial infarction using clinical and electrocardiographic data at presentation: derivation and evaluation of logistic regression models R. L. Kennedy*, A. M. Burton*, H. S. Fraser*, L. N. McStay* and R. F. Harrisonf *Department of Medicine, Western General Hospital, Edinburgh, U.K. and jDepartment of Automatic Control & Systems Engineering, University of Sheffield was the most effective on test data, yielding accuracies of 84-3 and 83-6% on the two test sets. A model constructed solely of electrocardiographic data performed nearly as well as those incorporating clinical data. Previously published logistic regression models did not perform so well as the models derived from data collected for this study. (Eur Heart J 1996; 17: 1181-1191) Key Words: Myocardial infarction, diagnosis, logistic regression, electrocardiograph, case history. networks'17 191. It is not clear whether any of these methods is clearly superior to the others. Also, the place The early diagnosis and management of acute chest pain of these tools in clinical practice remains to be estabremains one of the greatest challenges in emergency lished, although early observations have suggested medicine. Identification of patients with unstable cor- that their use, in a study situation, improves diagnosonary disease is essential so that appropriate investi- tic performance and the use of high dependency 12 20 211 gation, therapy and monitoring can be delivered, beds' ' ' . The algorithms which have been described particularly to high risk groups. Application of an recently are complex, making use of a relatively large 10 191 appropriate algorithm might improve diagnostic per- number (typically around 50) of data input items' ' . formance, optimize therapy and use of resources, and The more complex a model is, the less likely it is to be may also be helpful in healthcare planning. Appropriate used in routine practice. interpretation of the electrocardiogram at presentation Logistic regression is a non-linear classification yields the most discriminatory information regarding technique which uses binary data to derive a series of diagnosis and prognosis'1"41, but the power of these data coefficients which, when applied to unseen vectors, are increased by combining them with clinical infor- yield a probability of a single output (e.g. the presence mation'51. A variety of techniques has been used to of myocardial infarction). The performance of logistic combine clinical and electrocardiographic data into de- regression models for early diagnosis of acute myocision support algorithms'61. Methods used include linear cardial infarction is almost certainly superior to those discriminant analysis'7'81, Bayesian inference'9101, recur- derived using decision trees'221. The other major advansive partition analysis'"'121, logistic regression'13"151, tages are that the models are easy to apply and can knowledge-based systems'161, and artificial neural be derived easily using widely available statistical packages. Thus, it should be easy, given the appropriate data, to optimise models for use in any given Revision submitted and accepted 5 September 1995. situation, overcoming the problems with portability Correspondence: Professor R. L. Kennedy, Department of Medi- which have been encountered previously in this area of cine, District General Hospital, Kayll Road, Sunderland SR4 7TP, work. U.K. Introduction 0195-668X/96/08I181 + 11 $18.00/0 © 1996 The European Society of Cardiology Downloaded from by guest on October 15, 2014 The aim of this study was to determine which, and how many, data items are required to construct a decision support algorithm for early diagnosis of acute myocardial infarction using clinical and electrocardiographic data available at presentation. Logistic regression models were derived using data items from 600 consecutive patients at one centre (Edinburgh), then tested prospectively on 510 cases from the same centre and 662 consecutive cases from another centre (Sheffield). Although performance of the models increased with progressive addition of data inputs when applied to training data, a simple six-factor model 1182 R. L. Kennedy et al. The aims of this study were: (1) to determine the optimal inputs for a logistic regression model to assist with early diagnosis of acute myocardial infarction; (2) to examine the applicability of such models to data collected prospectively in different centres; (3) to compare the performance of models derived only from electrocardiographic data with those which also include clinical data — for retrospective use, electrocardiographic data is easier to collect and validate; (4) to compare the performance of models optimized for use in our centres with that of previously published logistic regression models. Patients and methods Patients and clinical data The logistic regression models derived from the above data were independently tested on data collected from 662 patients attending the Northern General Eur Heart J, Vol. 17, August 1996 Logistic regression models These were derived using the maximum likelihood method to calculate coefficients on the Advanced Statistics Module of the SPSS for Windows Program (SPSS Inc., Chicago, Illinois, U.S.A.). Logically redundant inputs were eliminated. Thus, the inputs corresponding to the fourth age input, and the second input bands for duration of pain and total duration of symptoms were not included. The final probability of acute myocardial infarction was calculated as described previously1'51 using the equation: Probability (%)= 100 x [1 - 1/(1 + exp(b0 + Zb r /J] where b0 is the constant term, br is the coefficient for any given input and yx is the numerical value for that input. Data analysis The likelihood ratio was defined as sensitivity/ 100 — specificity for an item positively associated with the diagnosis and 100 — sensitivity/specificity for an item with negative correlation. Sensitivity was defined as true positives/true-positives + false-negatives, specificity as true negatives/true negatives + false-positives, positive predictive accuracy as true positives/true positives + false-positives and accuracy as true positives + true negatives/total number of patients. We have made extensive use of the receiver operating characteristic curve. For a full review of the use of this tool see1231. Basically, as used here, the plots were of sensitivity versus 100 — specificity at different diagnostic thresholds. The plots allowed us to estimate the optimum diagnostic thresholds for the two models. The area under each of the curves, and their standard error was calculated according to the method described by Hanley and McNeill1241. This area gives a measure of the ability of each test to correctly rank normal and abnormal cases. It is related to, but not equivalent to, the diagnostic accuracy of the test and can be used to compare statistically two curves. Such comparison was achieved using the method described in a further paper by Hanley and McNeill1251. Kendell Tau correlation of the paired ratings for this calculation was performed using the SPSS program. One way analysis of variance (ANOVA) was used where indicated. Comparisons of diagnostic accuracy between models was achieved using McNemar's test in 2 x 2 contingency tables. Downloaded from by guest on October 15, 2014 The study included consecutive patients attending the Accident and Emergency Department of the Royal Infirmary, Edinburgh, Scotland, with a principal complaint of non-traumatic chest pain. The relevant clinical and electrocardiographic data (see below) was entered onto a purpose-designed proforma at, or very soon after, the patient's presentation. The study included both patients who were admitted and those who were discharged. A total of 1110 patients were recruited during the study period (September 1993 to January 1994). The final diagnosis for these patients was assigned independently by three of us — Consultant Physician (R. L. Kennedy), Research Nurse (A. M. Burton) and Cardiology Registrar (H. S. Fraser). This diagnosis made use of follow-up electrocardiograms, cardiac enzyme studies and other investigations as well as clinical history obtained from a review of the patient's notes. For patients directly discharged from Accident and Emergency, we contacted patients regarding further symptoms and, where necessary, we contacted their General Practitioners and reviewed the notes of any further hospital follow-up. Acute myocardial infarction was diagnosed on the basis of clinical, electrocardiographic and cardiac enzyme (total creatine phosphokinase, CK-MB and lactate dehydrogenase). Cardiac enzyme measurements were carried out using standard enzyme activity assays in the Department of Clinical Biochemistry, the Royal Infirmary, Edinburgh. Unstable angina was defined as either more than two episodes of pain lasting more than 10 min in a 24 h period or more than three episodes in a 48 h period or angina which was associated with the development of new electrocardiographic changes of ischaemia (either at diagnosis or in the subsequent 3 days). The input data items for the logistic regression models were all derived from data available at the time of the patient's presentation. We used up to 54 items which were coded as binary inputs as shown in the Appendix. Hospital, Sheffield. The data were collected and the final diagnosis determined as described above for the Edinburgh data. Informed consent was obtained from all patients participating in the study which was approved by the Medical Ethics Committees in the two participating centres. Logistic regression models for AMI diagnosis 1183 Results Patient data and selection of inputs for logistic regression models The logistic regression models were derived, unless otherwise stated, from 600 consecutive Edinburgh patients. These were 383 men and 217 women with a mean age of 57-4 years (SD 17-1, range 14-92). The final diagnosis in these patients was Q wave myocardial infarction in 86, non-Q wave myocardial infarction in 27, unstable angina in 41, stable angina in 159 and non-cardiac in 287. The models were tested on data from 510 Edinburgh patients — 345 men and 165 women with a mean age of 58-7 years (SD 15-9, range 17-90). Their final diagnosis was Q wave myocardial infarction in 79, non-Q wave myocardial infarction in 28, unstable angina in 44, stable angina in 160 and non-cardiac in 199. The Sheffield patients, which acted as a further test set were 389 men and 273 women with a mean age of 59-9 years (SD 14-5, range 17-92). Their final diagnosis was Q wave myocardial infarction in 147 cases, non-Q wave myocardial infarction in 50, unstable angina in 103, stable angina in 133 and non-cardiac in 229. The likelihood ratios for data items in the Edinburgh data varied widely, as shown in Fig. 1 (items are arranged in descending order of their likelihood ratio, as detailed in Table 1). Figure 2 shows likelihood ratios for the Sheffield cohort with the items arranged in the same order as in Fig. 1. There are obvious differences between the two data sets which may influence the performance of models derived from one data set and tested on the other. The five most discriminatory items for the two data sets were identical: ST elevation, new Q waves, hypoperfusion, ST depression and vomiting. The difference between the two data sets is most obvious with regard to coronary risk factors: smoking (1-23 vs 104) was associated more with a diagnosis of acute myocardial infarction in the Edinburgh patients, while hypertension (1 -49 vs 163), diabetes (111 vs 1 -63) and hyperlipidaemia (101 vs 1-87) were more discriminatory in the Sheffield cohort. Logistic regression models based on likelihood ratio data Using data from 600 Edinburgh patients, a series of logistic regression models was developed by progressively adding data items in decreasing order of their likelihood ratio for acute myocardial infarction diagnosis. These models are described in Table 1. On the 600 training sets, there was a progressive increase in the performance of the models with the addition of data items: thus, areas under the receiver operating characteristic curves were 89-6 ± 1-98, 91 2 ± 1-70, 92-8 ± 1-43, Eur Heart J, Vol. 17, August 1996 Downloaded from by guest on October 15, 2014 Figure 1 The 54 candidate data items (see Appendix) are arranged in decreasing order of likelihood ratio with respect to diagnosis of acute myocardial infarction. The actual items represented by the bars on this figure are given in Table 1. 1184 R. L. Kennedy et al. 31.1 -; •= 1 1 . 1 —; 3 93-5 ±1-31, 94-6 ±1-20, 94-9 ± 1-28, 96-4 ±0-96, respectively for 6, 9, 12, 20, 30, 40 and 53 data items (P<000\, ANOVA). However, when these models were applied to unseen data (510 Edinburgh patients), there was no difference in their overall performance: Areas under the receiver operating characteristic curves were 891 ± 2 1 8 , 89-5 ± 2 1 5 , 89-4 ±2-12 and 88-9 ±2-12, respectively for 6, 12, 30 and 53 data item models. At the optimum diagnostic threshold (15%), the six-factor model had sensitivity, specificity and accuracy of 80-4%, 85-3% and 84-3%, respectively and a sensitivity for diagnosis of Q-wave acute myocardial infarction of 89-9%. The equivalent figures for the 53 factor model were 79-4%, 83-1%, 82-4% and 89-9%, respectively (not significant). Testing of logistic regression models on Sheffield data The performance of four models on Sheffield data is shown in Fig. 3. The areas under the receiver operating characteristic curves were 92-3 ±1-30, 93-6 ± 1 0 5 , 92-8 ± 118 and 87-7 ± 1-70, respectively for 6, 12, 30 and 53 factor models. The difference between the first three models was not significant but there was a significant decline in performance when all 53 factors were used (P<0-0\). At a diagnostic threshold of 15%, the sixEur Heart J, Vol. 17, August 1996 factor model had sensitivity of 91-9%, specificity of 80-2%, accuracy of 83-6% and positive predictive value of 66-3%. The vast majority of patients who were mis-diagnosed by this model were patients with unstable cardiac disease but without acute myocardial infarction, as shown in Fig. 4. When logistic regression models were derived using Sheffield data there was again an increase in performance of the models with progressive addition of data items as shown in Fig. 5. The areas under the receiver operating characteristic curves were 92-8 ± 1-24, 947 ±0-93, 96-3 ± 0-69 and 97-5 ±0-62, respectively for 6, 12, 30 and 53 factor models. We did not test these models on an independent set of Sheffield data. Comparison with a model derived from only electrocardiogram data A model derived only from electrocardiographic data would be more readily applied to retrospective cases than one which incorporates clinical factors, which cannot always be gathered reliably from case notes. The equation for a model which used only electrocardiographic data is shown in Table 2. The area under the curve for this model tested on the 510 Edinburgh cases was 88-7 ± 2-24 which was not significantly different for the six-factor model (including items from the clinical Downloaded from by guest on October 15, 2014 Figure 2 The 54 data items for the Sheffield data set. The items are arranged in exactly the same order as those shown in Fig. 1 for the Edinburgh data. Logistic regression models for AMI diagnosis 1185 Table 1 Constants and coefficients for logistic regression models Number of inputs Input -307 316 1-37 1 95 1 54 0-47 0-68 — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — -3-83 312 109 1-78 1 38 Oil 0-63 0-97 0-86 0-84 — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — 12 -4-50 312 0-96 1-48 1-38 009 0-65 0-95 0-85 0-54 0-95 0-79 0-69 — — — — — — — — — — — — — — — — 20 -504 3-45 0-98 1 61 1-37 -006 0-59 0-84 0-8 0-63 0-88 0-96 0-87 -0-90 -0-31 -0-38 0-86 0-53 007 0-48 -006 — — — — 30 40 -5-42 3-59 1-27 203 1-78 007 0-55 1-34 0-76 0-29 107 0-90 002 - 1-57 -0-27 -013 0-80 0-27 0-24 0-87 — 0-21 -004 -1-38 0-42 -007 216 103 0-80 005 -9-45 -0-47 -0-47 -0-25 -6-30 -011 0-50 -1-03 -106 0-96 -013 807 — — — — — — — — — — — — — — — — — — — — 5-71 3-67 0-90 1-70 1-67 0005 0-25 106 0-65 0-39 0-93 0-94 0 31 - 114 -0-33 -0-08 0-91 0-32 016 0-74 011 011 -119 0-22 -0-26 1-94 0-90 0-96 0-06 -216 -0-28 — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — history) described above. At the optimum threshold, sensitivity, specificity, accuracy and positive predictive value of the electrocardiographic data model were 720%, 93-5%, 890% and 74-8%, respectively. For Sheffield data, the area under the receiver operating characteristic curve for the electrocardiographic data — 53 - 8 12 4-00 1 49 2-27 210 003 0-52 1-39 0-80 0-43 1 35 1 37 -015 - 1-70 - 0 49 -015 0-89 0-43 0-29 0-75 -0-49 -0-32 - 109 0-26 -0003 2-50 113 018 011 - 9 10 -018 -009 -0-33 -6-42 -0-37 0-40 -1-18 -0-75 1-33 011 7-72 - 1-22 -1-72 0-39 0-70 0-72 006 -0-52 - 1 99 2-35 -0-46 -0-22 - 1-27 -0-16 Downloaded from by guest on October 15, 2014 Constant ST elevation New Q waves Hypoperfusion ST depression Vomiting LVF T wave inversion Pain in right arm Nausea Sweating Age 5 Pain 2 Syncope Pain in neck/jaw Age 7 Age 6 Pain in back Pain in left arm Tight pain Hypertension Sharp pain Breathing Retrosternal pain Pain in left chest BBB Dull pain Smoker Family history Duration Pain 1 Old ischaemia Short of breath Age 1 Worse angina Episodic Previous Ml Old Q waves Posture Pain in right chest Pain 5 Tender Age 2 Previous angina Sex Age 3 Pain 3 Not sinus rhythm Palpitations CP major Diabetes Rate Ex-smoker Hyperlipidaemia 6 9 model was 90-1 ± 1 50 compared with 92-2 ± 1 30 for the six-factor model incorporating clinical data (not significant). The sensitivity, specificity, accuracy and positive predictive value of the electrocardiographic data model on Sheffield data were 87-3%, 83-8%, 84-8% and 69-6%, respectively. Eur Heart J, Vol. 17, August 1996 1186 R. L. Kennedy et al. 100 20 40 60 100-specificity 80 100 Figure 3 Receiver operating characteristic curves for the performance of four models on Sheffield data ( , 6; , 12; • • • •, 30; , 53). The models were derived from 600 Edinburgh patients. There was no difference between the performance of models derived using 6, 12 and 30 data items. However, the use of 53 data items was associated with a significant decline in performance (P<0-01, see text). Downloaded from by guest on October 15, 2014 100 I 80 3 o e 60 S J 3 I •a 4 0 ^ bjj is & 20 t Q wave Non-Q Unstable Stable Other Figure 4 Performance of a six-factor model on Sheffield data. The y-axis shows the percentage of patients in each category diagnosed as having acute myocardial infarction by the model. Patients are divided into groups according to whether they have Q wave or non-Q wave acute myocardial infarction, unstable or stable angina or other diagnoses. Comparison with published models In this part of the study, the performance of two previously published models was examined. We have compared the performance of the models using the published coefficients with new models derived using Eur Heart J, Vol. 17, August 1996 the same data inputs but with coefficient calculated from our training set of 600 cases. The performance of these models was evaluated using the 662 cases from Sheffield. The model described by Tierney et al.[l3] makes use of two electrocardiographic and two clinical factors, while the model described by Selker et alJl5] is more complex Logistic regression models for AMI diagnosis 1187 100 r 20 40 60 100-specificity 80 100 Figure 5 Receiver operating characteristic curves for logistic regression models derived from, and tested on, the Sheffield data. There was a significant increase in diagnostic performance with increasing numbers of data inputs (P<001, ANOVA). ( , 6; , 12; • • •, 31; , 53). Inputs Table 3 Logistic regression models based on previously published models Coefficients Coefficient Input Constant BBB New Q waves Not sinus rhythm ST depression ST elevation T wave inversion Tierney Selker -418 2-33 2-19 -3-93 0-31 0-62 — -3-37 1-73 0-90 -0-80 1-70 3-52 117 with 12 inputs. The coefficients for these models are to be found in the relevant papers, while the coefficients derived from our cases are shown in Table 3. Areas under the receiver operating characteristic curves for the Tierney model were 85-1 ± 169 using the published coefficients and 885 ± 1 -53 for the calculated coefficients (not significant). Sensitivity for diagnosis of acute myocardial infarction (using the calculated coefficients) was only 54-9% compared with 91-9% for the six-factor model derived from Edinburgh data (/><0-001). The specificity, accuracy and positive predictive value for the model were 97-6%, 84-9% and 908%, respectively. For the Selker model, the areas under the receiver operating characteristic curves were 85-2 ± 1-52 and 89-4 ± 1-39, respectively using published and calculated coefficients(P<005). With a sensitivity, specificity, accuracy and positive predictive value of 87-3%, 804%, 82-5% and 65-3%, respectively, the performance of this model was very similar to that of the models which we have derived de novo for this study. Constant ST elevation New Q waves Sweating Previous Ml Presence of chest pain Pain major symptom Sex Age 40 or less Age >50 Male over 50 years ST depression T waves elevated T waves inverted T wave + ST changes 1-4? lO:^_ .— —. —. — 1-23 0-88 0-71 - 1 44 0-67 -0-43 0-99 009 1-13 - 0-314 Discussion We have described the derivation and testing of logistic regression models for the early diagnosis of acute myocardial infarction. The present study confirms the findings of others that this technique is appropriate for the analysis of clinical and electrocardiographic data from patients presenting with acute chest pain/suspected acute myocardial infarction'13~15*221. Logistic regression models have also been described for estimating probability of ischaemic heart disease in patients referred for coronary angiography126' and for prognostic stratification of patients following acute myocardial Eur Heart J, Vol. 17, August 1996 Downloaded from by guest on October 15, 2014 Table 2 Logistic regression model derived using only electrocardiographic data 1188 R. L. Kennedy et al. infarction'271. In this study, we have examined in detail the influence of varying the data input items on the performance and portability of models. When tested on the training data, the performance of logistic regression models increased with increasing number of data input items. This relationship was demonstrated with data from both centres involved in the study. However, simple models comprising only six data items performed at least as well as more complex models on unseen, test data. The optimum diagnostic thresholds for the models were determined using receiver operating characteristic curve analysis, and were identical for the two populations of patients included in the study. Optimal performance of simple models may only be possible if the models are specifically derived for use in a given centre, using the most predictive factors for patients presenting to that centre. Eur Heart J, Vol. 17, August 1996 The time-insensitive predictive instrument (TIPI) developed by Selker et a/.(15] made use of seven data items available in the emergency room. Like the models described here, the time-insensitive predictive instrument included both clinical and electrocardiographic data but the model was slightly more complex than our six-factor model because a number of the items were combined, giving rise to a total of 12 inputs. The model described by Selker et al. was developed using data from 3453 patients in six different hospitals and was tested on a further 2320 cases. The area under the receiver operating characteristic curve in Selker's study was 088, which is similar to the values obtained for our models. Patients could be divided into four groups using the timeinsensitive predictive instrument and the incidence of acute ischaemic heart disease increased progressively Downloaded from by guest on October 15, 2014 The potential inputs for our models were ranked in order of their likelihood ratio (with respect to diagnosis of acute myocardial infarction) for the cohorts of patients from two different centres. Although there was a strong correlation between the likelihood ratios for the data items in the two cohorts, there were also important differences which account for the impaired performance of more complex models on unseen data from a centre other than that in which the models were derived. The difference in likelihood ratios was most apparent with cardiovascular risk factors such as smoking, diabetes and hyperlipidaemia. It is not clear whether these variations reflect differences in the pathogenesis of coronary artery disease in the two populations. Conclusions on the comparative epidemiology of IHD would be impossible to draw from the numbers involved in this study. The most discriminatory items were, however, identical between the two data sets, suggesting that simple diagnostic models may be more portable than more complex models. Although the admission electrocardiogram is highly predictive of acute myocardial infarction, it is well known that diagnostic changes are only apparent at presentation in about half of the patients who ultimately prove to have acute myocardial infarction'1 •2-28\ It is possible, but not documented, that this proportion may be diminishing as public awareness and advances in pre-hospital care are leading to patients presenting to hospital earlier after the onset of symptoms. The 12-lead electrocardiogram at presentation, and in the hours following admission, is also important prognostically — a normal electrocardiogram being associated with a very low risk of death or other major cardiovascular event[4'5>28). There was very little difference, on either test set of data, between the performance of our model which relied solely on electrocardiogram inputs and the models which also incorporated clinical data items. This is not surprising since three of our four most discriminatory data times were derived from the admission electrocardiogram. The electrocardiogram data model was much more sensitive, but had lower specificity and overall accuracy, on Sheffield data compared with Edinburgh data. This probably reflects the higher inci- dence of electrocardiographs abnormalities at presentation in the Sheffield cohort. The reason for this is not clear. It is not due to a difference in the timing of presentation — the median time from start of symptoms to presentation was 3 h in both cohorts. In our previous studies'17181, we have investigated the performance of models derived using only clinical data. Such models perform much less well than models which make use of electrocardiogram data: diagnostic performance typically declines by 10-15%, and the models are less portable than those presented here. In each of the two patient populations studied here, three of the five most predictive factors were derived from the electrocardiogram at presentation. We compared the performance of our logistic regression models with previously published models. The model described by Tierney et a/.'l3) is a very simple one comprising two items each of electrocardiographic and clinical data. The area under the receiver operating characteristic curve for Tierney's test data was 850, which is virtually identical to that for our data when the published coefficients were used. The model performed slightly, but not significantly, better when the coefficients were recalculated using our own data. Tierney et al. found that their model was significantly more specific and accurate than the physicians involved in their study. We have not presented data on the diagnostic performance of physicians in the present study. Tierney et al. reported a positive predictive value of 490% for their model, which is much less than the 90-8% described with identical data inputs in our study. It seems reasonable that a model which relies heavily on electrocardiogram inputs will be very specific but rather less sensitive than a model which takes greater account of clinical factors. The negative predictive value in Tierney's study was high at 970%, compared with 836% for the equivalent model in this study. These discrepancies must reflect differences in the patient population under study. A recent study showed that the model described by Tierney et al. and another logistic regression model described by Pozen et a/.'141, performed poorly on data from pre-hospital patients in Rotterdam'291. As with the present study, these authors found that a model derived from their own data performed markedly better. Logistic regression models for AMI diagnosis Logistic regression has a major advantage in that it is a simple and widely used statistic technique which can be applied readily to new data using standard statistical packages. The equation derived could easily be implemented on a hand-held programmable calculator, and the value obtained interpreted as a probability of acute myocardial infarction. If models consisting of only a few data items perform, as suggested here, as well as more complex models then it would be relatively simple to optimize a logistic regression equation for use in individual clinical settings. Many of the more recently described algorithms for early diagnosis of acute myocardial infarction are relatively complex, difficult to adapt and make use of many (up to 50 or more) data input items. The methods used include decision trees'12', Bayesian inference12'1 and artificial neural networks'17'19'. While these methods may offer advantages in terms of operator feedback, and in the case of neural networks, more sophisticated pattern recognition capability, their advantage over a simple technique such as logistic regression, needs to be demonstrated before they are widely applied. Also, it is important to be clear that the number and nature of the input data items being used is appropriate. In conclusion, logistic regression is a simple method with which to derive predictive models for patients presenting with suspected acute myocardial infarction. A relatively small number of data input items may yield optimal performance. The electrocardiogram inputs are by far the most important and models with only electrocardiogram data perform almost as well as those with clinical data items in addition. Care should be exercised in using published logistic regression models, although these models may help in selecting data inputs for locally-derived models. The achievement of complex, computer-based algorithms should be viewed with caution and compared carefully with the performance of simple models of the sort described in the present study. This project was supported by the Scottish Office Home and Health Department (Grant No. K/MRS/56/C2066) for collection of data in Edinburgh and by Altim Medical Systems Limited, Belper, Derbyshire, U.K. for collection of data in Sheffield. We are very grateful to the Physicians, Accident and Emergency Consultants and General Practitioners in both centres who allowed us to include their patients in the study and to Ms Janine Robinson who helped us with analysis of the data. The study would not have been possible without the active co-operation of the Accident and Emergency doctors who helped us by filling out proformas and by providing follow-up data. We would particularly like to acknowledge the following — Drs D Begg, M. Bell, S. Cowell, A. Dodds, G Foster, B. Halliday, I. Hay, R. Hill, J. Lennox, J. Poole, M. Strachan, G. Watt, A. Campbell, G. Campbell-Hewson, A. de Beaux, U. Guly, R. MacCallum, S. Maurice, R. Mitchell, M. Mnalane, P. Stuart, J. Watters, R. Casasola, J. Dalgliesh, C. Fan, E. Forrest, A. McGhee, D. Newby, R. Parris, J. Rigden, R. Rintoul and S. Robertson. References [1] Brush JE, Brand DA, Acampora D, Chalmer B, Wackers FJ. Use of the initial electrocardiogram to predict in-hospital complications of acute myocardial infarction. N Engl J Med 1985; 312: 1137-41. [2] Stark ME, Vacek JL. The initial electrocardiogram during admission for myocardial infarction. Use as a predictor of clinical course and facility utilization. Arch Intern Med 1987; 147: 843-6. [3] Yusuf S, Pearson M, Parish S, Ramsdale D, Rossi P, Sleight P. The entry ECG in the early diagnosis and prognostic stratification of patients with suspected acute myocardial infarction. Eur Heart J 1984; 5: 690-6. [4] Gheorghiade M, Anderson J, Rosman H et al. Risk identification at the time of admission to coronary care unit in patients with suspected myocardial infarction. Am Heart J 1988; 116: 1212-17. [5] Karlson BW, Herlitz J, Wiklund O, Richter A, Hjalmarson A. Early prediction of acute myocardial infarction from clinical history, examination and electrocardiogram in the emergency room. Am J Cardiol 1991; 68: 171-5. Eur Heart J, Vol. 17, August 1996 Downloaded from by guest on October 15, 2014 from 1-6% in the low probability group to 81-6% in the high probability group. It is not surprising, that when tested on our Sheffield data using the published coefficients, the time-insensitive predictive instrument did not perform quite as it did on Selker's data. However, the area under the receiver operating characteristic curve improved (/><005) when the model was re-derived using our Edinburgh data. This suggests that the selection of the variables to be included in the time-insensitive predictive instrument is appropriate. The authors are not explicit about how these variables were selected from the 120 candidate variables, but state that this selection was based on known clinical importance, predictive power and reliability when abstracted from patient's case notes. We have not attempted to test the impact that the use of logistic regression models might have on clinical practice. It is important before embarking on such a study to optimize the model being tested and to estimate the size of study which will be required to show a significant impact. This can be done conveniently using receiver operating characteristic analysis'241. Thus, we can estimate from data we have previously collected on the diagnostic performance of Accident and Emergency doctors'17181 that a study of around 1600 patients would be necessary to demonstrate a significant difference (95% probability) between doctors and an algorithm of the sort described here. It is, of course, impossible to estimate the exact impact the use of a decision support aid would have on clinical practice but it is important to be clear that an algorithm under test performs accurately and reliably on the patient population under study as well as showing that it has the capacity, not only to model, but also to enhance clinical practice. The potential benefits have to be weighed against the cost of using a computer-based device, for example, in our previous studies'17181, entering data into a computer program could take up to 10 min extra per patient. It is not clear at present whether the advantages previously shown to be associated with the use of diagnostic algorithms'1 li20>211 arise simply by ensuring that clinical data are analysed in a standard fashion, or whether additional information is generated allowing the doctors to identify particular subgroups of patients. A formal trial of any algorithm should correct for the benefits associated with standardized history taking, which is an integral part of using such an algorithm. 1189 1190 R. L. Kennedy et al. Eur Heart J, Vol. 17, August 1996 [18] [19] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] Intelligence in Medicine, Maastricht, Netherlands. 1991; 119-28. Harrison RF, Marshall SJ, Kennedy RL. Neural networks, heart attack, and Bayesian decisions: an application of the Boltzmann perceptron network. J Art Neural Networks 1994; 1- 183-202. Baxt WG. Use of an artificial neural network for the diagnosis of myocardial infarction. Ann Intern Med 1991; 115: 843-8. Pozen MW, D'Agostino RB, Selker HP, Sytkowski PA, Hood WB. A predictive instrument to improve coronary-care-unit admission practices in acute ischaemic heart disease. A prospective multicenter clinical trial. N Engl J Med 1984; 310: 1273-8. Jonsbu J, Aase O, Rollag A, Liestol K, Eriksen J. Prospective evaluation of an EDB-based diagnostic program to be used in patients admitted to hospital with acute chest pain. Eur Heart J 1993; 14: 441-6. Long WJ, Griffith JL, Selker HP, D'Agostino RB. A comparison of logistic regression to decision-tree induction in a medical domain. Comp Biomed Res 1993; 26: 74-97. Zweig MH, Campbell G. Receiver-Operating Characteristic Curve (ROC) plots: A fundamental evaluation tool in clinical medicine. Clin Chem 1993; 39- 561-7 Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiol 1982; 143: 29-36. Hanley JA, McNeil BJ. A method for comparing the areas under receiver operating characteristic curves derived from the same cases. Radiol 1983; 148: 839^3. Detrano R, Bobbio M, Olson H et al. Computer probability estimates of angiographic coronary artery disease: Transportability and comparison with a cardiologist's estimates. Comp Biomed Res 1992; 25: 468-85. Parsons RW, Jamrozik KD, Hobbs MST, Thompson DL. Early identification of patients at low risk of death after myocardial infarction and potentially suitable for early hospital discharge. Br Med J 1994; 308: 1006-10. Lee TH, Cook EF, Weisberg MC, Sargent RK, Wilson C, Goldman L. Acute chest pain in the emergency room. Identification of low-risk patients. Arch Intern Med 1985; 145: 65-9. Grijseels EWM, Deckers JW, Hoes AW et al. Pre-hospital tnage of patients with suspected myocardial infarction. Evaluation of previously developed algorithms and new proposals. Eur Heart J 1995; 16: 325-32. Downloaded from by guest on October 15, 2014 [6] Kennedy RL, Harrison RF, Marshall SJ. Do we need computer-based decision support for the diagnosis of acute chest pain? J Royal Soc Med 1993; 86: 31-4. [7] Pipberger HV, Klingeman JD, Cosma J. Computer evaluation of statistical properties of clinical information in the differential diagnosis of chest pain. Meth Inform Med 1968, 7. 79-93 [8] Lubsen J, Pool T, van der Does E. A practical device for the application of a diagnostic or prognostic function. Meth Inform Med 1978; 17: 127-9. [9] Boissel JP, Vanerie R. Systeme d'aide a la decision a la phase aigue de l'infarctus du myocarde. Proceedings of the International Symposium on Medical Informatics, Toulouse, France 1977; 571-83. [10] Aase O, Jonsbu J, Liestol K, Rollag A, Eriksen J. Decision support by computer analysis of selected case history variables in the emergency room among patients with acute chest pain Eur Heart J 1993, 14: 433^40. [11] Goldman L, Weinbeg M, Weisberg M C e l al. A computerdenved protocol to aid in the diagnosis of emergency room patients with acute chest pain. N Engl J Med 1982; 307: 588-96. [12] Goldman L, Cook EF, Brand DA et al. A computer protocol to predict myocardial infarction in emergency department patients with chest pain. N Engl J Med 1988; 318: 797-803. [13] Tierney WM, Roth BJ, Psaty B el al. Predictors of myocardial infarction in emergency room patients. Crit Care Med 1985, 13: 526-31. [14] Pozen MW, D'Agostino RB, Mitchell JB el al. The usefulness of a predictive instrument to reduce inappropriate admissions to the coronary care unit. Ann Intern Med 1980; 92: 238-42 [15] Selker HP, Griffith J, D'Agostino RB. A tool for judging coronary care unit admission appropriateness, valid for both real-time and retrospective use. A time-insensitive predictive instrument (TIPI) for acute cardiac ischaemia: A multicenter study. Med Care 1991; 29: 610-27 [16] Emerson PA, Wyatt J, Dillistone L, Crichton N, Russell NJ. The development of ACORN, an expert system enabling nurses to make decisions about patients with chest pain in an accident and emergency department. Proceedings of the Conference on Medical Informatics in Clinical Medicine, Nottingham, U.K. 1988; 37-40. [17] Harrison RF, Marshall SJ, Kennedy RL. A connectionist approach to the early diagnosis of myocardial infarction. Proceedings of the 3rd European Conference on Artificial Logistic regression models for AMI diagnosis 1191 Appendix Binary inputs to the ANN Input node Age in years (under 30, 30-39, 40-49, 50-59, 60-69, 70-79, 80 and over) Smoker Ex-smoker Family history of ischaemic heart disease Diabetes mellitus Hypertension Hyperlipidaemia Is chest pain the major symptom'' Central chest pain Pain in left side of chest Pain in right side of chest Pain radiates to back Pain radiates to left arm, neck or jaw Pain radiates to right arm Worse on inspiration Pain related to posture Chest wall tenderness Pain described as sharp or stabbing Pain described as tight, heavy, gripping or crushing Sweating Short of breath Nausea Vomiting Syncope Episodic pain Hours since 1st symptom (0-5, 6-10, 11-20, 21-40, over 40) Hours of pain at presentation (0-5, 6-10, 11-20, 21-40, 41-80, over 80) History of angina Previous myocardial infarction Worse than usual angina/similar to previous acute myocardial infarction Fine crackles suggestive of pulmonary oedema Added heart sounds Signs of hypoperfusion New ST-segment elevation New pathological Q waves ST segment or T-wave changes suggestive of ischaemia Bundle branch block Old electrocardiogram features of myocardial infarction Electrocardiogram signs of ischaemia known to be old Downloaded from by guest on October 15, 2014 1-7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32-36 37-42 43 44 45 46 47 48 49 50 51 52 53 54 Parameter Eur Heart J, Vol. 17, August 1996
© Copyright 2024