" !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Survival treated as the class attribute As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in Connectionist-Based Information Systems. Singapore: Springer-Verlag. !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! 1. Title: Echocardiogram Data 2. Source Information: -- Donor: Steven Salzberg (salzberg@cs.jhu.edu) -- Collector: -- Dr. Evlin Kinney -- The Reed Institute -- P.O. Box 402603 -- Maimi, FL 33140-0603 -- Date Received: 28 February 1989 3. Past Usage: -- 1. Salzberg, S. (1988). Exemplar-based learning: Theory and implementation (Technical Report TR-10-88). Harvard University, Center for Research in Computing Technology, Aiken Computation Laboratory (33 Oxford Street; Cambridge, MA 02138). -- Steve applied his EACH program to predict survival (i.e., life or death), did not use the wall-motion attribute, and recorded 87 correct and 29 incorrect in an incremental application to this database. He also showed that, by tuning EACH to this domain, EACH was able to derive (non-incrementally) a set of 28 hyper-rectangles that could perfectly classify 119 instances. -- 2. Kan, G., Visser, C., Kooler, J., & Dunning, A. (1986). Short and long term predictive value of wall motion score in acute myocardial infarction. British Heart Journal, 56, 422-427. -- They predicted the same variable (whether patients will live one year after a heart attack) using a different set of 345 instances. Their statistical test recorded a 61% accuracy in predicting that a patient will die (post-hoc fit). -- 3. Elvin Kinney (in communication with Steven Salzberg) reported that a Cox regression application recorded a 60% accuracy in predicting that a patient will die. 4. Relevant Information: -- All the patients suffered heart attacks at some point in the past. Some are still alive and some are not. 
   The survival and still-alive variables, when taken together, indicate whether a patient survived for at least one year following the heart attack. The problem addressed by past researchers was to predict from the other variables whether or not the patient will survive at least one year. The most difficult part of this problem is correctly predicting that the patient will NOT survive. (Part of the difficulty seems to be the size of the data set.)

5. Number of Instances: 132

6. Number of Attributes: 13 (all numeric-valued)

7. Attribute Information:
   1. survival -- the number of months the patient survived (or has survived, if the patient is still alive). Because the patients had their heart attacks at different times, some patients may have survived less than one year and still be alive. Check the second variable to confirm this. Such patients cannot be used for the prediction task mentioned above.
   2. still-alive -- a binary variable. 0 = dead at end of survival period, 1 = still alive
   3. age-at-heart-attack -- age in years when the heart attack occurred
   4. pericardial-effusion -- binary. Pericardial effusion is fluid around the heart. 0 = no fluid, 1 = fluid
   5. fractional-shortening -- a measure of contractility around the heart; lower numbers are increasingly abnormal
   6. epss -- E-point septal separation, another measure of contractility. Larger numbers are increasingly abnormal.
   7. lvdd -- left ventricular end-diastolic dimension. This is a measure of the size of the heart at end-diastole. Large hearts tend to be sick hearts.
   8. wall-motion-score -- a measure of how the segments of the left ventricle are moving
   9. wall-motion-index -- equals wall-motion-score divided by the number of segments seen. Usually 12-13 segments are seen in an echocardiogram. Use this variable INSTEAD of the wall motion score.
   10. mult -- a derived variable which can be ignored
   11. name -- the name of the patient (I have replaced them with \"name\")
   12. group -- meaningless, ignore it
   13. 
alive-at-1 -- Boolean-valued. Derived from the first two attributes. 0 means patient was either dead after 1 year or had been followed for less than 1 year. 1 means patient was alive at 1 year.

8. Missing Attribute Values: (denoted by \"?\")

   Attribute #:   Number of Missing Values: (total: 132)
   ------------   -------------------------
    1              2
    2              1
    3              5
    4              1
    5              8
    6             15
    7             11
    8              4
    9              1
   10              4
   11              0
   12             22
   13             58

9. Distribution of attribute number 2: still-alive

   Value   Number of instances with this value
   -----   -----------------------------------
   0        88 (dead)
   1        43 (alive)
   ?         1
   Total   132

10. Distribution of attribute number 13: alive-at-1

   Value   Number of instances with this value
   -----   -----------------------------------
   0        50
   1        24
   ?        58
   Total   132
" "0" "'echoMonths'" 0.26 0.38 0.26 0.253 0.16 0.26 0.23 0.33 0.34 0.14 0.13 0.45 0.33 0.15 0.12 0.25 0.26 0.07 0.09 0.22 0.15 0.18 0.23 0.17 0.19 0.3 0.3 nan nan 0.21 0.15 0.17 nan 0.4 nan 0.61 nan 0.06 0.51 0.41 0.35 0.27 0.15 0.33 0.44 0.09 0.12 0.03 nan 0.04 0.27 0.24 0.3 0.01 0.29 0.15 0.13 0.1 0.29 0.17 0.12 0.187 0.13 0.11 0.16 0.14 0.25 0.36 0.06 0.225 0.25 0.12 0.29 0.06 0.217 0.22 0.26 0.2 0.2 0.06 0.07 0.25 0.05 nan 0.14 0.05 0.16 0.28 0.18 0.155 0.3 0.344 0.272 0.25 0.2 0.5 0.16 0.17 0.17 0.2 0.38 0.258 0.3 0.17 0.228 0.036 0.23 0.26 0.22 0.24 0.27 0.4 0.29 0.19 0.26 0.43 0.24 0.23 0.15 0.12 0.18 0.19 0.15 0.09 0.14 0.24 0.28 0.2 0.14 0.15 4.6 4.1 3.42 4.603 5.75 4.31 5.43 5.25 5.09 4.49 4.23 3.6 4 3.73 5.8 4.29 4.65 5.2 5.819 5.4 5.39 5.46 6.06 4.65 3.48 3.85 4.17 nan nan 4.16 5.05 5.32 nan 3.1 nan 4.07 5.31 nan 3.88 4.36 3.63 4.49 4.27 3.59 3.96 nan nan 6.29 nan 5 nan 5.26 3.49 5.65 6.15 4.57 4.37 5.3 4.41 5.15 6.78 5.02 4.96 4.68 5.26 4.75 5.57 5.78 5.62 5.2 4.72 4.31 4.75 5.95 4.54 4.85 4.77 4.58 5.2 6.74 4.16 4.48 4.44 nan 6.21 4.14 5.25 4.48 4.56 5.16 4.36 4.04 5.36 3.87 4.56 3.42 5.47 6.73 4.69 4.23 4.55 4.87 3.52 5.49 4.29 4.12 6.23 4.42 3.92 4.38 4.06 5.36 4.77 6.63 4.38 4.79 5.86 5.49 4.17 2.32 4.48 5.04 3.66 4.96 5.16 4.72 5.47 
5.05 4.36 4.51 14 14 14 16 18 12 22.5 14 16 15.5 18 16 14 14 11.67 14 18 24 8 27 19.5 13.83 7.5 8 10 10 14 2 nan 14 10 14 6 12 14 13 5 21.5 15 nan 11 22 13 14 17.5 12 9 17 23 nan 9 18 14 39 14 13 12.33 23 14 10.5 16.67 13 17.83 11 11 10 5.5 12 13.67 24 11 15 13 21.5 16.5 15 21 14 8 12 18 11 15 28 11.5 15.5 11 22 13.5 13 14 9 12.67 18 12.5 18 16 26.08 10 12 10 11 18.16 13.5 11 13.5 14 14 11 22 12 12 9 19.5 9 10 21.5 12 14 16.5 11 19 10 13 14 12 11 14.5 15 15.5 1 1.7 1 1.45 2.25 1 1.875 1 1.14 1.19 1.8 1.14 1 1 2.33 1 1.64 2 1.333 2.25 1.625 1.38 1.5 1 1.11 1.667 1 1 1 1.56 1 1.17 3 1 1.17 1.625 1 2.15 1.67 1 1.222 2 1.3 1 1.45 2 1.25 1.31 2.3 nan 1.5 1.38 1 3 1 1.08 1.37 2.3 1.167 1.05 1.39 1.18 1.37 1 1 2.5 1.1 1 1.367 2.18 1 1.67 1.08 2.39 1.18 1.15 2.1 1 1 1.09 1.5 1 1.36 2.33 1.15 1.41 1 1.83 1.04 1 1.27 1 1.06 1.5 1.04 1.5 1.45 2.01 1 1 1 1 1.51 1.5 1 1.23 1.4 1 1 2.2 1 1 1 1.95 1 1 1.95 1.2 1.27 1.375 1.375 1.73 1 1.08 1.27 1 1.1 1.21 1.36 1.409 0 0 0 0 0 0 0 0 0 0 1 0 0 0 1 0 1 1 0 1 1 1 1 1 nan nan nan nan nan 0 nan nan nan nan 0 0 0 1 0 nan 0 0 0 0 nan nan 0 0 1 1 1 nan 0 1 0 0 0 0 nan 0 nan 0 nan nan 0 nan 0 0 0 1 0 0 0 nan 0 nan 1 0 nan nan 0 nan 1 1 nan nan nan 0 nan nan nan 0 nan nan 0 nan 1 1 nan 0 0 nan 1 0 0 nan 1 1 nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan 0 0 0 0 1 0 0 0 0 0 1 0 0 0 1 0 1 1 1 1 1 1 1 1 0 0 0 0 1 0 1 1 1 0 0 0 0 1 0 0 0 0 0 0 1 0 1 1 1 1 1 0 0 1 0 0 0 0 0 0 0 0 1 0 0 1 0 0 0 1 0 0 0 0 0 1 1 0 0 0 0 0 1 1 0 1 1 0 1 1 0 0 0 0 1 0 1 1 0 0 0 0 1 0 0 0 1 1 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 71 72 55 60 57 68 62 60 46 54 77 62 73 60 62 55 69 62 66 66 69 85 73 71 64 54 35 55 75 55 65 52 -2147483648 47 63 61 63 65 68 80 54 70 79 56 67 64 81 59 63 56 61 57 58 60 66 63 57 70 68 79 73 72 59 67 51 50 70 65 78 86 56 60 59 50 54 68 -2147483648 64 63 65 54 62 78 61 52 73 70 55 60 67 64 59 46 63 74 59 65 58 53 66 70 62 63 59 57 57 78 62 62 66 61 59 57 62 -2147483648 54 62 -2147483648 64 57 61 
61 48 -2147483648 61 64 64 69 57 62 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 1 1 1 0 1 0 1 0 0 0 0 0 0 0 1 0 1 0 0 0 0 1 0 0 0 0 1 0 0 1 0 0 0 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 0 1 0 0 1 0 1 0 0 0 0 0 0 0 0 9 6 4 12 22 5 31 8 0 13 16 9 6 10 23 12 11 20 17 15 12 19 12 0 5 7 5 7 -2147483648 4 -2147483648 17 12 5 10 13 -2147483648 23 -2147483648 5 9 4 17 -2147483648 9 -2147483648 -2147483648 21 -2147483648 14 -2147483648 14 9 24 15 13 18 9 -2147483648 11 -2147483648 12 16 10 13 11 9 8 16 12 11 10 7 30 17 21 19 7 5 23 16 6 10 -2147483648 25 14 19 5 8 11 6 9 16 5 4 9 8 28 -2147483648 -2147483648 0 11 6 14 9 7 40 7 12 13 9 9 9 28 0 9 28 19 6 0 0 13 12 6 25 12 5 7 16 0 11 19 16 57 19 26 13 50 19 25 10 52 52 44 0 24 0 0 22 1 0 0 0 5 48 29 29 29 0 36 1 1 3 27 35 26 16 1 19 31 32 16 40 46 2 37 19 20 0 2 7 10 12 1 10 45 22 53 38 26 9 26 0 12 49 0 49 47 41 0 33 29 41 26 15 0 0 12 32 32 27 23 0 0 34 1 21 55 15 0 35 53 33 33 40 33 5 4 31 33 22 25 1 24 25 24 0 3 27 13 36 25 27 34 37 34 28 28 17 38 31 12 36 17 21 7 41 36 22 20 "still_alive" "age" "pericardial" "fractional" "epss" "lvdd" "wall_score" "wall_index" "alive_at_1" "class" "int0" "double1" "int2" "double3" "int4" "nominal:0,1" "numeric" "nominal:0,1" "numeric" "numeric" "numeric" "numeric" "numeric" "nominal:0,1" "numeric"