" !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Case number deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in Connectionist-Based Information Systems. Singapore: Springer-Verlag. !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Name: Pharynx (A clinical Trial in the Trt. of Carcinoma of the Oropharynx). SIZE: 195 observations, 13 variables. DESCRIPTIVE ABSTRACT: The .dat file gives the data for a part of a large clinical trial carried out by the Radiation Therapy Oncology Group in the United States. The full study included patients with squamous carcinoma of 15 sites in the mouth and throat, with 16 participating institutions, though only data on three sites in the oropharynx reported by the six largest institutions are considered here. Patients entering the study were randomly assigned to one of two treatment groups, radiation therapy alone or radiation therapy together with a chemotherapeutic agent. One objective of the study was to compare the two treatment policies with respect to patient survival. SOURCE: The Statistical Analysis of Failure Time Data, by JD Kalbfleisch & RL Prentice, (1980), Published by John Wiley & Sons VARIABLE DESCRIPTIONS: The data are in free format. That is, at least one blank space separates each variable in the .dat file. 
The variables are as follows:

Case: Case Number
Inst: Participating Institution
sex: 1=male, 2=female
Treatment: 1=standard, 2=test
Grade: 1=well differentiated, 2=moderately differentiated, 3=poorly differentiated, 9=missing
Age: In years at time of diagnosis
Condition: 1=no disability, 2=restricted work, 3=requires assistance with self care, 4=bed confined, 9=missing
Site: 1=faucial arch, 2=tonsillar fossa, 3=posterior pillar, 4=pharyngeal tongue, 5=posterior wall
T staging: 1=primary tumor measuring 2 cm or less in largest diameter, 2=primary tumor measuring 2 cm to 4 cm in largest diameter with minimal infiltration in depth, 3=primary tumor measuring more than 4 cm, 4=massive invasive tumor
N staging: 0=no clinical evidence of node metastases, 1=single positive node 3 cm or less in diameter, not fixed, 2=single positive node more than 3 cm in diameter, not fixed, 3=multiple positive nodes or fixed positive nodes
Entry Date: Date of study entry: Day of year and year
Status: 0=censored, 1=dead
Time: Survival time in days from day of diagnosis

STORY BEHIND THE DATA:
Approximately 30% of the survival times are censored owing primarily to patients surviving to the time of analysis. Some patients were lost to follow-up because the patient moved or transferred to an institution not participating in the study, though these cases were relatively rare. From a statistical point of view, an important feature of these data is the considerable lack of homogeneity between individuals being studied. Of course, as part of the study design, certain criteria for patient eligibility had to be met which eliminated extremes in the extent of disease, but still many factors are not controlled. This study included measurements of many covariates which would be expected to relate to survival experience. Six such variables are given in the data (sex, T staging, N staging, age, general condition, and grade).
The site of the primary tumor and possible differences between participating institutions require consideration as well. The T,N staging classification gives a measure of the extent of the tumor at the primary site and at regional lymph nodes. T=1 refers to a small primary tumor, 2 centimeters or less in largest diameter, whereas T=4 is a massive tumor with extension to adjoining tissue. T=2 and T=3 refer to intermediate cases. N=0 means there is no clinical evidence of a lymph node metastasis, and N=1, N=2, N=3 indicate, in increasing magnitude, the extent of existing lymph node involvement. Patients with classifications T=1,N=0; T=1,N=1; T=2,N=0; or T=2,N=1, or with distant metastases, were excluded from the study. The variable general condition gives a measure of the functional capacity of the patient at the time of diagnosis (1 refers to no disability whereas 4 denotes bed confinement; 2 and 3 measure intermediate levels). The variable grade is a measure of the degree of differentiation of the tumor (the degree to which the tumor cell resembles the host cell) from 1 (well differentiated) to 3 (poorly differentiated).

In addition to the primary question of whether the combined treatment mode is preferable to the conventional radiation therapy, it is of considerable interest to determine the extent to which the several covariates relate to subsequent survival. It is also imperative in answering the primary question to adjust the survival times for possible imbalance that may be present in the study with regard to the other covariates. Such problems are similar to those encountered in the classical theory of linear regression and the analysis of covariance. Again, the need to accommodate censoring is an important distinguishing point. In many situations it is also important to develop nonparametric and robust procedures, since there is frequently little empirical or theoretical work to support a particular family of failure time distributions.
" "0" "'pharynx'" 1 1 2 1 2 1 2 3 2 2 2 2 1 2 3 2 3 2 3 2 2 2 1 1 3 2 2 2 2 2 2 2 2 1 2 2 2 2 2 2 2 2 2 1 1 2 2 1 1 1 1 1 3 3 1 2 2 3 1 1 1 2 2 2 3 3 3 1 1 1 2 3 2 2 2 2 2 3 1 2 3 2 2 1 2 2 1 2 2 1 1 1 2 3 3 1 1 2 2 1 1 2 2 2 3 2 2 2 2 2 3 1 3 2 1 1 3 2 2 2 3 1 3 2 2 2 3 2 2 2 1 3 2 2 3 nan 2 2 2 2 3 2 2 2 1 2 3 2 2 2 1 3 3 2 2 2 1 2 2 1 3 2 2 2 1 2 1 1 2 2 3 3 1 1 2 2 2 3 2 1 3 2 2 2 2 2 2 2 2 1 1 2 2 3 2 1 1 2 1 1 1 1 1 2 2 1 1 1 3 1 1 1 1 3 1 1 3 1 2 1 1 1 1 2 1 3 1 2 2 2 1 1 1 2 1 1 1 1 1 1 2 3 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 2 1 1 3 1 2 1 1 1 1 2 1 1 2 2 1 1 1 1 1 2 2 1 1 1 1 1 1 2 2 2 2 1 1 1 2 1 2 1 1 1 1 1 2 1 1 1 1 2 1 1 2 1 2 2 0 1 1 1 1 1 2 1 1 1 1 1 1 1 1 2 1 1 nan 2 1 2 1 1 1 1 1 1 1 1 1 1 2 1 2 4 1 1 1 1 1 2 2 1 2 1 1 2 2 1 1 1 2 1 2 2 2 2 2 5 4 4 4 6 3 3 2 3 4 2 5 4 6 3 5 2 4 3 6 4 3 3 5 2 6 3 2 4 3 6 3 2 5 3 2 3 5 3 2 3 4 4 3 2 2 3 3 6 2 3 5 2 3 2 6 2 2 4 5 5 3 3 3 3 3 2 3 5 4 5 4 4 6 3 6 6 6 1 3 1 6 2 1 6 3 2 3 1 3 2 6 2 1 5 5 4 4 2 3 1 3 5 2 2 3 4 5 4 1 1 5 2 5 2 3 6 6 1 4 5 6 3 3 1 2 2 6 6 1 5 5 1 4 2 3 1 2 5 2 2 4 5 2 1 6 2 6 6 5 5 3 5 1 2 2 2 6 3 1 5 6 6 6 3 1 2 2 6 5 6 1 1 6 2 2 2 1 1 4 4 1 4 5 4 2 2 5 4 5 3 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 2 1 1 1 1 1 1 1 1 1 2 1 1 2 2 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 2 1 1 1 2 2 1 1 1 2 1 2 1 1 2 1 1 1 1 1 1 1 1 2 1 1 2 2 2 2 1 1 2 1 1 1 1 2 1 1 1 1 1 1 1 1 2 2 1 1 2 1 2 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 2 1 1 2 1 1 2 1 1 2 1 1 1 1 2 1 2 2 1 2 1 2 1 1 1 2 1 1 1 1 1 1 2 2 2 1 1 1 1 1 1 2 2 2 1 1 2 2 2 1 1 1 1 1 1 1 2 1 1 2 2 1 2 1 1 1 1 2 2 1 2 1 2 2 1 2 1 1 1 2 1 2 2 1 1 2 1 2 2 2 2 1 1 1 2 2 1 2 1 1 1 2 1 1 1 1 1 2 2 2 1 1 1 1 1 1 2 2 2 2 2 1 2 2 2 1 1 2 1 1 2 2 2 1 1 1 1 2 2 2 2 1 1 2 1 1 2 1 1 2 2 1 2 2 2 1 1 2 1 1 2 1 2 1 1 2 1 1 2 2 1 1 1 2 1 1 1 1 1 1 2 1 1 1 1 1 2 2 1 2 1 2 2 1 1 2 2 2 2 2 1 2 2 1 1 1 2 2 1 1 2 1 2 1 2 2 2 2 2 1 2 1 1 2 1 1 1 2 2 2 2 2 2 2 1 2 2 2 1 2 1 1 2 1 1 1 2 2 2 2 51 65 64 73 64 61 65 84 54 
72 42 61 71 83 43 52 68 69 65 58 63 59 75 65 41 60 72 51 72 49 82 64 57 67 65 62 49 60 75 54 59 58 50 60 43 48 49 44 77 75 54 68 58 66 47 60 66 51 49 50 52 40 69 56 70 47 46 53 67 68 90 44 48 67 58 69 75 58 72 72 70 71 55 73 50 63 58 56 62 55 50 77 67 53 55 71 65 50 61 72 51 59 56 61 61 68 71 57 72 55 61 47 66 52 61 66 64 73 67 68 58 68 85 74 53 60 58 66 58 39 54 49 52 35 44 81 74 65 66 74 90 60 63 61 67 88 69 46 69 48 77 69 75 71 58 66 44 59 78 58 65 53 49 65 59 79 57 54 47 68 63 72 51 43 43 65 54 50 39 46 49 52 69 55 48 20 47 67 66 60 54 54 59 47 57 2 4 1 1 1 2 2 4 1 4 4 1 2 4 2 4 4 1 1 2 2 2 2 1 2 4 4 1 2 4 1 1 2 2 1 4 4 4 2 2 4 1 1 1 2 2 1 1 1 2 1 1 4 2 1 4 4 1 1 2 4 4 1 2 4 4 2 4 4 4 4 4 1 2 4 1 4 2 4 1 4 2 1 1 4 1 2 4 4 4 2 4 2 2 2 2 1 2 4 1 2 4 1 1 2 1 1 2 4 2 2 4 2 4 4 2 4 1 1 1 2 4 4 1 1 2 2 1 2 4 1 2 1 4 2 4 1 4 4 4 2 1 1 4 1 2 4 1 1 2 1 2 4 4 1 2 2 1 4 1 4 4 4 2 1 4 2 2 1 1 4 2 2 2 4 4 1 2 1 1 2 1 4 2 2 4 4 4 2 2 2 1 1 1 1 3 2 3 4 4 3 4 1 3 2 2 4 3 3 4 4 2 3 3 4 4 4 3 3 4 3 1 4 3 3 3 2 4 3 3 1 1 3 3 2 2 3 4 3 2 3 4 3 4 4 3 4 3 4 3 3 4 3 3 4 4 4 3 1 4 3 3 2 3 3 3 3 4 3 4 3 3 3 3 3 2 4 3 3 3 3 1 3 4 2 4 2 3 3 2 3 4 4 4 4 3 3 4 3 2 3 3 2 3 3 2 4 3 4 4 4 4 4 3 2 3 3 2 3 4 1 3 1 2 3 4 2 3 3 4 3 3 4 4 3 3 4 4 4 4 3 4 4 2 3 4 1 3 4 4 3 4 3 4 4 3 3 2 3 4 2 4 4 3 4 2 3 3 3 3 2 4 4 4 4 3 3 4 3 3 2 3 3 3 3 4 3 4 3 3 1 3 3 0 3 0 3 3 3 2 2 3 1 1 3 3 3 0 0 3 3 3 1 3 3 3 3 3 1 2 0 3 3 3 0 2 3 3 3 3 2 2 0 0 2 3 3 1 1 1 3 0 2 3 2 2 2 3 0 0 3 3 3 3 3 2 1 3 1 1 3 2 2 1 3 2 0 3 3 0 3 0 1 0 3 0 3 0 3 2 3 2 2 2 2 3 3 3 3 0 0 3 0 1 3 3 3 2 1 0 2 1 2 3 3 3 3 0 3 3 3 3 2 0 0 2 2 2 3 3 1 3 1 0 3 3 0 3 3 2 0 1 1 1 3 0 0 1 2 0 0 3 3 3 3 2 1 1 3 3 3 2 2 2 1 2 3 0 0 1 2 3 1 3 3 3 3 3 3 0 3 2 1 0 0 3 3 0 2 0 3 3 0 3 3 2468 2968 3368 5768 9568 10668 10768 12068 13368 15468 15468 18268 18468 19068 20768 21768 22768 23368 25968 28068 28068 28268 28268 28968 29468 29868 30468 30868 30868 31068 31868 32468 33568 33368 33868 369 769 969 1769 2469 2469 3569 4469 4569 4969 5169 5669 2769 8369 
9369 11869 12569 12769 12969 13269 13569 14369 15569 15669 16669 16769 17869 19969 20469 20469 23069 24569 26669 27969 26869 28069 28969 29069 30469 30469 32869 32869 33069 33269 33569 33669 34469 35369 36369 870 4270 4470 4870 4970 5470 5770 7870 8270 9670 11070 11870 12470 13170 14470 14670 15270 15870 16070 16670 17470 18770 18970 19070 20570 21170 21970 23170 24370 25170 25470 25870 28570 28770 31670 32770 33370 33670 34170 34270 34370 34470 35570 36270 1271 1571 1871 2271 2671 3371 4371 4971 6771 7571 7771 8871 10571 11371 15371 15471 15971 16171 18371 18871 20171 20271 20271 20271 20971 21671 21871 22171 23771 25371 26371 27371 28071 28471 29471 29971 31471 31971 32171 32371 32671 33071 34071 34271 34771 1272 3572 4672 5472 5572 5672 5972 8072 8272 13671 14372 14372 15672 15772 20572 20772 20972 22772 24372 24872 27672 12371 1 1 1 1 1 0 1 1 1 1 1 1 1 1 1 1 0 1 1 1 1 1 1 1 1 0 1 1 1 1 1 1 1 1 1 0 1 1 1 1 1 1 0 1 0 0 1 0 1 1 1 1 0 1 1 1 1 1 1 1 1 1 1 1 0 1 1 1 0 0 1 0 1 1 1 0 0 1 1 0 1 1 1 1 0 1 1 0 1 1 1 0 1 0 0 1 1 1 1 1 1 1 1 0 0 1 1 1 1 0 0 1 0 1 1 0 1 1 0 1 1 0 1 0 1 0 0 0 1 1 0 1 1 1 0 1 1 0 1 1 1 1 1 1 1 1 1 0 0 1 1 1 0 1 0 1 0 0 1 1 1 1 1 1 1 1 0 1 1 1 0 1 1 0 1 0 1 1 1 0 0 1 0 1 0 0 1 1 1 0 1 1 1 0 1 631 270 327 243 916 1823 637 235 255 184 1064 414 216 324 480 245 1565 560 376 911 279 144 1092 94 177 1472 526 173 575 222 167 1565 256 134 404 1495 162 262 307 782 661 546 1766 374 1489 1446 74 1609 301 328 459 446 1644 494 279 915 228 127 1574 561 370 805 192 273 1377 407 929 548 1317 1317 517 1307 230 763 172 1455 1234 544 800 1460 785 714 338 432 1312 351 205 1219 11 666 147 1060 477 1058 1312 696 112 308 15 130 296 293 545 1086 1250 147 726 310 599 998 1089 382 932 264 11 911 89 525 532 637 112 1095 170 943 191 928 918 825 99 99 933 461 347 372 731 363 238 593 219 465 446 553 532 154 369 541 107 854 822 775 336 513 914 757 794 105 733 600 266 317 407 346 518 395 81 608 760 343 324 254 751 334 275 546 112 182 209 208 174 651 672 291 723 498 276 90 213 
38 128 445 159 219 173 413 274 "Inst" "sex" "Treatment" "Grade" "Age" "Condition" "Site" "T" "N" "Entry" "Status" "class" "int0" "double1" "int2" "double3" "int4" "nominal:2,5,4,6,3,1" "nominal:2,1" "nominal:1,2" "nominal:1,2,3" "numeric" "nominal:1,2,3,0,4" "nominal:2,4,1" "nominal:3,2,4,1" "nominal:1,3,0,2" "nominal:2468,2968,3368,5768,9568,10668,10768,12068,13368,15468,18268,18468,19068,20768,21768,22768,23368,25968,28068,28268,28968,29468,29868,30468,30868,31068,31868,32468,33568,33368,33868,369,769,969,1769,2469,3569,4469,4569,4969,5169,5669,2769,8369,9369,11869,12569,12769,12969,13269,13569,14369,15569,15669,16669,16769,17869,19969,20469,23069,24569,26669,27969,26869,28069,28969,29069,30469,32869,33069,33269,33569,33669,34469,35369,36369,870,4270,4470,4870,4970,5470,5770,7870,8270,9670,11070,11870,12470,13170,14470,14670,15270,15870,16070,16670,17470,18770,18970,19070,20570,21170,21970,23170,24370,25170,25470,25870,28570,28770,31670,32770,33370,33670,34170,34270,34370,34470,35570,36270,1271,1571,1871,2271,2671,3371,4371,4971,6771,7571,7771,8871,10571,11371,15371,15471,15971,16171,18371,18871,20171,20271,20971,21671,21871,22171,23771,25371,26371,27371,28071,28471,29471,29971,31471,31971,32171,32371,32671,33071,34071,34271,34771,1272,3572,4672,5472,5572,5672,5972,8072,8272,13671,14372,15672,15772,20572,20772,20972,22772,24372,24872,27672,12371" "nominal:1,0" "numeric"