Load and return data from the Veterans’ Administration Lung Cancer Trial

The dataset has 137 samples and 6 features. The endpoint is death, which occurred for 128 patients (93.4%).

See [1] for further description.

  • x (pandas.DataFrame) – The measurements for each patient.
  • y (structured array with 2 fields) – Status: boolean indicating whether the endpoint has been reached or the event time is right censored.

    Survival_in_days: total length of follow-up


[1]Kalbfleisch, J.D., Prentice, R.L.: “The Statistical Analysis of Failure Time Data.” John Wiley & Sons, Inc. (2002)