sksurv.datasets.load_veterans_lung_cancer

sksurv.datasets.load_veterans_lung_cancer()[source]

Load and return data from the Veterans’ Administration Lung Cancer Trial

The dataset has 137 samples and 6 features. The endpoint is death, which occurred for 128 patients (93.4%).

See 1 for further description.

Returns

  • x (pandas.DataFrame) – The measurements for each patient.

  • y (structured array with 2 fields) – Status: boolean indicating whether the endpoint has been reached or the event time is right censored.

    Survival_in_days: total length of follow-up

References

1

Kalbfleisch, J.D., Prentice, R.L.: “The Statistical Analysis of Failure Time Data.” John Wiley & Sons, Inc. (2002)