sksurv.datasets.load_veterans_lung_cancer

sksurv.datasets.load_veterans_lung_cancer()[source]

Load and return data from the Veterans’ Administration Lung Cancer Trial

The dataset has 137 samples and 6 features. The endpoint is death, which occurred for 128 patients (93.4%).

See [1] for further description.

Returns:
  • x (pandas.DataFrame) – The measurements for each patient.
  • y (structured array with 2 fields) – Status: boolean indicating whether the endpoint has been reached or the event time is right censored.

    Survival_in_days: total length of follow-up

References

[1]Kalbfleisch, J.D., Prentice, R.L.: “The Statistical Analysis of Failure Time Data.” John Wiley & Sons, Inc. (2002)