sksurv.datasets.load_gbsg2()

Load and return the German Breast Cancer Study Group 2 dataset

The dataset has 686 samples and 8 features. The endpoint is recurrence free survival, which occurred for 299 patients (43.6%).

See [1], [2] for further description.

Returns: x (pandas.DataFrame) – The measurements for each patient. y (structured array with 2 fields) – cens: boolean indicating whether the endpoint has been reached or the event time is right censored.time: total length of follow-up

References

 [2] Schumacher, M., Basert, G., Bojar, H., et al. “Randomized 2 × 2 trial evaluating hormonal treatment and the duration of chemotherapy in node-positive breast cancer patients.” Journal of Clinical Oncology 12, 2086–2093. (1994)