sksurv.datasets.load_gbsg2

sksurv.datasets.load_gbsg2()[source]

Load and return the German Breast Cancer Study Group 2 dataset

The dataset has 686 samples and 8 features. The endpoint is recurrence free survival, which occurred for 299 patients (43.6%).

See [1], [2] for further description.

Returns:
  • x (pandas.DataFrame) – The measurements for each patient.
  • y (structured array with 2 fields) – cens: boolean indicating whether the endpoint has been reached or the event time is right censored.

    time: total length of follow-up

References

[1]http://ascopubs.org/doi/abs/10.1200/jco.1994.12.10.2086
[2]Schumacher, M., Basert, G., Bojar, H., et al. “Randomized 2 × 2 trial evaluating hormonal treatment and the duration of chemotherapy in node-positive breast cancer patients.” Journal of Clinical Oncology 12, 2086–2093. (1994)