sksurv.svm.
HingeLossSurvivalSVM
Naive implementation of kernel survival support vector machine.
A new set of samples is created by building the difference between any two feature vectors in the original data, thus this version requires \(O(\text{n_samples}^4)\) space and \(O(\text{n_samples}^6 \cdot \text{n_features})\).
See sksurv.svm.NaiveSurvivalSVM for the linear naive survival SVM based on liblinear.
sksurv.svm.NaiveSurvivalSVM
See 1, 2, 3 for further description.
solver ("cvxpy" | "cvxopt" | "osqp", optional, default: cvxpy) – Which quadratic program solver to use.
alpha (float, positive, default: 1) – Weight of penalizing the hinge loss in the objective function.
kernel ("linear" | "poly" | "rbf" | "sigmoid" | "cosine" | "precomputed") – Kernel. Default: “linear”
gamma (float, optional) – Kernel coefficient for rbf and poly kernels. Default: 1/n_features. Ignored by other kernels.
1/n_features
degree (int, default: 3) – Degree for poly kernels. Ignored by other kernels.
coef0 (float, optional) – Independent term in poly and sigmoid kernels. Ignored by other kernels.
kernel_params (mapping of string to any, optional) – Parameters (keyword arguments) and values for kernel passed as call
pairs ("all" | "nearest" | "next", optional, default: "all") –
Which constraints to use in the optimization problem.
all: Use all comparable pairs. Scales quadratic in number of samples.
nearest: Only considers comparable pairs \((i, j)\) where \(j\) is the uncensored sample with highest survival time smaller than \(y_i\). Scales linear in number of samples (cf. sksurv.svm.MinlipSurvivalSVM).
sksurv.svm.MinlipSurvivalSVM
next: Only compare against direct nearest neighbor according to observed time, disregarding its censoring status. Scales linear in number of samples.
verbose (bool, default: False) – Enable verbose output of solver.
timeit (False or int) – If non-zero value is provided the time it takes for optimization is measured. The given number of repetitions are performed. Results can be accessed from the timings_ attribute.
timings_
max_iter (int, optional) – Maximum number of iterations to perform. By default use solver’s default value.
X_fit_
Training data.
ndarray
coef_
Coefficients of the features in the decision function.
ndarray, shape = (n_samples,)
References
Van Belle, V., Pelckmans, K., Suykens, J. A., & Van Huffel, S. Support Vector Machines for Survival Analysis. In Proc. of the 3rd Int. Conf. on Computational Intelligence in Medicine and Healthcare (CIMED). 1-8. 2007
Evers, L., Messow, C.M., “Sparse kernel methods for high-dimensional survival data”, Bioinformatics 24(14), 1632-8, 2008.
Van Belle, V., Pelckmans, K., Suykens, J.A., Van Huffel, S., “Survival SVM: a practical scalable algorithm”, In: Proc. of 16th European Symposium on Artificial Neural Networks, 89-94, 2008.
__init__
Initialize self. See help(type(self)) for accurate signature.
Methods
__init__([solver, alpha, kernel, gamma, …])
Initialize self.
fit(X, y)
fit
Build a MINLIP survival model from training data.
predict(X)
predict
Predict risk score of experiencing an event.
score(X, y)
score
Returns the concordance index of the prediction.
X (array-like, shape = (n_samples, n_features)) – Data matrix.
y (structured array, shape = (n_samples,)) – A structured array containing the binary event indicator as first field, and time of event or time of censoring as second field.
self
Higher scores indicate shorter survival (high risk), lower scores longer survival (low risk).
X (array-like, shape = (n_samples, n_features)) – The input samples.
y – Predicted risk.
X (array-like, shape = (n_samples, n_features)) – Test samples.
cindex – Estimated concordance index.
float