This function extracts features based on minimum log-Loss function using Cox proportional hazard model as learner method on a high dimensional survival data. For those genes, we obtain frailty variances using CoxPH.
mlhighFrail(
cols,
idSurv,
idEvent,
idFrail,
dist = "gaussian",
per = 20,
fold = 3,
data
)A dataframe containing desired number of features with corresponding frailty variances.
A numeric vector of column numbers indicating the features for which the log Loss functions are to be computed
The name of the survival time variable
The name of the survival event variable
The name of the frailty variable
The name of the frailty distribution. Options are "gamma", "gaussian" or "t", default is "gaussian"
Percentage of features to be selected, default value 20
An integer denoting number of folds in cross validation, default value 3
A data frame that contains the survival and covariate information for the subjects
Atanu Bhattacharjee, Gajendra K. Vishwakarma & Souvik Banerjee
Performs CoxPH frailty on high doimensional survival data
Using the Cox proportional hazard model on the given survival data, this function selects the most significant feature based on minimum logarithmic loss function. The logarithmic loss function is defined as, $$L(f,t)=-log(f(t))$$ After selcting the most significant features, a Cox proportional hazard frailty model is fitted on the selected features. The CoxPH frailty model is defined as, $$\lambda(t)=\lambda 0(t)\nu exp{X'\beta}$$ where \(\nu\) is called the frailty component. The variance of the frailty term is considered as the heterogeneity among the subjects or patients. The distribution of frailty component is considered as either Gaussian, Gamma or t distribution.
Sonabend, R., Kiraly, F. J., Bender, A., Bernd Bischl B. and Lang M. mlr3proba: An R Package for Machine Learning in Survival Analysis, 2021, Bioinformatics, <https://doi.org/10.1093/bioinformatics/btab039>
mlhighHet, mlhighCox
if (FALSE) {
data(hnscc)
mlhighFrail(cols=c(10:20), idSurv="OS", idEvent="Death", idFrail="ID", dist="gaussian",
per=20, fold = 3, data=hnscc)
}
Run the code above in your browser using DataLab