Usage
Classifier.par(train, test = NULL, train.label, type = c("TSP", "GLM", "GLM_L1", "GLM_L2", "PAM", "SVM", "plsrf_x", "plsrf_x_pv", "RF"), CVtype = c("loocv", "k-fold"), outerkfold = 5, innerkfold = 5, ncpus = 2)
Arguments
train
An object of class ExpressionSet
or data frame or matrix contains predictors for the training set, where columns correspond to samples and rows to features.
test
An object of class ExpressionSet
or data frame or matrix contains predictors for the test set (optional), where columns correspond to samples and rows to features.
train.label
A numeric vector contains the actual class labels (0 or 1) of the training set. NOTE: values should be numeric not factor.
type
Type of classification algorithm used. Currently 9 well-known algorithm are available for user to choose from. They are: top scoring pair (TSP), logistic regression (GLM), GLM with L1 (lasso) penalty, GLM with L2 (ridge) penalty, prediction analysis for microarray (PAM), support vector machine (SVM), Random Forest combined with partial least square dimension reduction (plsrf_x), Random Forest combined with partial least square dimension reduction plus pre-validation (plsrf_x_pv), Random Forest (RF). NOTE: "TSP", "PAM", "plsrf_x" and "plsrf_x_pv" are exclusively for high-dimensional data.
CVtype
Cross-validation type used to obtain the predicted labels of the training set. Must be either k-fold cross-validation (k-fold), or leave-one-out-cross-validation (loocv).
outerkfold
Number of cross validation used in the training phase.
innerkfold
Number of cross validation used to estimate the model parameters. E.g. penalty parameter in "GLM_L1".
ncpus
Number of cores assign to the parallel computation.