- dataset
Data frame with an outcome variable and other columns as
features. Alternatively, the input can be in TreeSummarizedExperiment
format.
- ...
All additional arguments are passed on to caret::train(), such
as case weights via the weights argument or ntree for rf models. See
the caret::train() docs for more details.
- outcome_colname
Column name as a string of the outcome variable
(default NULL; the first column will be chosen automatically).
- assay.type
The name of assay from dataset when the object is in
TreeSummarizedExperiment format. This assay is used as an input.
- col.var
The name of sample matdata variables from colData slot of
dataset when the object is in TreeSummarizedExperiment format. These
variables are used as predictors.
- altexp
The name of alternative experiment (altExp) from dataset
when the object is in TreeSummarizedExperiment format. This can be used
to select an experiment for the input.
- name
Name of results used when the input is
TreeSummarizedExperiment. This same name is used for assay and
altExp.
- method
Methods to preprocess the data, described in
caret::preProcess() (default: c("center","scale"), use NULL for
no normalization).
- remove_var
Whether to remove variables with near-zero variance
('nzv'; default), zero variance ('zv'), or none (NULL).
- collapse_corr_feats
Whether to keep only one of correlated features
(see corr_method and corr_thresh)
- corr_method
Correlation method. Options are the same as those supported
by stats::cor: spearman, pearson, kendall. (default: spearman)
- corr_thresh
group correlations above or equal to corr_thresh
(range 0 to 1; default: 1).
- to_numeric
Whether to change features to numeric where possible.
- group_neg_corr
Whether to group negatively correlated features
together (e.g. c(0,1) and c(1,0)).
- prefilter_threshold
Remove features which only have non-zero & non-NA
values in N rows or fewer (default: 1). Set this to -1 to keep all columns
at this step. This step will also be skipped if to_numeric is set to
FALSE.