- x
A data.frame, matrix, or similar object of covariates, where each
row represents an instance. If a mi_df object is passed, y, bags are
automatically extracted, and all other columns will be used as predictors.
If a mild_df object is passed, y, bags, instances are automatically
extracted, and all other columns will be used as predictors.
- y
A numeric, character, or factor vector of bag labels for each
instance. Must satisfy length(y) == nrow(x). Suggest that one of the
levels is 1, '1', or TRUE, which becomes the positive class; otherwise, a
positive class is chosen and a message will be supplied.
- bags
A vector specifying which instance belongs to each bag. Can be a
string, numeric, of factor.
- cost
The cost parameter in SVM. If method = 'heuristic', this will
be fed to kernlab::ksvm(), otherwise it is similarly in internal
functions.
- method
The algorithm to use in fitting (default 'heuristic'). When
method = 'heuristic', which employs an algorithm similar to Andrews et
al. (2003). When method = 'mip', the novel MIP method will be used. When
method = 'qp-heuristic, the heuristic algorithm is computed using the
dual SVM. See details.
- weights
named vector, or TRUE, to control the weight of the cost
parameter for each possible y value. Weights multiply against the cost
vector. If TRUE, weights are calculated based on inverse counts of
instances with given label, where we only count one positive instance per
bag. Otherwise, names must match the levels of y.
- control
list of additional parameters passed to the method that
control computation with the following components:
kernel either a character the describes the kernel ('linear' or
'radial') or a kernel matrix at the instance level.
sigma argument needed for radial basis kernel.
nystrom_args a list of parameters to pass to kfm_nystrom(). This is
used when method = 'mip' and kernel = 'radial' to generate a Nystrom
approximation of the kernel features.
max_step argument used when method = 'heuristic'. Maximum steps of
iteration for the heuristic algorithm.
type: argument used when method = 'heuristic'. The type argument is
passed to e1071::svm().
scale argument used for all methods. A logical for whether to rescale
the input before fitting.
verbose argument used when method = 'mip'. Whether to message output
to the console.
time_limit argument used when method = 'mip'. FALSE, or a time
limit (in seconds) passed to gurobi() parameters. If FALSE, no time
limit is given.
start argument used when method = 'mip'. If TRUE, the mip program
will be warm_started with the solution from method = 'qp-heuristic' to
potentially improve speed.
- ...
Arguments passed to or from other methods.
- formula
a formula with specification mi(y, bags) ~ x which uses the
mi function to create the bag-instance structure. This argument is an
alternative to the x, y, bags arguments, but requires the data
argument. See examples.
- data
If formula is provided, a data.frame or similar from which
formula elements will be extracted.
- .fns
(argument for misvm.mild_df() method) list of functions to
summarize instances over.
- cor
(argument for misvm.mild_df() method) logical, whether to
include correlations between all features in the summarization.