- x
A data.frame, matrix, or similar object of covariates, where each
row represents an instance. If a mi_df
object is passed, y, bags
are
automatically extracted, and all other columns will be used as predictors.
If a mild_df
object is passed, y, bags, instances
are automatically
extracted, and all other columns will be used as predictors.
- y
A numeric, character, or factor vector of bag labels for each
instance. Must satisfy length(y) == nrow(x)
. Suggest that one of the
levels is 1, '1', or TRUE, which becomes the positive class; otherwise, a
positive class is chosen and a message will be supplied.
- bags
A vector specifying which instance belongs to each bag. Can be a
string, numeric, of factor.
- cost
The cost parameter in SVM. If method = 'heuristic'
, this will
be fed to kernlab::ksvm()
, otherwise it is similarly in internal
functions.
- method
The algorithm to use in fitting (default 'heuristic'
). When
method = 'heuristic'
, which employs an algorithm similar to Andrews et
al. (2003). When method = 'mip'
, the novel MIP method will be used. When
method = 'qp-heuristic
, the heuristic algorithm is computed using the
dual SVM. See details.
- weights
named vector, or TRUE
, to control the weight of the cost
parameter for each possible y value. Weights multiply against the cost
vector. If TRUE
, weights are calculated based on inverse counts of
instances with given label, where we only count one positive instance per
bag. Otherwise, names must match the levels of y
.
- control
list of additional parameters passed to the method that
control computation with the following components:
kernel
either a character the describes the kernel ('linear' or
'radial') or a kernel matrix at the instance level.
sigma
argument needed for radial basis kernel.
nystrom_args
a list of parameters to pass to kfm_nystrom()
. This is
used when method = 'mip'
and kernel = 'radial'
to generate a Nystrom
approximation of the kernel features.
max_step
argument used when method = 'heuristic'
. Maximum steps of
iteration for the heuristic algorithm.
type
: argument used when method = 'heuristic'
. The type
argument is
passed to e1071::svm()
.
scale
argument used for all methods. A logical for whether to rescale
the input before fitting.
verbose
argument used when method = 'mip'
. Whether to message output
to the console.
time_limit
argument used when method = 'mip'
. FALSE
, or a time
limit (in seconds) passed to gurobi()
parameters. If FALSE
, no time
limit is given.
start
argument used when method = 'mip'
. If TRUE
, the mip program
will be warm_started with the solution from method = 'qp-heuristic'
to
potentially improve speed.
- ...
Arguments passed to or from other methods.
- formula
a formula with specification mi(y, bags) ~ x
which uses the
mi
function to create the bag-instance structure. This argument is an
alternative to the x, y, bags
arguments, but requires the data
argument. See examples.
- data
If formula
is provided, a data.frame or similar from which
formula elements will be extracted.
- .fns
(argument for misvm.mild_df()
method) list of functions to
summarize instances over.
- cor
(argument for misvm.mild_df()
method) logical, whether to
include correlations between all features in the summarization.