Max information gains (discrete)
ComputeMaxInfoGainsDiscrete(
data,
decision,
contrast_data = NULL,
dimensions = 1,
pc.xi = 0.25,
return.tuples = FALSE,
interesting.vars = vector(mode = "integer"),
require.all.vars = FALSE
)
A data.frame
with the following columns:
IG
-- max information gain (of each variable)
Tuple.1, Tuple.2, ...
-- corresponding tuple (up to dimensions
columns, available only when return.tuples == T
)
Discretization.nr
-- always 1 (for compatibility with the non-discrete function; available only when return.tuples == T
)
Additionally attribute named run.params
with run parameters is set on the result.
input data where columns are variables and rows are observations (all discrete with the same number of categories)
decision variable as a binary sequence of length equal to number of observations
the contrast counterpart of data, has to have the same number of observations
number of dimensions (a positive integer; 5 max)
parameter xi used to compute pseudocounts (the default is recommended not to be changed)
whether to return tuples where max IG was observed (one tuple per variable) - not supported with CUDA nor in 1D
variables for which to check the IGs (none = all) - not supported with CUDA
boolean whether to require tuple to consist of only interesting.vars
# \donttest{
ComputeMaxInfoGainsDiscrete(madelon$data > 500, madelon$decision, dimensions = 2)
# }
Run the code above in your browser using DataLab