A Set of Configuration Settings for the Subgroup and Pattern Mining Algorithms
Objects are created by calls of the form
new("SDTaskConfig", ...).
qf: A quality function; one of:
Binomial-Test "bin",
Chi-Square-Test "chi2",
Lift "lift",
Piatetsky-Shapiro "ps",
Gain "gain",
Relative Gain "relgain",
Weighted Relative Accuracy "wracc".
method: A mining method; one of:
Beam-Search "beam",
BSD "bsd",
SD-Map "sdmap",
SD-Map enabling internal disjunctions "sdmap-dis".
k: The maximum number (top-k) of patterns to discover.
minqual: The minimal quality.
minsize: The minimal size of a subgroup (minimal coverage of database records).
maxlen: The maximal description length of a pattern, i.e., the maximal number of conjunctions.
nodefaults: Ignore default values, i.e., do not include the respective first value of each attribute.
relfilter: Controls whether irrelevant patterns are filtered during pattern mining; enabling this negatively impacts performance.
postfilter: Controls whether a post-processing filter is applied; one of:
Minimum Improvement (Global) "min-improve-global": checks the patterns against all their possible generalizations,
Minimum Improvement (Pattern Set) "min-improve-set": checks the patterns against all their generalizations in the result set,
Relevancy Filter "relevancy": removes patterns that are strictly irrelevant,
Significant Improvement (Global) "sig-improve-global": removes patterns that do not significantly improve (0.05 level) w.r.t. all their possible generalizations,
Significant Improvement (Set) "sig-improve-set": removes patterns that do not significantly improve (0.05 level) w.r.t. all their generalizations in the result set.
attributes: The list of attributes to consider for mining: either a vector of attribute names, or NULL, which includes all attributes.
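
The slots described above can be set directly in the call to new(). The following is a minimal sketch, not a definitive usage: it assumes that the SDTaskConfig class is provided by the rsubgroup package (the page itself does not name the package), that the numeric slots accept plain numbers, and that any slot left out keeps its default. The attribute names are hypothetical placeholders for columns in your own data.

```r
# Assumption: SDTaskConfig is defined by the rsubgroup package.
library(rsubgroup)

# Build a configuration using slot names and string values from the list above.
config <- new("SDTaskConfig",
              qf = "wracc",                   # Weighted Relative Accuracy
              method = "beam",                # Beam-Search
              k = 10,                         # report at most 10 patterns
              minsize = 20,                   # subgroups must cover at least 20 records
              maxlen = 3,                     # at most 3 conjunctions per pattern
              postfilter = "min-improve-set", # Minimum Improvement (Pattern Set)
              # Hypothetical attribute names; use NULL to include all attributes.
              attributes = c("checking_status", "employment"))

# Slots of an S4 object are read with the @ operator.
config@qf
config@k
config@postfilter
```

Restricting attributes and keeping maxlen small narrows the search space, which is usually a sensible starting point before enabling the more expensive filters such as relfilter.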