- data
Input data frame with continuous target variable 'regname' and the
influential variables, which need to be factors or numericals (transform logicals and character variables to factors)
- ctestv
Vector of character strings of forbidden split results;
Example: ctestv <- rbind('variable1 == {value1, value2}','variable2 <= value3'), where
character strings specified in 'value1', 'value2' are not allowed as results of a splitting operation in variable 1 in a tree.
For restrictions of the type 'variable <= xxx', all split results in a tree are excluded with 'variable <= yyy' and yyy <= xxx.
Trees with split results specified in 'ctestv' are not accepted during optimization.
A concrete example is: 'ctestv <- rbind('ETH == {C2a, C1a}','AGE <= 20')' for variables 'ETH' and 'AGE' and values 'C2a','C1a', and '20';
If no restrictions exist, the default = NA is used.
- inddep
Column indices of target variables in datain
- N
Number of repetitions of subsampling from predictors (integer) in versions "b" and "c";
default = 99
- pobs1
Percentage(s) of observations for subsampling at stage 1;
default=c(0.9,0.7)
- ppre1
Percentage(s) of predictors for subsampling at stage 1;
default=c(0.9,0.7)
- pobs2
Percentage(s) of observations for subsampling at stage 2";
default=pobs1
- ppre2
Percentage(s) of predictors for subsampling at stage 2;
default=ppre1
- conf.level
(1 - significance level) in function ctree
(numerical, > 0 and <= 1);
default = 0.95
- minsplit
Minimum number of elements in a node to be splitted;
default = 20
- minbucket
Minimum number of elements in a node;
default = 7