This function checks what are available nodes to split and then call
find_split()
on each node, then decide which node creates best split, and
call splitter()
to perform the split.
checkem(
data,
cuts,
frame,
cloc,
dist,
variables,
minsplit,
minbucket,
split_order,
ncores
)
Original data set.
Cuts data set, which has the next higher value of each variable in the original data set.
The split tree transferred as data frame.
Vector of current cluster membership.
Distance matrix of all observations in the data. exported function yet. Vector of 1 for all observations.
List of variables selected for clustering procedure. It could be a vector of variable indexes, or a vector of variable names.
The minimum number of observations that must exist in a node in order for a split to be attempted.
The minimum number of observations in any terminal leaf
node. Default is minsplit/3
.
The control argument to see how many split has been done.
Number of CPU cores on the current host.
It is not supposed to return anything because global environment was used. However, if there is nothing left to split, it returns 0 to tell the caller to stop running the loop.