Learn R Programming

monoClust (version 1.2.1)

find_split: Find the Best Split

Description

Find the best split in terms of reduction in inertia for the transferred node, indicate by row. Find the terminal node with the greatest change in inertia and bi-partition it.

Usage

find_split(
  data,
  cuts,
  frame_row,
  cloc,
  dist,
  variables,
  minsplit,
  minbucket,
  ncores
)

Arguments

data

Original data set.

cuts

Cuts data set, which has the next higher value of each variable in the original data set.

frame_row

One row of the split tree as data frame.

cloc

Vector of current cluster membership.

dist

Distance matrix of all observations in the data. exported function yet. Vector of 1 for all observations.

variables

List of variables selected for clustering procedure. It could be a vector of variable indexes, or a vector of variable names.

minsplit

The minimum number of observations that must exist in a node in order for a split to be attempted.

minbucket

The minimum number of observations in any terminal leaf node. Default is minsplit/3.

ncores

Number of CPU cores on the current host.

Value

The updated frame_row with the next split updated.