Applying the CHull
function on multiple samples of fit values at once, such as bootstrap samples.
MultiCHull(data, bound = "lower", PercentageFit = 1, type = "multifit")# S3 method for MultiCHull
plot(
x,
col = NULL,
pch = NULL,
whichticks = NULL,
las = 2,
plottype = "static",
...
)
# S3 method for MultiCHullcom
plot(x, browser = FALSE, ...)
# S3 method for MultiCHull
print(x, ...)
# S3 method for MultiCHull
summary(object, ...)
# S3 method for MultiCHullcom
summary(object, ...)
Dataframe with scree test values
Table which indicates the selected model in each sample
Table which indicates how often each model is selected
Original dataframe
Boundary of convex hull that was requested
Requested proportion of increase in fit of a more complex model
Dataframe with complexity in 1st column and fit measures in next columns
Boundary of convex hull to inspect: upper
or lower
Required proportion of increase in fit of a more complex model
Either 'multifit' or 'multicom'
object of class MultiCHullcom produced by MultiCHull
Vector of colors
used for plots
Vector of pch
symbols
Model names of ticks that should be displayed
Orientation of tick mark labels
Type of plot. Either 'interactive' or 'static'
Additional arguments
If FALSE plots are viewed in viewer panel. If TRUE, plots are viewed in a browser
An object of the type produced by MultiCHull
MultiCHull
functionMultiCHull
applies the CHull
code on multiple samples of fit values. To this end, the input parameter data
consists of a dataframe with complexity values in the first column and fit values in the next columns. The different samples can for example be bootstrap samples, or fit values obtained with different random starts, or from different fit measures, etc. It is possible that in some samples no optimal solution can be found. This will generate a warning, which will include the sample number.
Data frame st
contains per sample the scree test values of the solutions that were found on the upper
or lower
bound of the hull (see also CHull
). In each sample, the least and most complex model receive a 0 value. The other models have an NA value.
tab
is also a dataframe, which indicates per sample the top three of optimal models (indicated by a 1, 2 and 3). The other models have an NA value. Finally, in frq
the frequencies are shown for each model of being selected as the optimal model.
Applying the method plot()
on output of MultiCHull
yields a plot with the models on the x-axis, ordered by increasing complexity. By default, all model names are shown as perpendicular labels on the x-axis, but one can choose to display specific model names only (e.g., whichticks=c("model13","model20")
). The tick mark labels can be made horizontal, by putting parameter las
to 0.
Solid lines (only shown in case of 20 or less samples) indicate the scree test values per sample, and symbols indicate the top three of the models per sample. The symbols can be adjusted with the parameter pch
and the colors with col
. The model (or multiple models) that is selected most often across samples, is indicated with a horizontal line.
CHull
data <- data.frame("comp"= c(305,456,460,607,612,615,758,764,768,770,909,916,921,924),
"fit"= c(152,89,79,71,57,57,64,49,47,47,60,41,39,39))
test <- array(rnorm(14*20,sd=2.5),c(14,20))
for (i in 1:20){
data <- cbind(data, 'fit' = data[,2]+test[,i])
}
output <- MultiCHull(data)
summary(output)
plot(output)
Run the code above in your browser using DataLab