AutoClustering

The source time series data.table.

data

FeatureColumns

ModelID

SavePath

Set based on the number of threads your machine has available.

NThreads

Set based on the amount of memory your machine has available.

MaxMemory

The maximum number of clusters to test in k-means when searching for the optimal number.

MaxClusters

The metric used to identify the top model in the grid tune. One of c("totss", "betweenss", "withinss").

ClusterMetric

If TRUE, an autoencoder is built to reduce the feature space. Otherwise, all features in FeatureColumns are used for clustering.

RunDimReduction

Node shrink rate for H2OAutoencoder. See that function for details.

ShrinkRate

Epochs

L2_Reg

ElasticAveraging

ElasticAveragingMovingRate

ElasticAveragingRegularization
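The three values accepted by ClusterMetric match the sums of squares that a k-means fit reports. The sketch below is not the package's implementation. It uses base R's kmeans() on the built-in iris data as a stand-in, and the names features and max_clusters are hypothetical placeholders for the FeatureColumns subset and MaxClusters. It only tabulates the metrics for each candidate number of clusters and makes no claim about how AutoClustering picks the winner.

```r
# Base-R illustration only; AutoClustering itself is not called here.
set.seed(42)

features     <- as.matrix(iris[, 1:4])  # stand-in for data[, FeatureColumns]
max_clusters <- 6                       # stand-in for MaxClusters

# Fit one k-means model per candidate k and collect the three metrics.
grid <- do.call(rbind, lapply(2:max_clusters, function(k) {
  fit <- kmeans(features, centers = k, nstart = 10, iter.max = 50)
  data.frame(
    k         = k,
    totss     = fit$totss,         # total sum of squares (same for every k)
    betweenss = fit$betweenss,     # between-cluster sum of squares
    withinss  = fit$tot.withinss   # total within-cluster sum of squares
  )
}))

print(grid)
```

In this table totss is identical for every k, and betweenss plus withinss adds up to totss, so the last two metrics carry the same information viewed from opposite directions.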

AutoClustering adds a column to your original data with a cluster number identifier. You can request an autoencoder to be built to reduce the dimensionality of your data before running the clustering algorithm.
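The workflow described above (optionally reduce the feature space, cluster, then attach the cluster identifier to the original data) can be sketched in base R. This is a minimal illustration under stated assumptions, not the function's actual code: PCA via prcomp() is swapped in for the autoencoder, base kmeans() stands in for the package's k-means, the number of clusters is fixed at 3 rather than tuned, and the column name ClusterID is hypothetical.

```r
# Base-R illustration only; AutoClustering itself is not called here.
set.seed(42)

dat               <- iris               # stand-in for the `data` argument
feature_columns   <- names(dat)[1:4]    # stand-in for FeatureColumns
run_dim_reduction <- TRUE               # stand-in for RunDimReduction

# Standardize the selected features before clustering.
x <- scale(as.matrix(dat[, feature_columns]))

if (run_dim_reduction) {
  # Assumption: PCA replaces the autoencoder as the dimensionality
  # reduction step; keep the first two principal components.
  x <- prcomp(x)$x[, 1:2]
}

# Cluster the (possibly reduced) features.
fit <- kmeans(x, centers = 3, nstart = 10, iter.max = 50)

# Attach the cluster number to the original data (hypothetical column name).
dat$ClusterID <- fit$cluster

head(dat)
```

Setting run_dim_reduction to FALSE clusters on all standardized feature columns instead, which mirrors the behavior described for RunDimReduction.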

Automates and ensures high quality output for most
of your machine learning and data science tasks. The package contains
high quality functions that run at efficient speed with minimal memory
constraints for supervised learning, unsupervised learning, feature
engineering, model evaluation and interpretation, along with some
helper functions for graphing. AutoCatBoostClassifier(),
AutoCatBoostRegression(), and AutoCatBoostMultiClass() depend on
the catboost package, which isn't part of the CRAN repository at
the time of this writing. The link for downloading catboost, along
with installation instructions, is in the Additional_repositories
field below. You need to install that package to make use of the
AutoCatBoost_ functions.
