RemixAutoML (version 0.4.8)

AutoClustering: AutoClustering

Description

AutoClustering adds a column containing a cluster number identifier to your original data.

Usage

AutoClustering(
  data,
  FeatureColumns = NULL,
  ModelID = "TestModel",
  SavePath = NULL,
  NThreads = 8,
  MaxMemory = "28G",
  MaxClusters = 50,
  ClusterMetric = "totss",
  Epochs = 5L,
  L2_Reg = 0.1,
  ElasticAveraging = TRUE,
  ElasticAveragingMovingRate = 0.9,
  ElasticAveragingRegularization = 0.001
)

Arguments

data

The source time series data.table to which the cluster identifier column is added

FeatureColumns

The independent variables (feature columns) used for clustering

ModelID

Identifier used to name the saved model files

SavePath

Directory path for saving models

NThreads

Number of threads to use; set based on the number of threads your machine has available

MaxMemory

Maximum memory to use, e.g. "28G"; set based on the amount of memory your machine has available

MaxClusters

Maximum number of clusters to test in k-means when searching for the optimal number

ClusterMetric

Metric used to identify the top model in the grid tune. One of c("totss","betweenss","withinss")
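
To make the MaxClusters and ClusterMetric arguments more concrete, here is a minimal sketch using base R's stats::kmeans rather than AutoClustering itself. It fits k-means over a range of cluster counts and tabulates the three sums of squares that share names with the ClusterMetric options. The iris data, the upper bound of 6, and the variable names are illustrative assumptions. How AutoClustering ranks models on each metric is not stated on this page.

```r
# Illustration with base R only; this is NOT the AutoClustering implementation.
set.seed(42)

# Illustrative numeric features (assumption: any numeric matrix would do)
X <- scale(iris[, 1:4])

# Plays the role of MaxClusters in this sketch
max_clusters <- 6
k_values <- 2:max_clusters

# Fit one k-means model per candidate number of clusters
fits <- lapply(k_values, function(k) stats::kmeans(X, centers = k, nstart = 10))

# Collect the sums of squares reported by stats::kmeans
metrics <- data.frame(
  k         = k_values,
  totss     = sapply(fits, function(f) f$totss),        # total sum of squares
  betweenss = sapply(fits, function(f) f$betweenss),    # between-cluster sum of squares
  withinss  = sapply(fits, function(f) f$tot.withinss)  # total within-cluster sum of squares
)

print(metrics)
```

In stats::kmeans the total sum of squares equals the between-cluster plus the total within-cluster sum of squares, and it does not depend on the number of clusters, so only betweenss and withinss vary across rows of this table.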

Epochs

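Number of training epochs for the autoencoder

L2_Reg

L2 regularization strength for the autoencoder

ElasticAveraging

TRUE or FALSE; whether to use elastic averaging when training the autoencoder

ElasticAveragingMovingRate

Moving rate for elastic averaging in the autoencoder

ElasticAveragingRegularization

Regularization strength for elastic averaging in the autoencoder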
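
The autoencoder arguments above have the same names and default values as several arguments of h2o::h2o.deeplearning. The sketch below shows one way such settings could be passed to an H2O autoencoder. That AutoClustering forwards them in this way is an assumption, not something this page states. The ae_args list, the iris data, and the two-unit hidden layer are hypothetical choices for illustration, and the block only runs if the h2o package and Java are available.

```r
# Hypothetical mapping from AutoClustering arguments to h2o.deeplearning arguments
ae_args <- list(
  epochs = 5L,                              # Epochs
  l2 = 0.1,                                 # L2_Reg
  elastic_averaging = TRUE,                 # ElasticAveraging
  elastic_averaging_moving_rate = 0.9,      # ElasticAveragingMovingRate
  elastic_averaging_regularization = 0.001  # ElasticAveragingRegularization
)

# Guarded so the script does nothing when h2o is not installed
if (requireNamespace("h2o", quietly = TRUE)) {
  # Start a local H2O cluster (plays the role of NThreads and MaxMemory)
  h2o::h2o.init(nthreads = 2, max_mem_size = "2G")

  # Illustrative numeric features
  train <- h2o::as.h2o(iris[, 1:4])

  # Train an autoencoder with a small bottleneck layer (assumed architecture)
  ae <- do.call(
    h2o::h2o.deeplearning,
    c(
      list(
        x = names(iris)[1:4],
        training_frame = train,
        autoencoder = TRUE,
        hidden = 2
      ),
      ae_args
    )
  )

  # Extract the bottleneck representation of each row
  encoded <- h2o::h2o.deepfeatures(ae, train, layer = 1)
  print(dim(encoded))

  h2o::h2o.shutdown(prompt = FALSE)
}
```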

Value

The original data.table with an added column containing the cluster number identifier

See Also

Other Unsupervised Learning: GenTSAnomVars(), H2OIsolationForestScoring(), H2OIsolationForest(), ResidualOutliers()

Examples
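
A minimal sketch of a call, based only on the signature shown under Usage. It assumes that RemixAutoML 0.4.8 and data.table are installed, and that FeatureColumns accepts a character vector of column names. The iris data and the reduced NThreads, MaxMemory, and MaxClusters values are illustrative choices, not recommendations from the package.

```r
# Feature columns for the illustration: the four numeric columns of iris
feature_cols <- names(iris)[1:4]

# Guarded so the script does nothing when the packages are not installed
if (requireNamespace("RemixAutoML", quietly = TRUE) &&
    requireNamespace("data.table", quietly = TRUE)) {

  # AutoClustering expects a data.table as input
  data <- data.table::as.data.table(iris)

  data <- RemixAutoML::AutoClustering(
    data,
    FeatureColumns = feature_cols,  # assumption: column names are accepted
    ModelID = "TestModel",
    SavePath = NULL,
    NThreads = 2,                   # adjust to your machine
    MaxMemory = "4G",               # adjust to your machine
    MaxClusters = 10,
    ClusterMetric = "totss",
    Epochs = 5L,
    L2_Reg = 0.1,
    ElasticAveraging = TRUE,
    ElasticAveragingMovingRate = 0.9,
    ElasticAveragingRegularization = 0.001
  )

  # Inspect the returned data.table, including the added cluster column
  str(data)
}
```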
