
tidylearn (version 0.1.0)

tidy_clara: Tidy CLARA (Clustering Large Applications)

Description

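Performs CLARA clustering, a scalable version of PAM (Partitioning Around Medoids). Instead of running PAM on the full dataset, CLARA runs it on repeated random samples and keeps the set of medoids that fits the whole dataset best.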
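To give a feel for what a CLARA fit produces, here is a minimal sketch that calls clara() from the cluster package directly. It is an assumption, not something this page confirms, that tidy_clara builds on cluster::clara; the object names x and fit are illustrative only.

```r
# Sketch only: CLARA via cluster::clara(). Whether tidy_clara wraps this
# function is an assumption, not stated in the tidylearn documentation.
library(cluster)

# Replicate iris 10 times (1,500 rows) and keep the four numeric columns
x <- iris[rep(seq_len(nrow(iris)), 10), 1:4]

# Draw 50 random samples, run PAM on each, keep the best medoids
fit <- clara(x, k = 3, metric = "euclidean", samples = 50)

# One cluster label per row of x
length(fit$clustering)

# One medoid per cluster; each medoid is an actual observation from x
dim(fit$medoids)

# Cluster sizes
table(fit$clustering)
```

Note that cluster::clara itself defaults to samples = 5, whereas tidy_clara defaults to samples = 50, so the sketch passes samples explicitly.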

Usage

tidy_clara(data, k, metric = "euclidean", samples = 50, sampsize = NULL)

Value

A list of class "tidy_clara" containing clustering results

Arguments

data

A data frame or tibble

k

Number of clusters

metric

Distance metric (default: "euclidean")

samples

Number of samples to draw (default: 50)

sampsize

Number of observations in each sample. If NULL (the default), min(n, 40 + 2*k) is used, where n is the number of rows in data.
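The default sample size formula above can be checked by hand. The helper default_sampsize below is hypothetical, written only to illustrate the arithmetic; it is not part of tidylearn.

```r
# Hypothetical helper (not a tidylearn function): the documented default
# sample size, min(n, 40 + 2*k), for n rows and k clusters.
default_sampsize <- function(n, k) {
  min(n, 40 + 2 * k)
}

# 1,500 rows and 3 clusters: 40 + 2*3 = 46, well below n
default_sampsize(1500, 3)

# A small dataset caps the sample size at n itself
default_sampsize(30, 3)
```

With the default, each sample stays small regardless of how many rows the data has, which is what keeps CLARA cheap on large datasets; a larger sampsize trades speed for medoids that are more representative of the full data.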

Examples

library(tidylearn)

# CLARA for large datasets: replicate iris 10 times (1,500 rows), keeping the four numeric columns
large_data <- iris[rep(seq_len(nrow(iris)), 10), 1:4]
clara_result <- tidy_clara(large_data, k = 3, samples = 50)
print(clara_result)
