
clustMixType (version 0.1-16)

lambdaest: compares variance of all variables

Description

Investigates the variances of the variables in order to specify lambda for k-prototypes clustering.

Usage

lambdaest(x)

Arguments

x
Original data: a data frame containing both numeric and factor variables.

Value

lambda
The ratio of the average over all numeric variables to the average over all factor variables (see Details for the quantities being averaged).

Details

For each numeric variable the variance is computed; for each categorical variable the quantity $1-\sum_i p_i^2$ is computed, where $p_i$ is the relative frequency of level $i$.
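
The quantities above can be reproduced by hand in base R. The following is a minimal sketch, not the package's implementation: `variance_ratio` and the `toy` data frame are hypothetical names introduced here for illustration, and the sketch assumes the returned ratio is the mean numeric variance divided by the mean categorical measure.

```r
# Hypothetical helper, not part of clustMixType: computes the per-variable
# quantities described in Details using base R only.
variance_ratio <- function(x) {
  is_num <- sapply(x, is.numeric)
  is_cat <- sapply(x, is.factor)

  # variance of each numeric variable
  v_num <- sapply(x[, is_num, drop = FALSE], var)

  # 1 - sum_i p_i^2 for each factor, with p_i the relative frequency of level i
  v_cat <- sapply(x[, is_cat, drop = FALSE], function(z) {
    p <- table(z) / length(z)
    1 - sum(p^2)
  })

  # assumption: ratio of the numeric average to the categorical average
  list(numeric = v_num, categorical = v_cat,
       ratio = mean(v_num) / mean(v_cat))
}

# small data frame with two factors and two numerics
toy <- data.frame(
  f = factor(c("A", "A", "B", "B")),
  g = factor(c("A", "A", "A", "B")),
  u = c(1, 2, 3, 4),
  v = c(0, 0, 2, 2)
)

out <- variance_ratio(toy)
print(out$numeric)
print(out$categorical)
print(out$ratio)
```

A balanced two-level factor such as `f` gives the largest possible value for two levels (0.5), while a more concentrated factor such as `g` gives a smaller one, so the ratio grows as the categorical variables become less heterogeneous relative to the numeric ones.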

Examples

library(clustMixType)

# generate toy data with factors and numerics

n   <- 100
prb <- 0.9
muk <- 1.5 
clusid <- rep(1:4, each = n)

x1 <- sample(c("A","B"), 2*n, replace = TRUE, prob = c(prb, 1-prb))
x1 <- c(x1, sample(c("A","B"), 2*n, replace = TRUE, prob = c(1-prb, prb)))
x1 <- as.factor(x1)

x2 <- sample(c("A","B"), 2*n, replace = TRUE, prob = c(prb, 1-prb))
x2 <- c(x2, sample(c("A","B"), 2*n, replace = TRUE, prob = c(1-prb, prb)))
x2 <- as.factor(x2)

x3 <- c(rnorm(n, mean = -muk), rnorm(n, mean = muk), rnorm(n, mean = -muk), rnorm(n, mean = muk))
x4 <- c(rnorm(n, mean = -muk), rnorm(n, mean = muk), rnorm(n, mean = -muk), rnorm(n, mean = muk))

x <- data.frame(x1,x2,x3,x4)

# estimate lambda from the data and pass it to k-prototypes with 2 clusters
a <- lambdaest(x)
res <- kproto(x, 2, lambda = a)
