shape

This artificial data was generated to have five clusters: one big circle, two small circles, and two ellipses.
It was to test if the clustering algorithm could identify and distinguish between the five different clusters or not.
The dataset is generated from the following script:<pre>
makecircle &lt;- function(N, seed) {
 n &lt;- 0
 x &lt;- NULL
 set.seed(seed)
 while(n &lt; N) {
 tmp &lt;- runif(2, min = -1, max = 1)
 if (t(tmp) %*% tmp &lt; 1) {
 n &lt;- n + 1
 x &lt;- rbind(x, tmp)
 }
 }
 return (x)
}makedata &lt;- function(n, seed) {
 f &lt;- c(10, 3, 3, 1, 1)
 center &lt;- matrix(
 c(-.3, -.3, -.55, .8, .55, .8, .9, 0, .9, -.6),
 nrow = 5, ncol = 2, byrow = TRUE
 )
 s &lt;- matrix(
 c(.7, .7, .45, .2, .45, .2, .1, .1, .1, .1),
 nrow = 5, ncol = 2, byrow = TRUE
 )
 x &lt;- NULL
 for (i in 1:5) {
 tmp &lt;- makecircle(n * f[i], seed + i)
 tmp[,1] &lt;- tmp[,1] * s[i,1] + center[i,1]
 tmp[,2] &lt;- tmp[,2] * s[i,2] + center[i,2]
 x &lt;- rbind(x, tmp)
 }
 line &lt;- cbind(runif(floor(n / 3), min = -.1, max = .1), rep(.8, floor(n / 3)))
 noise &lt;- matrix(runif(8 * n, min = -1, max = 1), nrow = 4 * n, ncol = 2)
 return(rbind(x, line, noise))
}shape &lt;- makedata(50, 1000)</pre>

Implements the self-updating process clustering algorithms proposed
in Shiu and Chen (2016) <doi:10.1080/00949655.2015.1049605>.

shape: The Artificial Data of Five Different Clusters

Description

Arguments

References