Learn R Programming

MVar (version 2.0.4)

Cluster: Cluster Analysis.

Description

Performs hierarchical and non-hierarchical cluster analysis in a data set.

Usage

Cluster(Data, Titles = NA, Hierarquico = TRUE, Analise = "Obs",  
        CorAbs = FALSE, Normaliza = FALSE, Distance = "euclidean",  
        Method = "complete", Horizontal = FALSE, NumGrupos = 0,
        Casc = TRUE)

Arguments

Data

Data to be analyzed.

Titles

Titles of the graphics, if not set, assumes the default text.

Hierarquico

Hierarchical groupings (default = TRUE), for non-hierarchical groupings (Method K-Means), only for case Analysis = "Obs".

Analise

"Obs" for analysis on observations (default), "Var" for analysis on variables.

CorAbs

Matrix of absolute correlation case Analyze = "Var" (default = FALSE).

Normaliza

Normalizes the data only for case Analyze = "Obs" (default = TRUE).

Distance

Metric of the distances in case of hierarchical groupings: "euclidean" (default), "maximum", "manhattan", "canberra", "binary" or "minkowski". Case Analysis = "Var" the metric will be the correlation matrix, according to CorAbs.

Method

Method for analyzing hierarchical groupings: "complete" (default), "ward.D", "ward.D2", "single", "average", "mcquitty", "median" or "centroid".

Horizontal

Horizontal dendrogram (default = FALSE).

NumGrupos

Number of groups to be formed.

Casc

Cascade effect in the presentation of the graphics (default = TRUE).

Value

Several graphics.

TabRes

Table with similarities and distances of the groups formed.

Groups

Original data with groups formed.

ResGroups

Results of the groups formed.

SQT

Total sum of squares.

MatrixD

Matrix of the distances.

References

MINGOTI, S. A. Analise de dados atraves de Methods de estatistica multivariada: uma abordagem aplicada. Belo Horizonte: UFMG, 2005. 297 p.

FERREIRA, D. F. Estatistica Multivariada. 2a ed. revisada e ampliada. Lavras: Editora UFLA, 2011. 676 p.

RENCHER, A. C. Methods of Multivariate Analysis. 2th. ed. New York: J.Wiley, 2002. 708 p.

Examples

Run this code
# NOT RUN {
data(DataQuan) # set of quantitative data

Data <- DataQuan[,2:8]

rownames(Data) <- DataQuan[1:nrow(DataQuan),1]

Res <- Cluster(Data, Hierarquico = TRUE, Analise = "Obs", CorAbs = FALSE, 
               Normaliza = FALSE, Distance = "euclidean", Method = "ward.D", 
               Horizontal = FALSE, NumGrupos = 2)

print("Table with similarities and distances:"); Res$TabRes
print("Groups formed:"); Res$Groups
print("Table with the results of the groups:"); Res$ResGroups
print("Total sum of squares:"); Res$SQT
print("Distance Matrix:"); Res$MatrixD 
 
write.table(file=file.path(tempdir(),"SimilarityTable.csv"), Res$TabRes, sep=";",
            dec=",",row.names = FALSE) 
write.table(file=file.path(tempdir(),"GroupedData.csv"), Res$Groups, sep=";",
            dec=",",row.names = TRUE) 
write.table(file=file.path(tempdir(),"GroupResults.csv"), Res$ResGroups, sep=";",
            dec=",",row.names = TRUE) 
# }

Run the code above in your browser using DataLab