Learn R Programming

CluMix (version 2.3.1)

Clustering and Visualization of Mixed-Type Data

Description

Provides utilities for clustering subjects and variables of mixed data types (Hummel, Edelmann, Kopp-Schneider (2017) ). Similarities between subjects are measured by Gower's general similarity coefficient with an extension of Podani for ordinal variables. Similarities between variables can be assessed i) by combination of appropriate measures of association for different pairs of data types or ii) based on distance correlation. Alternatively, variables can also be clustered by the 'ClustOfVar' approach. The main feature of the package is the generation of a mixed-data heatmap. For visualizing similarities between either subjects or variables, a heatmap of the corresponding distance matrix can be drawn. Associations between variables can be explored by a 'confounder plot', which allows visual detection of possible confounding, collinear, or surrogate factors for some variables of primary interest. Distance matrices and dendrograms for subjects and variables can be derived and used for further visualizations and applications. This work was supported by BMBF grant 01ZX1609B, Germany.

Copy Link

Version

Install

install.packages('CluMix')

Monthly Downloads

38

Version

2.3.1

License

GPL (>= 2)

Maintainer

Manuela Hummel

Last Published

January 21st, 2019

Functions in CluMix (2.3.1)

mixdata

Small example dataset with variables of different types
dendro.subjects

Subjects dendrogram
confounderPlot

Confounder Plot
similarity.subjects

Similarity matrix for subjects
similarity.variables

Similarity matrix for variables
distmap

Display similarity matrix
mix.heatmap

Heatmap for data with variables of mixed types
dist.variables

Distance matrix for variables
dendro.variables

Variables dendrogram
dist.subjects

Distance matrix for subjects
association

Function to calculate a measure for association between two variables on arbitrary scales
CluMix-package

CluMix