Learn R Programming

Numero (version 1.9.8)

Statistical Framework to Define Subgroups in Complex Datasets

Description

High-dimensional datasets that do not exhibit a clear intrinsic clustered structure pose a challenge to conventional clustering algorithms. For this reason, we developed an unsupervised framework that helps scientists to better subgroup their datasets based on visual cues, please see Gao S, Mutter S, Casey A, Makinen V-P (2019) Numero: a statistical framework to define multivariable subgroups in complex population-based datasets, Int J Epidemiology, 48:369-37, . The framework includes the necessary functions to construct a self-organizing map of the data, to evaluate the statistical significance of the observed data patterns, and to visualize the results.

Copy Link

Version

Install

install.packages('Numero')

Monthly Downloads

298

Version

1.9.8

License

GPL (>= 2)

Maintainer

Ville-Petteri Makinen

Last Published

September 17th, 2024

Functions in Numero (1.9.8)

numero.prepare

Prepare datasets for analysis
numero.quality

Self-organizing map statistics
numero.create

Create a self-organizing map
numero.summary

Summarize subgroup statistics
numero.clean

Clean datasets
nroKohonen

Self-organizing map
nroLabel

Label pruning
nroAggregate

Regional averages on a self-organizing map
nroKmeans

K-means clustering
nroDestratify

Mitigate data stratification
nroColorize

Assign colors based on value
nroMatch

Best-matching districts
nroPermute

Permutation analysis of map layout
numero.evaluate

Self-organizing map statistics
nroTrain

Train self-organizing map
numero.subgroup

Interactive subgroup assignment
nroPostprocess

Standardization using existing parameters
nroPlot

Plot a self-organizing map
numero.plot

Plot results from SOM analysis
nroSummary

Estimate subgroup statistics
nroPreprocess

Data cleaning and standardization