Learn R Programming

⚠️There's a newer version (1.9.8) of this package.Take me there.

Numero (version 1.9.6)

Statistical Framework to Define Subgroups in Complex Datasets

Description

High-dimensional datasets that do not exhibit a clear intrinsic clustered structure pose a challenge to conventional clustering algorithms. For this reason, we developed an unsupervised framework that helps scientists to better subgroup their datasets based on visual cues, please see Gao S, Mutter S, Casey A, Makinen V-P (2019) Numero: a statistical framework to define multivariable subgroups in complex population-based datasets, Int J Epidemiology, 48:369-37, . The framework includes the necessary functions to construct a self-organizing map of the data, to evaluate the statistical significance of the observed data patterns, and to visualize the results.

Copy Link

Version

Install

install.packages('Numero')

Monthly Downloads

323

Version

1.9.6

License

GPL (>= 2)

Maintainer

Ville-Petteri Makinen

Last Published

February 6th, 2024

Functions in Numero (1.9.6)

numero.clean

Clean datasets
numero.summary

Summarize subgroup statistics
nroSummary

Estimate subgroup statistics
nroKmeans

K-means clustering
nroPermute

Permutation analysis of map layout
nroPostprocess

Standardization using existing parameters
nroColorize

Assign colors based on value
nroAggregate

Regional averages on a self-organizing map
nroDestratify

Mitigate data stratification
nroLabel

Label pruning
nroKohonen

Self-organizing map
nroMatch

Best-matching districts
nroPlot

Plot a self-organizing map
numero.plot

Plot results from SOM analysis
numero.quality

Self-organizing map statistics
numero.prepare

Prepare datasets for analysis
nroTrain

Train self-organizing map
nroPreprocess

Data cleaning and standardization
numero.create

Create a self-organizing map
numero.evaluate

Self-organizing map statistics
numero.subgroup

Interactive subgroup assignment