Learn R Programming

Numero (version 1.9.10)

Statistical Framework to Define Subgroups in Complex Datasets

Description

High-dimensional datasets that do not exhibit a clear intrinsic clustered structure pose a challenge to conventional clustering algorithms. For this reason, we developed an unsupervised framework that helps scientists to better subgroup their datasets based on visual cues, please see Gao S, Mutter S, Casey A, Makinen V-P (2019) Numero: a statistical framework to define multivariable subgroups in complex population-based datasets, Int J Epidemiology, 48:369-37, . The framework includes the necessary functions to construct a self-organizing map of the data, to evaluate the statistical significance of the observed data patterns, and to visualize the results.

Copy Link

Version

Install

install.packages('Numero')

Monthly Downloads

422

Version

1.9.10

License

GPL (>= 2)

Maintainer

Ville-Petteri Makinen

Last Published

August 31st, 2025

Functions in Numero (1.9.10)

numero.create

Create a self-organizing map
nroPreprocess

Data cleaning and standardization
numero.clean

Clean datasets
numero.plot

Plot results from SOM analysis
numero.subgroup

Interactive subgroup assignment
nroSummary

Estimate subgroup statistics
numero.evaluate

Self-organizing map statistics
numero.prepare

Prepare datasets for analysis
numero.summary

Summarize subgroup statistics
nroPlot

Plot a self-organizing map
nroPostprocess

Standardization using existing parameters
nroMatch

Best-matching districts
nroKohonen

Self-organizing map
nroAggregate

Regional averages on a self-organizing map
nroPermute

Permutation analysis of map layout
nroDestratify

Mitigate data stratification
nroKmeans

K-means clustering
nroLabel

Label pruning
nroColorize

Assign colors based on value
nroTrain

Train self-organizing map
numero.quality

Self-organizing map statistics