Learn R Programming

⚠️There's a newer version (1.9.8) of this package.Take me there.

Numero (version 1.7.4)

Statistical Framework to Define Subgroups in Complex Datasets

Description

High-dimensional datasets that do not exhibit a clear intrinsic clustered structure pose a challenge to conventional clustering algorithms. For this reason, we developed an unsupervised framework that helps scientists to better subgroup their datasets based on visual cues, please see Gao S, Mutter S, Casey A, Makinen V-P (2019) Numero: a statistical framework to define multivariable subgroups in complex population-based datasets, Int J Epidemiology, 48:369-37, . The framework includes the necessary functions to construct a self-organizing map of the data, to evaluate the statistical significance of the observed data patterns, and to visualize the results.

Copy Link

Version

Install

install.packages('Numero')

Monthly Downloads

298

Version

1.7.4

License

GPL (>= 2)

Maintainer

Ville-Petteri Makinen

Last Published

January 22nd, 2021

Functions in Numero (1.7.4)

nroKohonen

Self-organizing map
nroPlot

Plot a self-organizing map
nroKmeans

K-means clustering
nroPermute

Permutation analysis of map layout
nroLabel

Label pruning
nroMatch

Best-matching districts
nroColorize

Assign colors based on value
nroAggregate

Regional averages on a self-organizing map
nroPreprocess

Data cleaning and standardization
nroPostprocess

Standardization using existing parameters
nroRcppVector

Safety check for Rcpp calls
nroRcppMatrix

Safety check for Rcpp calls
numero.create

Create a self-organizing map
numero.clean

Clean datasets
numero.summary

Summarize subgroup statistics
numero.subgroup

Interactive subgroup assignment
nroSummary

Estimate subgroup statistics
nroTrain

Train self-organizing map
nroImpute

Impute missing values
nroDestratify

Mitigate data stratification
numero.evaluate

Self-organizing map statistics
numero.plot

Plot results from SOM analysis
numero.prepare

Prepare datasets for analysis
numero.quality

Self-organizing map statistics