Learn R Programming

opticskxi (version 1.2.0)

OPTICS K-Xi Density-Based Clustering

Description

Density-based clustering methods are well adapted to the clustering of high-dimensional data and enable the discovery of core groups of various shapes despite large amounts of noise. This package provides a novel density-based cluster extraction method, OPTICS k-Xi, and a framework to compare k-Xi models using distance-based metrics to investigate datasets with unknown number of clusters. The vignette first introduces density-based algorithms with simulated datasets, then presents and evaluates the k-Xi cluster extraction method. Finally, the models comparison framework is described and experimented on 2 genetic datasets to identify groups and their discriminating features. The k-Xi algorithm is a novel OPTICS cluster extraction method that specifies directly the number of clusters and does not require fine-tuning of the steepness parameter as the OPTICS Xi method. Combined with a framework that compares models with varying parameters, the OPTICS k-Xi method can identify groups in noisy datasets with unknown number of clusters. Results on summarized genetic data of 1,200 patients are in Charlon T. (2019) .

Copy Link

Version

Install

install.packages('opticskxi')

Monthly Downloads

665

Version

1.2.0

License

GPL-3

Maintainer

Thomas Charlon

Last Published

February 26th, 2025

Functions in opticskxi (1.2.0)

m_psych_embeds

A dataset containing the embeddings matrix of psychological related words
ggplot_optics

Ggplot optics
multishapes

A dataset containing clusters of multiple shapes
ggpairs

Plot multiple axes of a data frame or a fortified dimension reduction.
%>%

Pipe
fortify_pca

Get and fortify PCA
print_vignette_table

Print vignette table
gtable_kxi_profiles

Gtable OPTICS k-Xi distance profiles
stddev_mean

stddev_mean
residuals_table

Residuals table
opticskxi_pipeline

OPTICS k-Xi models comparison pipeline
opticskxi

OPTICS k-Xi clustering algorithm
hla

The HLA data
nice_palette

Nice palette
norm_inprod

norm_inprod
dist_matrix

dist_matrix
ensemble_metrics_bootstrap

Select models based on ensemble metrics
ensemble_metrics

Compute ensemble metrics
contingency_table

Contingency table
%<>%

Assignment pipe
cosine_simi

Cosine similarity between vectors and/or matrices.
ggplot_kxi_metrics

Ggplot OPTICS k-Xi metrics
crohn

Crohn's disease data
%$%

Exposition pipe
fortify_ica

Get and fortify ICA
fortify_dimred

Fortify a dimension reduction object
get_best_kxi

Get best k-Xi model
ensemble_models

Select best models based on ensemble metrics