Learn R Programming

stops (version 1.9-1)

Structure Optimized Proximity Scaling

Description

Methods that use flexible variants of multidimensional scaling (MDS) which incorporate parametric nonlinear distance transformations and trade-off the goodness-of-fit fit with structure considerations to find optimal hyperparameters, also known as structure optimized proximity scaling (STOPS) (Rusch, Mair & Hornik, 2023,). The package contains various functions, wrappers, methods and classes for fitting, plotting and displaying different 1-way MDS models with ratio, interval, ordinal optimal scaling in a STOPS framework. These cover essentially the functionality of the package smacofx, including Torgerson (classical) scaling with power transformations of dissimilarities, SMACOF MDS with powers of dissimilarities, Sammon mapping with powers of dissimilarities, elastic scaling with powers of dissimilarities, spherical SMACOF with powers of dissimilarities, (ALSCAL) s-stress MDS with powers of dissimilarities, r-stress MDS, MDS with powers of dissimilarities and configuration distances, elastic scaling powers of dissimilarities and configuration distances, Sammon mapping powers of dissimilarities and configuration distances, power stress MDS (POST-MDS), approximate power stress, Box-Cox MDS, local MDS, Isomap, curvilinear component analysis (CLCA), curvilinear distance analysis (CLDA) and sparsified (power) multidimensional scaling and (power) multidimensional distance analysis (experimental models from smacofx influenced by CLCA). All of these models can also be fit by optimizing over hyperparameters based on goodness-of-fit fit only (i.e., no structure considerations). The package further contains functions for optimization, specifically the adaptive Luus-Jaakola algorithm and a wrapper for Bayesian optimization with treed Gaussian process with jumps to linear models, and functions for various c-structuredness indices. Hyperparameter optimization can be done with a number of techniques but we recommend either Bayesian optimization or particle swarm. For using "Kriging", users need to install a version of the archived 'DiceOptim' R package.

Copy Link

Version

Install

install.packages('stops')

Monthly Downloads

207

Version

1.9-1

License

GPL-2 | GPL-3

Maintainer

Thomas Rusch

Last Published

April 28th, 2025

Functions in stops (1.9-1)

c_dependence

c-dependence calculates c-dependence as the aggregated distance correlation of each pair if nonidentical columns
c_striatedness

c-striatedness
c_regularity

c-regularity calculates c-regularity as 1 - OPTICS cordillera for k=2. The higher the more regular.
c_sparsity

c-sparsity
stop_clca

STOPS version of CLCA.
c_skinniness

c-skinniness
knn_dist

calculate k nearest neighbours from a distance matrix
ljoptim

(Adaptive) Version of Luus-Jaakola Optimization
stop_bcmds

STOPS version of Box Cox Stress
stop_cldak

STOPS version of CLDA with free k.
stop_cldae

STOPS version of CLDA with free epsilon.
c_outlying

c-outlying
print.stops

S3 print method for stops objects
print.summary.stops

S3 print method for summary.stops
c_shepardness

c-shepardness calculates the c-shepardness as the correlation between a loess smoother of the transformed distances and the transformed dissimilarities
match_partial_ignorecase_nopunct

function for lookup that partially matched and ignores cases and punctuation
jackmds.stops

MDS Jackknife for stops objects
plot.stops

S3 plot method for stops objects
coef.stops

S3 coef method for stops objects
stop_powerstress

STOPS version of powerstress
residuals.stops

S3 residuals method for stops
c_stringiness

c-stringiness
stop_apstress

STOPS version of approximated power stress models.
stop_lmds

STOPS version of lMDS
stop_isomap2

STOPS version of isomap over real epsilon.
stop_isomap1

STOPS version of isomap to optimize over integer k.
stop_cmdscale

STOPS version of strain
stop_elastic

STOPS versions of elastic scaling models (via smacofSym)
stop_rstress

STOPS version of rstress
stop_smddak

STOPS version of sparsified multidimensional distance analysis for fixed k and tau
stop_smds

STOPS version of sparsified MDS.
stop_rpowerstress

STOPS version of restricted powerstress
stop_sammon

STOPS version of Sammon mapping
stop_powermds

STOPS version of powermds
stop_sammon2

Another STOPS version of Sammon mapping models (via smacofSym)
stop_spmddae

STOPS version of sparsified post multidimensional distance analysis for fixed tau and epsilon.
stop_powerelastic

STOPS version of elastic scaling with powers for proximities and distances
stop_spmddak

STOPS version of sparsified post multidimensional distance analysis for fixed tau and k.
stop_smacofSphere

STOPS versions of smacofSphere models
stop_powersammon

STOPS version of sammon with powers
summary.stops

S3 summary method for stops
stops

High Level STOPS Function
stoploss

Calculate the weighted multiobjective loss function used in STOPS
stop_smddae

STOPS version of sparsified multidimensional distance analysis for fixed eps and tau
stops-package

stops: Structure Optimized Proximity Scaling
stop_sstress

STOPS version of sstress
stop_spmds

STOPS version of sparsified POST-MDS for fixed tau
stop_smacofSym

STOPS version of smacofSym models
tgpoptim

Bayesian Optimization by a (treed) Bayesian Gaussian Process Prior (with jumps to linear models) surrogate model Essentially a wrapper for the functionality in tgp that has the same slots as optim with defaults for STOPS models.
c_association

c-association calculates the c-association based on the maximal information coefficient We define c-association as the aggregated association between any two columns in confs
Pendigits500

Pen digits
bootmds.stops

MDS Bootstrap for stops objects
c_clumpiness

c-clumpiness
c_complexity

c-complexity Calculates the c-complexity based on the minimum cell number We define c-complexity as the aggregated minimum cell number between any two columns in confs This is one of few c-structuredness indices not between 0 and 1, but can be between 0 and (theoretically) infinity
biplotmds.stops

S3 method for stops objects
BankingCrisesDistances

Banking Crises Distances
Swissroll

Swiss roll
c_clusteredness

c-clusteredness calculates c-clusteredness as the OPTICS cordillera. The higher the more clustered.
c_convexity

c-convexity
c_inequality

c-inequality Calculates c-inequality (as in an economic measure of inequality) as Pearsons coefficient of variation of the fitted distance matrix. This can help with avoiding degenerate solutions. This is one of few c-structuredness indices not between 0 and 1, but 0 and infinity.
c_linearity

c-linearity calculates c-linearity as the aggregated multiple correlation of all columns of the configuration.
c_manifoldness

c-manifoldness calculates c-manifoldness as the aggregated maximal correlation coefficient (i.e., Pearson correlation of the ACE transformed variables) of all pairwise combinations of two different columns in confs. If there is an NA (happens usually when the optimal transformation of any variable is a constant and therefore the covariance is 0 but also one of the sds in the denominator), it gets skipped.
c_faithfulness

c-faithfulness calculates the c-faithfulness based on the index by Chen and Buja 2013 (M_adj) with equal input neigbourhoods
c_nonmonotonicity

c-nonmonotonicity calculates the c-nonmonotonicity based on the maximum asymmetric score We define c-nonmonotonicity as the aggregated nonmonotonicity between any two columns in confs this is one of few c-structuredness indices not between 0 and 1
c_functionality

c-functionality calculates the c-functionality based on the maximum edge value We define c-functionality as the aggregated functionality between any two columns of confs
c_mine

wrapper for getting the mine coefficients
c_hierarchy

c-hierarchy captures how well a partition/ultrametric (obtained by hclust) explains the configuration distances. Uses variance explained for euclidean distances and deviance explained for everything else.