# ORPHANED

#### 30 packages on CRAN

Various data sets used in examples and exercises in the book Maindonald, J.H. and Braun, W.J. (2003, 2007, 2010) "Data Analysis and Graphics Using R".

Useful when reading the above-mentioned book, which the documentation refers to as 'the book'.

Provides implementations of a family of Lasso variants, including Dantzig Selector, LAD Lasso, SQRT Lasso and Lq Lasso, for estimating high-dimensional sparse linear models. We adopt the alternating direction method of multipliers and convert the original optimization problem into a sequential L1-penalized least squares minimization problem, which can be solved efficiently by a linearization algorithm. A multi-stage screening approach is adopted for further acceleration. Besides sparse linear model estimation, we also provide extensions of these Lasso variants to sparse Gaussian graphical model estimation, including TIGER and CLIME, using either an L1 or an adaptive penalty. Missing values can be tolerated for Dantzig selector and CLIME. The computation is memory-optimized using sparse matrix output.

This package proposes a model-based clustering algorithm for multivariate functional data. The parametric mixture model, based on the assumption of normality of the principal components resulting from a multivariate functional PCA, is estimated by an EM-like algorithm. The main advantage of the proposed algorithm is its ability to take into account the dependence among curves.

Quantification of the effect of geographic versus environmental isolation on genetic differentiation

This package provides the minimal functionality needed to apply Gaussian processes in R. It offers a selection of the functionality of the GPML Matlab library.

The package implements efficient ways to evaluate and solve equations of the form Ax=b, where A is a Kronecker product of matrices. Functions to solve least squares problems of this type are also included.
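To illustrate why such systems can be handled efficiently, here is a minimal Python sketch (not the package's code) of evaluating (A ⊗ B)x without ever building the Kronecker product. It relies on the identity (A ⊗ B) vec(X) = vec(A X Bᵀ) with row-major vectorization. The function names `kron_matvec` and `kron_dense` are hypothetical and chosen for this example only.

```python
def kron_matvec(A, B, x):
    """Return (A kron B) x without forming the Kronecker product.

    A is m x n, B is p x q, and x has length n*q, stored row-major
    (entry (j, l) of the reshaped matrix sits at index j*q + l).
    """
    m, n = len(A), len(A[0])
    p, q = len(B), len(B[0])
    if len(x) != n * q:
        raise ValueError("x must have length n*q")
    # Reshape x into an n x q matrix X.
    X = [x[j * q:(j + 1) * q] for j in range(n)]
    # T = X B^T, an n x p matrix.
    T = [[sum(X[j][l] * B[k][l] for l in range(q)) for k in range(p)]
         for j in range(n)]
    # Y = A T, an m x p matrix; flattening it row-major gives the result.
    Y = [[sum(A[i][j] * T[j][k] for j in range(n)) for k in range(p)]
         for i in range(m)]
    return [Y[i][k] for i in range(m) for k in range(p)]


def kron_dense(A, B):
    """Explicit Kronecker product, used here only to check the shortcut."""
    return [[a * b for a in row_a for b in row_b]
            for row_a in A for row_b in B]
```

The shortcut works with the two small factors only, so the full (mp) x (nq) matrix is never stored. Solving Ax=b with square, invertible factors can be organised the same way, replacing the two multiplications with two smaller solves.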

Implements the largeVis algorithm (see Tang, et al. (2016) <DOI:10.1145/2872427.2883041>) for visualizing very large high-dimensional datasets. Also provides very fast search for approximate nearest neighbors, outlier detection, optimized implementations of the HDBSCAN*, DBSCAN and OPTICS clustering algorithms, and plotting functions for visualizing the above.

Simulates continuous distributions of random vectors using Markov chain Monte Carlo (MCMC). Users specify the distribution by an R function that evaluates the log unnormalized density. Algorithms are random walk Metropolis algorithm (function metrop), simulated tempering (function temper), and morphometric random walk Metropolis (Johnson and Geyer, 2012, <doi:10.1214/12-AOS1048>, function morph.metrop), which achieves geometric ergodicity by change of variable.
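As a rough illustration of the first algorithm named above, the following Python sketch implements a plain random walk Metropolis sampler driven by a user-supplied log unnormalized density. It is not the package's `metrop` function. The names `metropolis` and `standard_normal_log_density`, the Gaussian proposal, and the `scale` and `seed` parameters are assumptions made for this example.

```python
import math
import random


def metropolis(log_density, initial, n_samples, scale=1.0, seed=0):
    """Random walk Metropolis with independent Gaussian proposal steps.

    log_density maps a list of floats to the log of an unnormalized density.
    Returns the list of visited states and the acceptance rate.
    """
    rng = random.Random(seed)
    state = list(initial)
    current = log_density(state)
    samples = []
    accepted = 0
    for _ in range(n_samples):
        proposal = [s + rng.gauss(0.0, scale) for s in state]
        candidate = log_density(proposal)
        # Accept with probability min(1, exp(candidate - current)).
        # 1 - random() lies in (0, 1], so the logarithm is always defined.
        if math.log(1.0 - rng.random()) < candidate - current:
            state, current = proposal, candidate
            accepted += 1
        samples.append(list(state))
    return samples, accepted / n_samples


def standard_normal_log_density(v):
    """Log unnormalized density of a one-dimensional standard normal."""
    return -0.5 * v[0] ** 2
```

Only differences of log densities enter the acceptance test, which is why the normalizing constant never has to be known.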

Functions for implementing species dispersal into projections of species distribution models (e.g. under climate change scenarios).

Finds the maximum likelihood estimate of the mean vector and variance-covariance matrix for multivariate normal data with missing values.

Fitting possibly high dimensional penalized regression models. The penalty structure can be any combination of an L1 penalty (lasso and fused lasso), an L2 penalty (ridge) and a positivity constraint on the regression coefficients. The supported regression models are linear, logistic and Poisson regression and the Cox Proportional Hazards model. Cross-validation routines allow optimization of the tuning parameters.
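The description does not say which algorithm the package uses, so the sketch below substitutes cyclic coordinate descent to show how an L1 penalty, an L2 penalty and a positivity constraint combine in the linear-regression case. It minimises 0.5 * RSS + lambda1 * sum|beta_j| + 0.5 * lambda2 * sum beta_j^2. The function names, the objective scaling and the `n_sweeps` parameter are assumptions for this example. The fused lasso and the logistic, Poisson and Cox models are not covered.

```python
def soft_threshold(z, threshold):
    """Shrink z toward zero by threshold (the lasso proximal step)."""
    if z > threshold:
        return z - threshold
    if z < -threshold:
        return z + threshold
    return 0.0


def penalized_least_squares(X, y, lambda1=0.0, lambda2=0.0,
                            positive=False, n_sweeps=200):
    """Coordinate descent for L1 + L2 penalized linear regression.

    X is a list of rows, y a list of responses. No intercept is fitted.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)  # equals y - X beta while beta is all zeros
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) for j in range(p)]
    for _ in range(n_sweeps):
        for j in range(p):
            denom = col_sq[j] + lambda2
            if denom == 0.0:
                continue  # all-zero column and no ridge term: leave beta_j at 0
            # Inner product of column j with the residual that excludes beta_j.
            z = sum(X[i][j] * residual[i] for i in range(n)) + col_sq[j] * beta[j]
            new = soft_threshold(z, lambda1) / denom
            if positive:
                new = max(new, 0.0)  # positivity constraint on the coefficient
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new
    return beta
```

With an orthonormal design the update reduces to a single soft-thresholding step per coefficient, which makes the effect of each penalty easy to check by hand.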

Contains linear and nonlinear regression methods based on Partial Least Squares and Penalization Techniques. Model parameters are selected via cross-validation, and confidence intervals and tests for the regression coefficients can be computed via jackknifing.

Method for protein quantification based on identified and quantified peptides. protiq can be used for absolute and relative protein quantification. Input peptide abundance scores can come from various sources, including SRM transition areas and intensities or spectral counts derived from shotgun experiments. The package is still being extended to also include the model for protein identification, MIPGEM, presented in Gerster, S., Qeli, E., Ahrens, C.H. and Buehlmann, P. (2010). Protein and gene model inference based on statistical modeling in k-partite graphs. Proceedings of the National Academy of Sciences 107(27):12101-12106.

Finds the k nearest neighbours for every point in a given dataset in O(N log N) time using Arya and Mount's ANN library (v1.1.3). There is support for approximate as well as exact searches, fixed radius searches and 'bd' as well as 'kd' trees. The distance is computed using the L1 (Manhattan, taxicab) metric. Please see package 'RANN' for the same functionality using the L2 (Euclidean) metric.
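For reference, the L1 distance and an exact k-nearest-neighbour search can be sketched in a few lines of Python. This is a brute-force O(N²) illustration, not the kd-tree search of the ANN library. The function names are hypothetical, and excluding each point from its own neighbour list is an assumption of this sketch.

```python
import heapq


def manhattan(p, q):
    """L1 (Manhattan, taxicab) distance between two points."""
    return sum(abs(a - b) for a, b in zip(p, q))


def knn_l1(points, k):
    """For each point, return the indices of its k nearest other points.

    Ties in distance are broken by the smaller index.
    """
    if k >= len(points):
        raise ValueError("k must be smaller than the number of points")
    result = []
    for i, p in enumerate(points):
        candidates = ((manhattan(p, q), j)
                      for j, q in enumerate(points) if j != i)
        nearest = heapq.nsmallest(k, candidates)
        result.append([j for _, j in nearest])
    return result
```

A tree-based search avoids comparing every pair of points, which is where the O(N log N) behaviour quoted above comes from.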

Functions to prepare files needed for running BUGS in batch-mode, and running BUGS from R. Support for Linux and Windows systems with OpenBugs is emphasized.

Shrunken Centroids Regularized Discriminant Analysis for classification of high-dimensional data.

Provides functions for linking and de-duplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain.

Functions to fit Gaussian linear models by maximising the residual log likelihood where the covariance structure can be written as a linear combination of known matrices. Can be used for multivariate models and random effects models. Random effects models, including random interactions, can be specified in a straightforward manner. Code now optimised to use Sherman Morrison Woodbury identities for matrix inversion in random effects models. We've added the ability to fit models using any kernel as well as a function to return the mean and covariance of random effects conditional on the data (BLUPs).

Constrained clustering, transfer functions, and other methods for analysing Quaternary science data.

Methods for robust statistics, state of the art in the early 2000s, notably for robust regression and robust multivariate analysis.

Sensitivity indices with dependent correlated inputs, using a method based on PLS regression.

The package implements the model-based kernel machine method for detecting gene-centric gene-gene interactions of Li and Cui (2012).

This package contains a database of city, state, latitude, and longitude information for U.S. ZIP codes from the CivicSpace Database (August 2004) augmented by Daniel Coven's federalgovernmentzipcodes.us web site (updated January 22, 2012). Previous versions of this package (before 1.0) were based solely on the CivicSpace data, so an original version of the CivicSpace database is also included.