Max Kuhn

Max Kuhn

21 packages on CRAN

1 packages on GitHub

99.99th

Percentile

Raw and processed versions of the data from De Cock (2011) <http://ww2.amstat.org/publications/jse> are included in the package.

99.99th

Percentile

A few functions and several data set for the Springer book 'Applied Predictive Modeling'.

C50

cran
99.99th

Percentile

C5.0 decision trees and rule-based models for pattern recognition that extend the work of Quinlan (1993, ISBN:1-55860-238-0).

caret

cran
99.99th

Percentile

Misc functions for training and plotting classification and regression models.

contrast

cran
99.99th

Percentile

One degree of freedom contrasts for lm, glm, gls, and geese objects.

Cubist

cran
99.99th

Percentile

Regression modeling using rules with added instance-based corrections.

99.99th

Percentile

S3 classes for multivariate optimization using the desirability function by Derringer and Suich (1980).

dials

cran
99.99th

Percentile

Many models contain tuning parameters (i.e. parameters that cannot be directly estimated from the data). These tools can be used to define objects for creating, simulating, or validating values for such parameters.

embed

cran
99.99th

Percentile

Predictors can be converted to one or more numeric representations using simple generalized linear models <arXiv:1611.09477> or nonlinear models <arXiv:1604.06737>. All encoding methods are supervised.

odfWeave

cran
99.99th

Percentile

Sweave processing of Open Document Format (ODF) files

QSARdata

cran
99.99th

Percentile

Molecular descriptors and outcomes for several public domain data sets

recipes

cran
99.99th

Percentile

An extensible framework to create and preprocess design matrices. Recipes consist of one or more data manipulation and analysis "steps". Statistical parameters for the steps can be estimated from an initial data set and then applied to other data sets. The resulting design matrices can then be used as inputs into statistical or machine learning models.

rsample

cran
99.99th

Percentile

Classes and functions to create and summarize different types of resampling objects (e.g. bootstrap, cross-validation).

sparseLDA

cran
99.99th

Percentile

Performs sparse linear discriminant analysis for Gaussians and mixture of Gaussian models.

parsnip

github
99.99th

Percentile

A common interface is provided to allow users to specify a model without having to remember the different argument names across different functions or computational engines (e.g. R, spark, stan, etc).

tidymodels

cran
99.99th

Percentile

The tidy modeling "verse" is a collection of packages for modeling and statistical analysis that share the underlying design philosophy, grammar, and data structures of the tidyverse.

99.99th

Percentile

Bayesian analysis used here to answer the question: "when looking at resampling results, are the differences between models 'real'?" To answer this, a model can be created were the performance statistic is the resampling statistics (e.g. accuracy or RMSE). These values are explained by the model types. In doing this, we can get parameter estimates for each model's affect on performance and make statistical (and practical) comparisons between models. The methods included here are similar to Benavoli et al (2017) <http://jmlr.org/papers/v18/16-305.html>.

DescTools

cran
99.99th

Percentile

A collection of miscellaneous basic statistic functions and convenience wrappers for efficiently describing data. The author's intention was to create a toolbox, which facilitates the (notoriously time consuming) first descriptive tasks in data analysis, consisting of calculating descriptive statistics, drawing graphical summaries and reporting the results. The package contains furthermore functions to produce documents using MS Word (or PowerPoint) and functions to import data from Excel. Many of the included functions can be found scattered in other packages and other sources written partly by Titans of R. The reason for collecting them here, was primarily to have them consolidated in ONE instead of dozens of packages (which themselves might depend on other packages which are not needed at all), and to provide a common and consistent interface as far as function and arguments naming, NA handling, recycling rules etc. are concerned. Google style guides were used as naming rules (in absence of convincing alternatives). The 'camel style' was consequently applied to functions borrowed from contributed R packages as well.

probably

cran
99.99th

Percentile

Models can be improved by post-processing class probabilities, by: recalibration, conversion to hard probabilities, assessment of equivocal zones, and other activities. 'probably' contains tools for conducting these operations.

rngtools

cran
99.99th

Percentile

Provides a set of functions for working with Random Number Generators (RNGs). In particular, a generic S4 framework is defined for getting/setting the current RNG, or RNG data that are embedded into objects for reproducibility. Notably, convenient default methods greatly facilitate the way current RNG settings can be changed.

spectacles

cran
99.99th

Percentile

Stores and eases the manipulation of spectra and associated data, with dedicated classes for spatial and soil-related data.

yardstick

cran
99.99th

Percentile

Tidy tools for quantifying how well model fits to a data set such as confusion matrices, class probability curve summaries, and regression metrics (e.g., RMSE).