create_splits

holdout

cv_folds

Split the observed cells of a data matrix into training and validation sets 
for hyperparameter tuning. Methods are available for repeated holdout 
validation and \(K\)-fold cross-validation.
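To make the idea of repeated holdout validation concrete, here is a minimal base R sketch. It is not the package's implementation: the helper `sketch_holdout()` and the toy matrix are hypothetical, and only the argument names `indices`, `pct`, and `R` are borrowed from this page. Each of the `R` replications draws a random share `pct` of the observed cells into a validation set and leaves the rest for training.

```r
# Hypothetical helper for illustration only (not part of RMCLab):
# draw R random training/validation splits of the observed cells.
sketch_holdout <- function(indices, pct = 0.1, R = 10L) {
  n <- length(indices)
  # number of cells per validation set, at least one
  n_validation <- max(1L, round(pct * n))
  lapply(seq_len(R), function(r) {
    # positions (within 'indices') drawn into the validation set
    in_validation <- sample.int(n, n_validation)
    list(training = indices[-in_validation],
         validation = indices[in_validation])
  })
}

# toy 5 x 4 rating matrix with three missing cells
X <- matrix(rep(1:5, times = 4), nrow = 5)
X[c(2, 7, 13)] <- NA
observed <- which(!is.na(X))   # indices of the 17 observed cells

set.seed(1)
splits <- sketch_holdout(observed, pct = 0.2, R = 3L)
```

Every element of `splits` holds one `training` and one `validation` vector of cell indices. Together they cover all observed cells without overlap.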

utilities

Collection of methods for rating matrix completion, which is a statistical framework for recommender systems. Another relevant application is the imputation of rating-scale survey data in the social and behavioral sciences. Note that matrix completion and imputation are synonymous terms used in different streams of the literature. The main functionality implements robust matrix completion for discrete rating-scale data with a low-rank constraint on a latent continuous matrix (Archimbaud, Alfons, and Wilms (2025) <doi:10.48550/arXiv.2412.20802>). In addition, the package provides wrapper functions for 'softImpute' (Mazumder, Hastie, and Tibshirani, 2010, <https://www.jmlr.org/papers/v11/mazumder10a.html>; Hastie, Mazumder, Lee, and Zadeh, 2015, <https://www.jmlr.org/papers/v16/hastie15a.html>) for easy tuning of the regularization parameter, as well as benchmark methods such as median imputation and mode imputation.

Andreas Alfons

RMCLab

Lab for Matrix Completion and Imputation of Discrete Rating Data

Aurore Archimbaud

create_splits function

<dl><dt>indices</dt>
<dd>an integer vector giving the indices of observed cells in a 
data matrix.</dd>
<dt>control</dt>
<dd>a control object inheriting from class 
<code>"split_control"</code> as generated by <code>holdout_control()</code> 
for repeated holdout validation or <code>cv_folds_control()</code> for 
\(K\)-fold cross-validation.</dd>
<dt>pct</dt>
<dd>numeric in the interval (0, 1); the proportion of observed cells 
in the data matrix to be randomly selected into the validation set (defaults 
to 0.1, i.e., 10%).</dd>
<dt>R</dt>
<dd>an integer giving the number of random splits into training and 
validation sets (defaults to 10).</dd>
<dt>K</dt>
<dd>an integer giving the number of cross-validation folds (defaults 
to 5).</dd></dl>
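As a complement to the argument `K`, the following base R sketch shows one common way to form \(K\) cross-validation folds from the observed cells. It is an illustration under stated assumptions, not the package's code: `sketch_cv_folds()` is a hypothetical helper, and the cell indices are made up. The positions are shuffled once and dealt into `K` folds of nearly equal size, so each observed cell is used for validation exactly once.

```r
# Hypothetical helper for illustration only (not part of RMCLab):
# partition the observed cells into K cross-validation folds.
sketch_cv_folds <- function(indices, K = 5L) {
  n <- length(indices)
  shuffled <- sample(n)                 # random permutation of positions
  fold_id <- rep_len(seq_len(K), n)     # fold label for each shuffled position
  lapply(seq_len(K), function(k) {
    list(training = indices[shuffled[fold_id != k]],
         validation = indices[shuffled[fold_id == k]])
  })
}

# made-up indices of 17 observed cells in a 5 x 4 matrix
observed <- c(1L, 3:6, 8:12, 14:20)

set.seed(1)
folds <- sketch_cv_folds(observed, K = 5L)
```

With 17 observed cells and `K = 5`, two folds contain four validation cells and three folds contain three.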


Create splits of observed data cells for hyperparameter tuning — create_splits

