FTRL

Creates 'Follow the Regularized Leader' model.
Only logistic regression implemented at the moment.

datasets

Implements many algorithms for statistical learning on
sparse matrices - matrix factorizations, matrix completion,
elastic net regressions, factorization machines.
Also 'rsparse' enhances 'Matrix' package by providing methods for
multithreaded <sparse, dense> matrix products and native slicing of
the sparse matrices in Compressed Sparse Row (CSR) format.
List of the algorithms for regression problems:
1) Elastic Net regression via Follow The Proximally-Regularized Leader (FTRL)
Stochastic Gradient Descent (SGD), as per McMahan et al(, <doi:10.1145/2487575.2488200>)
2) Factorization Machines via SGD, as per Rendle (2010, <doi:10.1109/ICDM.2010.127>)
List of algorithms for matrix factorization and matrix completion:
1) Weighted Regularized Matrix Factorization (WRMF) via Alternating Least
Squares (ALS) - paper by Hu, Koren, Volinsky (2008, <doi:10.1109/ICDM.2008.22>)
2) Maximum-Margin Matrix Factorization via ALS, paper by Rennie, Srebro
(2005, <doi:10.1145/1102351.1102441>)
3) Fast Truncated Singular Value Decomposition (SVD), Soft-Thresholded SVD,
Soft-Impute matrix completion via ALS - paper by Hastie, Mazumder
et al. (2014, <arXiv:1410.2596>)
4) Linear-Flow matrix factorization, from 'Practical linear models for
large-scale one-class collaborative filtering' by Sedhain, Bui, Kawale et al
(2016, ISBN:978-1-57735-770-4)
5) GlobalVectors (GloVe) matrix factorization via SGD, paper by Pennington,
Socher, Manning (2014, <https://www.aclweb.org/anthology/D14-1162>)
Package is reasonably fast and memory efficient - it allows to work with large
datasets - millions of rows and millions of columns. This is particularly useful
for practitioners working on recommender systems.

Dmitriy Selivanov

rsparse

Statistical Learning on Sparse Matrices

Drew Schmidt

Wei-Chen Chen

FTRL function

Format

<dl class="dl-horizontal">
<dt><code>verbose</code></dt><dd><code>logical = TRUE</code> whether to display training inforamtion</dd>
</dl>

Fields

For usage details see Methods, Arguments and Examples sections.<pre>
ftrl = FTRL$new(learning_rate = 0.1, learning_rate_decay = 0.5,
lambda = 0, l1_ratio = 1, dropout = 0, family = "binomial")
ftrl$partial_fit(x, y, ...)
ftrl$predict(x, ...)
ftrl$coef()
</pre>

Usage

<dl class="dl-horizontal">
 <dt><code>FTRL$new(learning_rate = 0.1, learning_rate_decay = 0.5, lambda = 0,
 l1_ratio = 1, dropout = 0, family = "binomial")</code></dt><dd>Constructor
 for FTRL model. For description of arguments see Arguments section.</dd>
 <dt><code>$partial_fit(x, y, ...)</code></dt><dd>fits/updates model
 given input matrix <code>x</code> and target vector <code>y</code>.
 <code>x</code> shape = (n_samples, n_features)</dd>
 <dt><code>$predict(x, ...)</code></dt><dd>predicts output <code>x</code></dd>
 <dt><code>$coef()</code></dt><dd>return coefficients of the regression model</dd>
 <dt><code>$dump()</code></dt><dd>create dump of the model (actually <code>list</code> with current model parameters)</dd>
 <dt><code>$load(x)</code></dt><dd>load/initialize model from dump)</dd>
</dl>

Methods

<dl class="dl-horizontal">
 <dt>ftrl</dt><dd><code>FTRL</code> object</dd>
 <dt>x</dt><dd>Input sparse matrix - native format is <code>Matrix::RsparseMatrix</code>.
 If <code>x</code> is in different format, model will try to convert it to <code>RsparseMatrix</code>
 with <code>as(x, "RsparseMatrix")</code> call</dd>
 <dt>learning_rate</dt><dd>learning rate</dd>
 <dt>learning_rate_decay</dt><dd>learning rate which controls decay. Please refer to FTRL paper for details.
 Usually convergense does not heavily depend on this parameter, so default value 0.5 is safe.</dd>
 <dt>lambda</dt><dd>regularization parameter</dd>
 <dt>l1_ratio</dt><dd>controls L1 vs L2 penalty mixing.
 1 = Lasso regression, 0 = Ridge regression. Elastic net is in between.</dd>
 <dt>dropout</dt><dd>dropout - percentage of random features to
 exclude from each sample. Acts as regularization.</dd>
 <dt>family</dt><dd>a description of the error distribution and link function to be used in the model.
 Only <code>binomial</code> (or logistic regression) supported at the moment.</dd>
</dl>

FTRL: Creates FTRL proximal logistic regression model.

Description

Usage

Format

Fields

Usage

Methods

Arguments

Examples