OptimalBinningWoE (version 1.0.8)

fit_logistic_regression: Fit Logistic Regression Model

Description

This function fits a logistic regression model to binary classification data. It supports both dense and sparse matrix inputs for the predictor variables. The optimization is performed using the L-BFGS algorithm.

Usage

fit_logistic_regression(X_r, y_r, maxit = 300L, eps_f = 1e-08, eps_g = 1e-05)

Value

A list containing the results of the logistic regression fit:

coefficients

Numeric vector of estimated regression coefficients.

se

Numeric vector of standard errors for the coefficients.

z_scores

Numeric vector of z-statistics for testing coefficient significance.

p_values

Numeric vector of p-values associated with the z-statistics.

loglikelihood

Scalar. The maximized log-likelihood value.

gradient

Numeric vector. The gradient at the solution.

hessian

Matrix. The Hessian matrix evaluated at the solution.

convergence

Logical. Whether the algorithm converged successfully.

iterations

Integer. Number of iterations performed.

message

Character. Convergence message.

Arguments

X_r

A numeric matrix or sparse matrix (dgCMatrix) of predictor variables. Rows represent observations and columns represent features.

y_r

A numeric vector of binary outcome values (0 or 1). Must have the same number of observations as rows in X_r.

maxit

Integer. Maximum number of iterations for the optimizer. Default is 300.

eps_f

Numeric. Convergence tolerance for the function value. Default is 1e-8.

eps_g

Numeric. Convergence tolerance for the gradient norm. Default is 1e-5.
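Since `X_r` may be passed as a `dgCMatrix`, a sparse design matrix can be built with the Matrix package (shipped with R as a recommended package). This is a small sketch of constructing such an input; the variable names are illustrative, not part of the package API:

```r
library(Matrix)

set.seed(42)
n <- 50
# Mostly-zero predictors plus an intercept column of ones
X_dense  <- cbind(1, matrix(rbinom(n * 2, 1, 0.1), n, 2))
X_sparse <- Matrix(X_dense, sparse = TRUE)  # coerces to a dgCMatrix

class(X_sparse)
```

Either `X_dense` or `X_sparse` can then be supplied as `X_r`; the sparse form saves memory when most entries are zero.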

Details

The logistic regression model estimates the probability of the binary outcome \(y_i \in \{0, 1\}\) given predictors \(x_i\): $$P(y_i = 1 \mid x_i) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x_{i1} + \ldots + \beta_p x_{ip})}}$$

The function maximizes the log-likelihood: $$\ell(\beta) = \sum_{i=1}^n [y_i \cdot (\beta^T x_i) - \ln(1 + e^{\beta^T x_i})]$$
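As a sketch (not the package internals), the log-likelihood above can be evaluated directly in R for a given coefficient vector, guarding `log(1 + e^eta)` against overflow for large linear predictors:

```r
# Log-likelihood of a logistic model: sum_i [ y_i * eta_i - log(1 + e^{eta_i}) ]
loglik <- function(beta, X, y) {
  eta <- drop(X %*% beta)
  # For large eta, log1p(exp(eta)) overflows; use the limit log(1 + e^eta) ~ eta
  log1pexp <- ifelse(eta > 30, eta, log1p(exp(eta)))
  sum(y * eta - log1pexp)
}
```

At `beta = 0` every fitted probability is 0.5, so the log-likelihood reduces to `-n * log(2)`, which is a convenient spot-check.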

Standard errors are the square roots of the diagonal of the inverse observed information matrix (the negative of the Hessian of the log-likelihood, evaluated at the estimated coefficients). Z-scores (each coefficient divided by its standard error) and two-sided p-values are derived under the asymptotic normality of the maximum-likelihood estimator.
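The Wald statistics described above follow mechanically from the coefficients and their standard errors. A small sketch with illustrative numbers (not real model output):

```r
# Wald z-scores and two-sided p-values under asymptotic normality
beta_hat <- c(0.52, 1.18)  # illustrative coefficient estimates
se_hat   <- c(0.21, 0.30)  # illustrative standard errors

z <- beta_hat / se_hat
p <- 2 * pnorm(-abs(z))    # two-sided tail probability

round(cbind(z = z, p = p), 4)
```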

Examples

library(OptimalBinningWoE)

# Generate sample data
set.seed(123)
n <- 100
p <- 3
X <- matrix(rnorm(n * p), n, p)
# Add intercept column
X <- cbind(1, X)
colnames(X) <- c("(Intercept)", "X1", "X2", "X3")

# True coefficients
beta_true <- c(0.5, 1.2, -0.8, 0.3)

# Generate linear predictor
eta <- X %*% beta_true

# Generate binary outcome
prob <- 1 / (1 + exp(-eta))
y <- rbinom(n, 1, prob)

# Fit logistic regression
result <- fit_logistic_regression(X, y)

# View coefficients and statistics
print(data.frame(
  Coefficient = result$coefficients,
  Std_Error = result$se,
  Z_score = result$z_scores,
  P_value = result$p_values
))

# Check convergence
cat("Converged:", result$convergence, "\n")
cat("Log-Likelihood:", result$loglikelihood, "\n")
