powered by
Generates comprehensive summary statistics for optimal binning results, including predictive power classification based on established IV thresholds (Siddiqi, 2006), aggregate metrics, and feature-level diagnostics.
# S3 method for obwoe summary(object, sort_by = "iv", decreasing = TRUE, ...)
An S3 object of class "summary.obwoe" containing:
"summary.obwoe"
feature_summary
Data frame with per-feature statistics including IV classification (Unpredictive/Weak/Medium/Strong/Suspicious)
aggregate
Named list of aggregate statistics:
n_features
Total features processed
n_successful
Features without errors
n_errors
Features with errors
total_iv_sum
Sum of all feature IVs
mean_iv
Mean IV across features
median_iv
Median IV across features
mean_bins
Mean number of bins
iv_range
Min and max IV values
iv_distribution
Table of IV classification counts
target
Target column name
target_type
Target type (binary/multinomial)
An object of class "obwoe".
"obwoe"
Character string specifying the column to sort by. Options: "iv" (default), "n_bins", "feature".
"iv"
"n_bins"
"feature"
Logical. Sort in decreasing order? Default is TRUE for IV, FALSE for feature names.
TRUE
FALSE
Additional arguments (currently ignored).
Following Siddiqi (2006), features are classified by predictive power:
Features with IV > 0.50 should be examined for data leakage or overfitting, as such high values are rarely observed in practice.
Siddiqi, N. (2006). Credit Risk Scorecards: Developing and Implementing Intelligent Credit Scoring. John Wiley & Sons. tools:::Rd_expr_doi("10.1002/9781119201731")
obwoe for the main binning function, print.obwoe, plot.obwoe.
obwoe
print.obwoe
plot.obwoe
# \donttest{ set.seed(42) df <- data.frame( x1 = rnorm(500), x2 = rnorm(500), x3 = rnorm(500), target = rbinom(500, 1, 0.2) ) result <- obwoe(df, target = "target") summary(result) # }
Run the code above in your browser using DataLab