woebin

A data frame with both x (predictor/feature) and y (response/label) variables.

Name of x variables. Default NULL If x is NULL, all variables exclude y will counted as x variables.

List of break points, defaults NULL If it is not NULL, variable binning will based on the provided breaks.

breaks_list

The share of initial binning class number over total. Accepted range: 0.01-0.2; default 0.02.

min_perc_total

Stop binning segmentation when information value gain ratio less than the stop_limit. Accepted range: 0-0.5; default 0.1.

stop_limit

Value of positive class, default "bad|1".

positive

Logical. If it is TRUE, print the variable name when generate binning.

print_step

<code>woebin</code> generates optimal binning for both numerical and categorical variables using tree-like segmentation. <code>woebin</code> can also customizing breakpoints for both numerical and categorical variables.

Makes the development of credit risk scorecard easily and efficiently by providing functions such as information value, variable filter, optimal woe binning, scorecard scaling and performance evaluation etc. The references including:
1. Refaat, M. (2011, ISBN: 9781447511199). Credit Risk Scorecard: Development and Implementation Using SAS.
2. Siddiqi, N. (2006, ISBN: 9780471754510). Credit risk scorecards. Developing and Implementing Intelligent Credit Scoring.

Data Engineering and BI courses are free this week!

woebin: WOE Binning

Description

Usage

Arguments

Value

See Also

Examples