SmartEDA (version 0.3.2)

ExpStat: Function provides summary statistics for individual categorical predictors

Description

Provides bivariate summary statistics for each categorical predictor against a binary target variable. Output includes the chi-square value, degrees of freedom, information value, and p-value.

Usage

ExpStat(X,Y,valueOfGood = NULL)

Arguments

X

Independent categorical variable.

Y

Binary response variable; it takes the value 1 or 0.

valueOfGood

Value of Y that is used as the reference category.

Value

The function returns the following summary statistics:

Unique levels

Chi-square statistic

P value

Degrees of freedom

Information value

Predictive class
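The information value listed above is conventionally defined in terms of the share of "good" and "bad" outcomes in each level of X. The sketch below applies that textbook definition in base R. It is an illustration under stated assumptions, not the SmartEDA source: `good_value` is a hypothetical name standing in for valueOfGood, and no adjustment is made for levels with zero counts, which ExpStat may treat differently.

```r
# Illustrative sketch of the conventional information value (IV) formula.
# Assumption: this is the textbook definition, not code taken from SmartEDA.
X <- mtcars$cyl          # categorical predictor with no empty cells against am
Y <- mtcars$am           # binary response (0/1)
good_value <- 1          # hypothetical stand-in for valueOfGood

tab  <- table(X, Y)
good <- tab[, as.character(good_value)]   # count of "good" outcomes per level
bad  <- rowSums(tab) - good               # count of remaining outcomes per level

pct_good <- good / sum(good)              # distribution of good across levels
pct_bad  <- bad / sum(bad)                # distribution of bad across levels

woe <- log(pct_good / pct_bad)            # weight of evidence per level
iv  <- sum((pct_good - pct_bad) * woe)    # information value
iv
```

Each term of the sum is non-negative, so the result does not depend on which class is labelled "good"; a level with a zero count would make the logarithm infinite, which is why this sketch uses a predictor without empty cells.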

Details

For a given binary Y variable and a categorical X variable, the function computes summary statistics. These include Pearson's chi-squared test for count data, computed with "chisq.test", which performs chi-squared contingency table tests and goodness-of-fit tests. If X or Y contains NA values, NA is treated as a separate category when the contingency table is built.

The output also reports the number of unique levels of each categorical X variable and the degrees of freedom.
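The chi-square part of this computation can be approximated directly in base R. The sketch below is an assumption about the general approach, not the package's actual implementation: it builds the contingency table with `table()`, keeping NA as its own category when present, and passes it to `chisq.test`.

```r
# Illustrative sketch only; not the SmartEDA source.
X <- mtcars$carb
Y <- mtcars$am

# Contingency table; useNA = "ifany" keeps NA as a separate level if present.
tab <- table(X, Y, useNA = "ifany")

# Pearson's chi-squared test. Small expected counts trigger a warning
# for this data, so it is suppressed here.
res <- suppressWarnings(chisq.test(tab))

n_levels <- nrow(tab)               # unique levels of X
chi_sq   <- unname(res$statistic)   # chi-square value
df       <- unname(res$parameter)   # degrees of freedom
p_value  <- res$p.value             # p-value
```

For a table with r rows and c columns the degrees of freedom are (r - 1)(c - 1), so the six levels of carb against a binary Y give 5.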

See Also

chisq.test

Examples

# NOT RUN {
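library(SmartEDA)
# Categorical predictor: number of carburetors
X = mtcars$carb
# Binary response: transmission (0 = automatic, 1 = manual)
Y = mtcars$am
ExpStat(X,Y,valueOfGood = 1)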
# }
