incon: Computing the inconsistency rate for Chi2 discretization algorithm
Description
This function computes the inconsistency rate of dataset.
Usage
incon(data)
Arguments
data
dataset matrix
Value
inConRate
the inconsistency rate of the dataset
Details
The inconsistency rate of dataset is calculated as follows:
(1) two instances are considered inconsistent if they match except for their class labels;
(2) for all the matching instances (without considering their class labels), the inconsistency count is the number of the instances minus the largest number of instnces of class labels; (3) the inconsistency rate is the sum of all the inconsistency counts divided by the total number of instances.
References
Liu, H. and Setiono, R. (1995), Chi2: Feature selection and discretization of numeric attributes , Tools with Artificial Intelligence, 388--391.
Liu, H. and Setiono, R. (1997), Feature selection and discretization, IEEE transactions on knowledge and data engineering, Vol.9, no.4, 642--645.