StatMeasures (version 1.0)

Easy Data Manipulation, Data Quality and Statistical Checks

Description

Offers functions for day-to-day data manipulation, data quality checks and post-modelling statistical checks. With them you can change the class of several variables to factor, remove duplicate observations from the data, create deciles of a variable, perform data quality checks for continuous (integer or numeric), categorical (factor) and date variables, and compute goodness-of-fit measures such as AUC for statistical models. The functions behave consistently for objects of class 'data.frame' and 'data.table', the latter being an enhanced 'data.frame' implemented in the package 'data.table'.
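
To make the manipulation steps named in the description concrete, here is a minimal base R sketch of three of them: converting variables to factor, dropping duplicate observations, and creating deciles. It deliberately does not call the package's functions, since their signatures are not shown on this page; the data frame and its column names (id, region, income) are hypothetical, and the rank-based decile rule is one common convention that may differ from the one the package uses.

```r
# Toy data; every column name here is hypothetical.
df <- data.frame(
  id     = c(1, 2, 2, 3, 4, 5, 6, 7, 8, 9, 10),
  region = c("N", "S", "S", "E", "W", "N", "S", "E", "W", "N", "S"),
  income = c(12, 25, 25, 31, 44, 58, 63, 70, 85, 91, 100),
  stringsAsFactors = FALSE
)

# Change the class of several variables to factor in one step.
to_factor <- c("id", "region")
df[to_factor] <- lapply(df[to_factor], factor)

# Remove duplicate observations (rows identical in every column).
df <- df[!duplicated(df), ]

# Create deciles: rank the values, then map the ranks onto 10 equal-sized groups.
df$income_decile <- ceiling(10 * rank(df$income, ties.method = "first") / nrow(df))
```

The same steps work on a 'data.table' only with its own syntax, which is the gap the package's consistent interface is described as closing.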

Install

install.packages('StatMeasures')

Monthly Downloads

14

Version

1.0

License

GPL-2

Maintainer

Akash Jain

Last Published

March 27th, 2015

Functions in StatMeasures (1.0)

actvspred

Comparison of actual and predicted linear response

outliers

Identify outliers in a variable

mape

Compute mean absolute percentage error

imputemiss

Impute missing values in a variable

auc

Area under curve of predicted binary response

dqdate

Data quality check of date variables

contents

Basic summary of the data

iv

Information value of an independent variable in predicting a binary response

pentile

Create pentiles of a variable

rmdupkey

Remove observations with duplicate keys from data

gini

Gini coefficient of a distribution

dqcategorical

Data quality check of categorical variables

decile

Create deciles of a variable

accuracy

Confusion matrix and overall accuracy of predicted binary response

randomise

Order the rows of a data set randomly

splitdata

Split modelling data into train and test sets

factorise

Change the class of variables to factor

dqcontinuous

Data quality check of continuous variables

rmdupobs

Remove duplicate observations from data

ks

Kolmogorov-Smirnov statistic for predicted binary response
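
For readers unfamiliar with the post-modelling measures listed above, the following base R sketch illustrates textbook definitions of three of them: mean absolute percentage error, area under the ROC curve, and the Kolmogorov-Smirnov statistic. The function names (mape_sketch, auc_sketch, ks_sketch) and the sample vectors are hypothetical and are not the package's own code; the package's mape, auc and ks may differ in detail, for example in scaling or in how scores are binned.

```r
# Mean absolute percentage error as a fraction; assumes no actual value is zero.
mape_sketch <- function(y, yhat) mean(abs((y - yhat) / y))

# AUC via the rank-sum (Mann-Whitney) identity; y is 0/1, yhat is a score.
auc_sketch <- function(y, yhat) {
  n1 <- sum(y == 1)
  n0 <- sum(y == 0)
  r  <- rank(yhat)  # average ranks handle tied scores
  (sum(r[y == 1]) - n1 * (n1 + 1) / 2) / (n1 * n0)
}

# KS statistic: largest gap between the empirical score distributions
# of the two classes, evaluated at every observed score.
ks_sketch <- function(y, yhat) {
  cuts <- sort(unique(yhat))
  f1 <- ecdf(yhat[y == 1])(cuts)
  f0 <- ecdf(yhat[y == 0])(cuts)
  max(abs(f1 - f0))
}

# Hypothetical actual classes and predicted scores.
y    <- c(0, 0, 1, 0, 1, 1)
yhat <- c(0.1, 0.3, 0.35, 0.6, 0.7, 0.9)

auc_sketch(y, yhat)
ks_sketch(y, yhat)
mape_sketch(c(100, 200), c(110, 180))
```

The rank-based AUC avoids building the ROC curve explicitly, which keeps the sketch short; it gives the same value as the trapezoidal area under that curve.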