Learn R Programming

⚠️There's a newer version (0.6.5) of this package.Take me there.

dlookr (version 0.3.2)

Tools for Data Diagnosis, Exploration, Transformation

Description

A collection of tools that support data diagnosis, exploration, and transformation. Data diagnostics provides information and visualization of missing values and outliers and unique and negative values to help you understand the distribution and quality of your data. Data exploration provides information and visualization of the descriptive statistics of univariate variables, normality tests and outliers, correlation of two variables, and relationship between target variable and predictor. Data transformation supports binning for categorizing continuous variables, imputates missing values and outliers, resolving skewness. And it creates automated reports that support these three tasks.

Copy Link

Version

Install

install.packages('dlookr')

Monthly Downloads

2,837

Version

0.3.2

License

GPL-2

Maintainer

Choonghyun Ryu

Last Published

July 21st, 2018

Functions in dlookr (0.3.2)

diagnose_report

Reporting the information of data diagnosis
diagnose_report.tbl_dbi

Reporting the information of data diagnosis for table of the DBMS
dlookr-package

dlookr: Tools for Data Diagnosis, Exploration, Transformation
plot_normality.tbl_dbi

Plot distribution information of numerical data
plot_outlier

Plot outlier information of numerical data diagnosis
get_os

Finding Users Machine's OS
get_column_info

Describe column of table in the DBMS
eda_report

Reporting the information of EDA
plot_correlate.tbl_dbi

Visualize correlation plot of numerical data
normality

Performs the Shapiro-Wilk test of normality
eda_report.tbl_dbi

Reporting the information of EDA for table of the DBMS
normality.tbl_dbi

Performs the Shapiro-Wilk test of normality
plot_normality

Plot distribution information of numerical data
relate

Relationship between target variable and variable of interest
summary.bins

Summarizing Binned Variable
diagnose_outlier

Diagnose outlier of numerical variables
diagnose_outlier.tbl_dbi

Diagnose outlier of numerical variables in the DBMS
plot.bins

Visualize Distribution for an "bins" object
summary.imputation

Summarizing imputation information
plot.imputation

Visualize Information for an "imputation" Object
summary.transform

Summarizing transformation information
find_skewness

Finding skewed variables
get_class

Extracting a class of variables
find_class

Extract variable names or indices of a specific class
transform

Data Transformations
find_na

Finding variables including missing values
find_outliers

Finding variables including outliers
transformation_report

Reporting the information of transformation
diagnose_numeric

Diagnose data quality of numerical variables
plot.optimal_bins

Visualize Distribution for an "optimal_bins" Object
diagnose_numeric.tbl_dbi

Diagnose data quality of numerical variables in the DBMS
plot.relate

Visualize Information for an "relate" Object
imputate_na

Imputate Missing values
target_by

Target by one variables
imputate_outlier

Imputate Outliers
target_by.tbl_dbi

Target by one column in the DBMS
plot.transform

Visualize Information for an "transform" Object
plot_correlate

Visualize correlation plot of numerical data
plot_outlier.tbl_dbi

Plot outlier information of numerical data diagnosis in the DBMS
print.relate

Summarizing relate information
diagnose.tbl_dbi

Diagnose data quality of variables in the DBMS
diagnose_category

Diagnose data quality of categorical variables
correlate.tbl_dbi

Compute the correlation coefficient between two numerical data
binning

Binning the Numeric Data
diagnose

Diagnose data quality of variables
correlate

Compute the correlation coefficient between two numerical data
describe

Compute descriptive statistic
binning_by

Optimal Binning for Scoring Modeling
diagnose_category.tbl_dbi

Diagnose data quality of categorical variables in the DBMS
describe.tbl_dbi

Compute descriptive statistic