Learn R Programming

preproviz (version 0.2.0)

Tools for Visualization of Interdependent Data Quality Issues

Description

Data quality issues such as missing values and outliers are often interdependent, which makes preprocessing both time-consuming and leads to suboptimal performance in knowledge discovery tasks. This package supports preprocessing decision making by visualizing interdependent data quality issues through means of feature construction. The user can define his own application domain specific constructed features that express the quality of a data point such as number of missing values in the point or use nine default features. The outcome can be explored with plot methods and the feature constructed data acquired with get methods.

Copy Link

Version

Install

install.packages('preproviz')

Monthly Downloads

9

Version

0.2.0

License

GPL-2

Issues

Pull Requests

Stars

Forks

Maintainer

Markus Vattulainen

Last Published

July 9th, 2016

Functions in preproviz (0.2.0)

BaseClass-class

An abstract S4 class representing contructed features
getlongformatminmaxconstructeddata

getlongformatminmaxconstructeddata
getlongformatconstructeddata

get constructed data in long format
ControlClass-class

An S4 class representing setups to be executed
getlofsumdata

getlofsumdata
getvariableimportancedata

get random forest variable importance data
getparameters

getparameters
DataClass-class

An S4 class representing data objects
getlofscores

getlofscores
getclasslabels

getclasslabels
getcmdsdata

get classical multidimensional scaling from minmaxconstructed data
ParameterClass-class

An S4 class representing selected constructed features
plotCMDS

generic function for plotting classical multidimensional scaling
initializedataobject

constructor function for initializing a DataClass object
initializecontrolclassobject

constructor function for intializing a ControlClass object
SetUpClass-class

An S4 class representing setups
RunClass-class

An S4 class representing preproviz output (data and visualizations)
ReportClass-class

An S4 class representing visualizations
preproviz

the MAIN execution function
getminmaxconstructeddata

get contructed data that have been min-max normalized
getname

get name of an object
computeValue

generic function for computing constructed feature vectors
constructfeature

constructor function for adding constructed features to the system
getconstructeddata

getconstructeddata
getcombineddata

get basedata and constructed data combined
initializesetupclassobject

constructor function for initializing a SetUpClass object
initializeparameterclassobject

constructor function for intializing a ParameterClass objects
plotOUTLIERS

generic function for plotting density of LOF scores
plotLOFSUM

generic function for plotting lof sum of constructed features
getbasedata

getbasedata
getnumericbasedata

getnumericbasedata
getnumericombineddata

get numeric columns of combined data
plotDENSITY

generic function for plotting density estimates of constructed features
plotHEATMAP

generic function for plotting heatmap
plotVARIMP

generic function for plotting variable importance
plotVARCLUST

generic function for plotting variable clusters
defaultParameters

defaultParameters
AnalysisClass-class

An S4 class representing analysis data