Learn R Programming

dataReporter (version 1.0.5)

Reproducible Data Screening Checks and Report of Possible Errors

Description

Data screening is an important first step of any statistical analysis. 'dataReporter' auto generates a customizable data report with a thorough summary of the checks and the results that a human can use to identify possible errors. It provides an extendable suite of test for common potential errors in a dataset. See Petersen AH, Ekstrøm CT (2019). "dataMaid: Your Assistant for Documenting Supervised Data Quality Screening in R." _Journal of Statistical Software_, *90*(6), 1-38 for more information.

Copy Link

Version

Install

install.packages('dataReporter')

Monthly Downloads

571

Version

1.0.5

License

GPL-2

Issues

Pull Requests

Stars

Forks

Maintainer

Claus Ekstrom

Last Published

April 13th, 2025

Functions in dataReporter (1.0.5)

allVisualFunctions

Overview of all available visualFunctions
classes

Extract the contents of the attribute classes
basicVisualCFLB

importFrom stats na.omit
allCheckFunctions

Overview of all available checkFunctions
defaultDateChecks

Default checks for Date variables
basicVisual

allClasses

Vector of all variable classes in dataReporter
checkFunction

Create an object of class checkFunction
countMissing

Summary function for missing values
bigPresidentData

Semi-artificial data about the US presidents (extended version)
defaultHavenlabelledChecks

Default checks for haven_labelled variables
description

Extract the contents of the attribute description
defaultHavenlabelledSummaries

Default summary functions for haven_labelled variables
defaultDateSummaries

Default summary functions for Date variables
centralValue

summaryFunction for central values
defaultCharacterChecks

Default checks for character variables
defaultLogicalChecks

Default checks for logical variables
checkResult

Create object of class checkResult
identifyCaseIssues

A checkFunction for identifying case issues
check

Perform checks of potential errors in variable/dataset
exampleData

Example data with zero-inflated variables
defaultFactorChecks

Default checks for factor variables
minMax

summaryFunction for minimum and maximum
identifyLoners

A checkFunction for identifying sparsely represented values (loners)
defaultCharacterSummaries

Default summary functions for character variables
defaultLogicalSummaries

Default summary functions for logical variables
defaultNumericChecks

Default checks for numeric variables
defaultLabelledChecks

Default checks for labelled variables
defaultFactorSummaries

Default summary functions for factor variables
defaultIntegerChecks

Default checks for integer variables
isSupported

Check if a variable has a class supported by dataReporter
defaultIntegerSummaries

Default summary functions for integer variables
render

Simplified Rmarkdown rendering
defaultLabelledSummaries

Default summary functions for labelled variables
identifyWhitespace

A checkFunction for identifying whitespace
identifyOutliers

A checkFunction for identifying outliers
defaultNumericSummaries

Default summary functions for numeric variables
presidentData

Semi-artificial data about the US presidents
identifyOutliersTBStyle

A checkFunction for identifying outliers Turkey Boxstole style
isKey

Check if a variable qualifies as a key
setChecks

Set check arguments for makeDataReport
summaryResult

Create object of class summaryResult
makeCodebook

Produce a data codebook
isCPR

Check if a variable consists of Danish CPR numbers
identifyMissing

A checkFunction for identifying miscoded missing values.
quartiles

summaryFunction for quartiles
isSingular

Check if a variable only contains a single value
refCat

summaryFunction that finds reference level for factor variables
smartNum

Smart class to handle numerics as factor
testData

Extended example data to test the features of dataReporter
summarize

Summarize a variable/dataset
identifyNums

A checkFunction
tableVisual

Produce tables for the makeDataReport visualizations.
toyData

Small example data to show the features of dataReporter
standardVisual

Produce distribution plots using ggplot from ggplot2.
makeDataReport

Produce a data report
messageGenerator

Produce a message for the output of a checkFunction
setSummaries

Set summary arguments for makeDataReport
summaryFunction

Create an object of class summaryFunction
uniqueValues

summaryFunction for unique values
setVisuals

Set visual arguments for makeDataReport
variableType

Summary function for original class
visualFunction

Create an object of class visualFunction
visualize

Produce distribution plots
whoami_available

Find out if the whoami package binaries is installed (git + whoami)
allSummaryFunctions

Overview of all available summaryFunctions
artData

Semi-artificial data about masterpieces of art