dataCompareR v0.1.1

0

Monthly downloads

0th

Percentile

Compare Two Data Frames and Summarise the Difference

Easy comparison of two tabular data objects in R. Specifically designed to show differences between two sets of data in a useful way that should make it easier to understand the differences, and if necessary, help you work out how to remedy them. Aims to offer a more useful output than all.equal() when your two data sets do not match, but isn't intended to replace all.equal() as a way to test for equality.

Functions in dataCompareR

Name Description
allVarMatchMessage allVarMatchMessage
checkEmpty checkEmpty
checkKeysExist checkKeysExist
checkForRCompareCol checkForRcompareCol
colsWithUnequalValues colsWithUnequalValues: a dataframe summarising a column with unequal values
collapseClasses collapseClasses. Collapse the classes of an object to a single string
checkUniqueness Checks that a list of indexes areunique
createRowMatching function for updating a compare object with information passed to it from the match rows function
cleanColNames cleanColNames : get colnames, remove leading and trailing whitespace and push to upper case
createTextSummary createTextSummary: create a text based summary of an dataCompareR object
coerceData coerceData
createColMatching Converts the output of the column matching logic to something consumable by updateCompareObject.
matchColumns matchColumns : create subset of DFA and DFB to contain matching column names for both data frames
createCompareObject Generates an empty list of the correct class to store results
matchMultiIndex Generate two dataframes that contain the same rows based on a two-column index
createMismatches Create mismatch object
mismatchHighStop mismatchHighStop Checks if we've exceeded threshold of mismatches
createReportText createReportText: prepares text which is used in the summary report Saves R markdown and HTML reports in the area specified by the user. Reports are called RcompareReport.Rmd (.html) Uses knitr package to create tables in the markdown (createReportText function) and HTML report.
orderColumns orderColumns: order columns by treated column names
generateMismatchData Extract data from a dataCompareR comparison
getCoercions Subsets on the variables that have a coercion.
createAntiSubset Create a dataframe of the rows that don't match
saveReport Save a report based on a dataCompareR object
createCleaningInfo Converts cleaning info into a format consumable by updateCompareObject.
subsetDataColumns subsetDataColumns : create subset of DFA and DFB to contain matching column names for both data frames
currentObjVersion Place to store and access the current object version.
compareData Compare data. Wrapper for comparison functionality.
executeCoercions executeCoercions:
compareNames compareNames : compare the intersect of colInfoA and colInfoB and return boolean of matched columns for each data frame
isNotNull isNotNull: is object not null
makeValidKeys makeValidKeys
listObsNotVerbose listObsNotVerbose
makeValidNames makeValidNames
matchNoIndex Generate two dataframes that contain the same rows based on a two-column index
rcompObjItemLength rcompObjItemLength: return length of an item, returning 0 if null, and handling the fact that we might have a data frames or a vector
rounddf Round all numeric fields in a data frame
matchSingleIndex Generate two dataframes that contain the same rows based on a single index
updateCompareObject.colmatching Adds a colMatching block to the output
metaDataInfo Creates a list of info about the dataframe.
updateCompareObject.matches Adds a colMatching block to the output
matchRows Generate two dataframes and returns subsets of these dataframes that have shared rows.
print.dataCompareRobject Printing RCompare Output
processFlow processFlow Handles the process flow for the whole package
validateData validateData : routine to validate the input data
rCompare Compare two data frames
variableDetails Create variable mismatch details
summary.dataCompareRobject Summarizing RCompare Output
createMeta Takes the raw info for the meta block of the output and puts it in a format usable by the updateCompareObject function
trimCharVars trimCharVars: trim white spaces in character variables from an input dataframe
updateCompareObject.rowmatching Adds a rowMatching block to the output
createMismatchObject Create mismatch object
validateArguments validateArguments
getMismatchColNames Extracts the column names only in one data frame from a table of match information
variableMismatches Create variable mismatch table
print.summary.dataCompareRobject Printing summaryRCompare Output
outputSectionHeader outputSectionHeader: creates an outputSectionHeader
updateCompareObject.meta Takes raw info for meta and adds it to the compare object
prepareData prepareData Prepares data for comparison in 3 stages. 1. Match columns - filter dataframes to those columns that match and summarise differences 2. Match rows - filter dataframes to those rows that match and summarise differences 3. Coerce data
updateCompareObject Generic function for updating a compare object with information passed to it, that has methods based on the class of the info argument.
updateCompareObject.mismatches Adds a colMatching block to the output
updateCompareObject.cleaninginfo Updates cleaning info in the compare object
is.dataCompareRobject Check object is of class dataCompareRobject
listObsVerbose listObsVerbose
locateMismatches Checks whether elements in two input data frames are equal.
coerceFactorsToChar coerceFactorsToChar: convert all factor type fields to characters
No Results!

Vignettes of dataCompareR

Name
dataCompareR.Rmd
No Results!

Last month downloads

Details

URL https://github.com/capitalone/dataCompareR
BugReports https://github.com/capitalone/dataCompareR/issues
License Apache License 2.0 | file LICENSE
LazyData true
RoxygenNote 6.0.1
VignetteBuilder knitr
NeedsCompilation no
Packaged 2017-11-10 15:31:21 UTC; rob
Repository CRAN
Date/Publication 2017-11-14 09:01:03 UTC

Include our badge in your README

[![Rdoc](http://www.rdocumentation.org/badges/version/dataCompareR)](http://www.rdocumentation.org/packages/dataCompareR)