Generates an empty list of the correct class to store results
makeValidKeys
processFlow Handles the process flow for the whole package
matchColumns : create subset of DFA and DFB to contain matching column names for both data frames
Create variable mismatch table
compareNames : compare the intersect of colInfoA and colInfoB and return boolean of matched columns for each data frame
Generate two dataframes that contain the same rows based on a two-column index
checkEmpty
Compare two data frames
Create a dataframe of the rows that don't match
collapseClasses. Collapse the classes of an object to a single string
Takes the raw info for the meta block of the output
and puts it in a format usable by the updateCompareObject
function
Converts cleaning info into a format consumable by updateCompareObject.
Converts the output of the column matching logic to something consumable by updateCompareObject.
makeValidNames
Subsets on the variables that have a coercion.
Check object is of class dataCompareRobject
Extracts the column names only in one data frame from a table of
match information
isNotNull: is object not null
outputSectionHeader: creates an outputSectionHeader
prepareData Prepares data for comparison in 3 stages.
1. Match columns - filter dataframes to those columns that match and summarise differences
2. Match rows - filter dataframes to those rows that match and summarise differences
3. Coerce data
isSingleNA
listObsNotVerbose
Create mismatch object
checkKeysExist
colsWithUnequalValues: a dataframe summarising a column with unequal values
Create mismatch object
Warn users if the calculation is likely to be slow
executeCoercions:
Extract data from a dataCompareR comparison
Printing RCompare Output
listObsVerbose
summary.dataCompareRobject
Summarizing RCompare Output
updateCompareObject.colmatching
Adds a colMatching block to the output
Checks whether elements in two input data frames are equal.
trimCharVars: trim white spaces in character variables from an input dataframe
Compare data. Wrapper for comparison functionality.
print.summary.dataCompareRobject
Printing summaryRCompare Output
updateCompareObject.matches
Adds a colMatching block to the output
createReportText: prepares text which is used in the summary report
Saves R markdown and HTML reports in the area specified by the user. Reports are called RcompareReport.Rmd (.html)
Uses knitr package to create tables in the markdown (createReportText function) and HTML report.
updateCompareObject.rowmatching
Adds a rowMatching block to the output
validateArguments
updateCompareObject.mismatches
Adds a colMatching block to the output
Save a report based on a dataCompareR object
subsetDataColumns : create subset of DFA and DFB to contain matching column names for both data frames
Takes raw info for meta and adds it to the compare object
function for updating a compare object with
information passed to it from the match rows function
Generate two dataframes that contain the same rows based on a two-column index
Generate two dataframes and returns subsets of these dataframes that have shared rows.
mismatchHighStop Checks if we've exceeded threshold of mismatches
orderColumns: order columns by treated column names
Generic function for updating a compare object with
information passed to it, that has methods based on the class
of the info argument.
updateCompareObject.cleaninginfo
Updates cleaning info in the compare object
Generate two dataframes that contain the same rows based on a single index
Creates a list of info about the dataframe.
validateData : routine to validate the input data
rcompObjItemLength: return length of an item, returning 0 if null, and handling the fact that
we might have a data frames or a vector
Create variable mismatch details
Round all numeric fields in a data frame
checkForRcompareCol
allVarMatchMessage
Checks that a list of indexes areunique
cleanColNames : get colnames, remove leading and trailing whitespace and push to upper case
CheckNA
coerceData
coerceFactorsToChar: convert all factor type fields to characters
createTextSummary: create a text based summary of an dataCompareR object
Place to store and access the current object version.