cleanColNames : get colnames, remove leading and trailing whitespace and push to upper case
CheckNA
compareNames : compare the intersect of colInfoA and colInfoB and return boolean of matched columns for each data frame
Checks that a list of indexes areunique
function for updating a compare object with
information passed to it from the match rows function
outputSectionHeader: creates an outputSectionHeader
Create a dataframe of the rows that don't match
prepareData Prepares data for comparison in 3 stages.
1. Match columns - filter dataframes to those columns that match and summarise differences
2. Match rows - filter dataframes to those rows that match and summarise differences
3. Coerce data
subsetDataColumns : create subset of DFA and DFB to contain matching column names for both data frames
Generate two dataframes that contain the same rows based on a two-column index
Compare two data frames
matchColumns : create subset of DFA and DFB to contain matching column names for both data frames
processFlow Handles the process flow for the whole package
Save a report based on a dataCompareR object
Create mismatch object
Generic function for updating a compare object with
information passed to it, that has methods based on the class
of the info argument.
Create mismatch object
coerceFactorsToChar: convert all factor type fields to characters
validateArguments
collapseClasses. Collapse the classes of an object to a single string
Converts cleaning info into a format consumable by updateCompareObject.
Extract data from a dataCompareR comparison
Converts the output of the column matching logic to something consumable by updateCompareObject.
updateCompareObject.rowmatching
Adds a rowMatching block to the output
executeCoercions:
isSingleNA
listObsNotVerbose
Checks whether elements in two input data frames are equal.
orderColumns: order columns by treated column names
Place to store and access the current object version.
trimCharVars: trim white spaces in character variables from an input dataframe
listObsVerbose
mismatchHighStop Checks if we've exceeded threshold of mismatches
summary.dataCompareRobject
Summarizing RCompare Output
createTextSummary: create a text based summary of an dataCompareR object
updateCompareObject.cleaninginfo
Updates cleaning info in the compare object
Takes raw info for meta and adds it to the compare object
Printing RCompare Output
checkEmpty
Extracts the column names only in one data frame from a table of
match information
allVarMatchMessage
Subsets on the variables that have a coercion.
Generate two dataframes that contain the same rows based on a two-column index
updateCompareObject.mismatches
Adds a colMatching block to the output
rcompObjItemLength: return length of an item, returning 0 if null, and handling the fact that
we might have a data frames or a vector
Round all numeric fields in a data frame
print.summary.dataCompareRobject
Printing summaryRCompare Output
checkForRcompareCol
updateCompareObject.colmatching
Adds a colMatching block to the output
Generate two dataframes and returns subsets of these dataframes that have shared rows.
checkKeysExist
Compare data. Wrapper for comparison functionality.
colsWithUnequalValues: a dataframe summarising a column with unequal values
Check object is of class dataCompareRobject
isNotNull: is object not null
makeValidKeys
updateCompareObject.matches
Adds a colMatching block to the output
makeValidNames
Warn users if the calculation is likely to be slow
Create variable mismatch table
Generate two dataframes that contain the same rows based on a single index
validateData : routine to validate the input data
Creates a list of info about the dataframe.
Create variable mismatch details
createReportText: prepares text which is used in the summary report
Saves R markdown and HTML reports in the area specified by the user. Reports are called RcompareReport.Rmd (.html)
Uses knitr package to create tables in the markdown (createReportText function) and HTML report.
Generates an empty list of the correct class to store results
coerceData
Takes the raw info for the meta block of the output
and puts it in a format usable by the updateCompareObject
function