compare.synds:
Compare univariate distributions of synthesised and observed data
Description
Compare synthesised data set with the original (observed) data set
using percent frequency tables and histograms. When more than one
synthetic data set has been generated (object$m>1), by default pooled
synthetic data are used for comparison.
an object of class synds, which stands
for 'synthesised data set'. It is typically created by
function syn() and it includes object$m
synthesised data set(s).
data
an original (observed) data set.
vars
variables to be compared. If vars is NULL
(the default) all synthesised variables are compared.
msel
index or indices of synthetic data copies for which a comparison
is to be made. If NULL pooled synthetic data copies are compared
with the original data.
breaks
the number of cells for the histogram.
nrow
the number of rows for the plotting area.
ncol
the number of columns for the plotting area.
rel.size.x
a number representing the relative size of x-axis labels.
cols
bar colors.
...
additional parameters.
x
an object of class compare.synds.
Value
An object of class compare.synds which is a list including a list
of comparative percent frequency tables (tables) and
a ggplot object (plots) with bar charts/histograms. If multiple plots
are produced they and their corresponding frequency tables are stored as a list.
Details
Missing data categories for numeric variables are plotted on the same plot
as non-missing values. They are indicated by miss. suffix.