dmean: Summarization of location and spread between synthetic and real data
Description
For each numerical variable in the two datasets, compute the relative
difference between the mean (standard deviation) of the real data and the
mean (standard deviation) of the synthetic data. The summary is the average
of these relative differences over all numerical variables.
Usage
dmean(synth, real, tol = 1e-08, ...)
dsd(synth, real, tol = 1e-08, ...)
Value
[numeric] scalar.
Arguments
synth
[data.frame] Synthetic data
real
[data.frame] Real data
tol
[numeric] Nonnegative tolerance. If the absolute
mean (standard deviation) of a variable is smaller than
tol, it is considered zero. In that case the
absolute difference instead of the absolute relative
difference is computed.
...
Arguments passed to mean. e.g. use trim=c(0.01,0.99)
for mean estimation that is less sensitive to outliers.