dataGen

Fast generation of (primitive) synthetic multivariate normal data.

Data from statistical agencies and other institutions are mostly
confidential. This package, introduced in Templ, Kowarik and Meindl (2017) <doi:10.18637/jss.v067.i04>, can be used for the generation of anonymized
(micro)data, i.e. for the creation of public- and scientific-use files.
The theoretical basis for the methods implemented can be found in Templ (2017) <doi:10.1007/978-3-319-50272-4>.
Various risk estimation and anonymization methods are included. Note that the package
includes a graphical user interface published in Meindl and Templ (2019) <doi:10.3390/a12090191> that allows to use various methods of this
package.

Matthias Templ

sdcMicro

Statistical Disclosure Control Methods for Anonymization of Data
and Risk Estimation

Bernhard Meindl

Alexander Kowarik

Johannes Gussenbauer

Organisation For Economic Co-Operation And Development 

Statistics Netherlands 

Pascal Heus 

dataGen function

<dl><dt>obj</dt>
<dd>an <code>sdcMicroObj-class</code>-object or a <code>data.frame</code></dd>
<dt>...</dt>
<dd>see possible arguments below<dl>
<dt>n:</dt>
<dd>amount of observations for the generated data, defaults to 200</dd></dl></dd><dt>use:</dt>
<dd>howto compute covariances in case of missing values, see also argument <code>use</code> in <code><a href="/link/cov?package=sdcMicro&version=5.7.8" data-mini-rdoc="sdcMicro::cov">cov</a></code>.
The default choice is 'everything', other possible choices are 'all.obs', 'complete.obs', 'na.or.complete' or 'pairwise.complete.obs'.</dd></dl>

Arguments

Author

Fast generation of synthetic data — dataGen

<dl>

<dt>obj</dt>
<dd>an <code>sdcMicroObj-class</code>-object or a <code>data.frame</code></dd>


<dt>...</dt>
<dd>see possible arguments below<dl>
<dt>n:</dt>
<dd>amount of observations for the generated data, defaults to 200</dd>

<dt>use:</dt>
<dd>howto compute covariances in case of missing values, see also argument <code>use</code> in <code><a href='https://rdrr.io/r/stats/cor.html'>cov</a></code>.
The default choice is 'everything', other possible choices are 'all.obs', 'complete.obs', 'na.or.complete' or 'pairwise.complete.obs'.</dd>

</dl></dd>

</dl>

dataGen: Fast generation of synthetic data

Description

Usage

Value

Arguments

Author

Details

References

See Also

Examples