syn.ctree(y, x, xp, smoothing, proper = FALSE, minbucket = 5, ...)
syn.cart(y, x, xp, smoothing, proper = FALSE, minbucket = 5, cp = 1e-04, ...)
n
.n
x p
) of original covariates.k
x p
) of synthesised covariates.proper = TRUE
) a CART
model is fitted to a bootstrapped sample of the original data.rpart.control
and
ctree_control
for details.rpart.control
for details.ctree_control
for syn.ctree
and
rpart.control
for syn.cart
.k
with synthetic values of y
.xp
find the terminal node. y
from that draw as the synthetic value.
syn.ctree
uses ctree
function from the
party package and syn.cart
uses rpart
function from the rpart package. They differ, among others,
in a selection of a splitting variable and a stopping rule for the
splitting process.
A Guassian kernel smoothing can be applied to continuous variables
by setting smoothing parameter to "density"
. It is recommended
as a tool to decrease the disclosure risk. Increasing minbucket
is another means of data protection.
CART models were suggested for generation of synthetic data by
Reiter (2005) and then evaluated by Drechsler and Reiter (2011).
syn
, syn.survctree
,
rpart
, ctree