syn.ctree(y, x, xp, smoothing, proper = FALSE, minbucket = 5, ...)
syn.cart(y, x, xp, smoothing, proper = FALSE, minbucket = 5, cp = 1e-04, ...)n.n x p) of original covariates.k x p) of synthesised covariates.proper = TRUE) a CART
model is fitted to a bootstrapped sample of the original data.rpart.control and
ctree_control for details.rpart.control for details.ctree_control for syn.ctree and
rpart.control for syn.cart.k with synthetic values of y.xp find the terminal node. y from that draw as the synthetic value.
syn.ctree uses ctree function from the
party package and syn.cart uses rpart
function from the rpart package. They differ, among others,
in a selection of a splitting variable and a stopping rule for the
splitting process.
A Guassian kernel smoothing can be applied to continuous variables
by setting smoothing parameter to "density". It is recommended
as a tool to decrease the disclosure risk. Increasing minbucket
is another means of data protection.
CART models were suggested for generation of synthetic data by
Reiter (2005) and then evaluated by Drechsler and Reiter (2011).
syn, syn.survctree,
rpart, ctree