Inherited methods
Method new()
Creates a new instance of this R6 class.
Usage
DoubleMLSSM$new(
data,
ml_g,
ml_pi,
ml_m,
n_folds = 5,
n_rep = 1,
score = "missing-at-random",
normalize_ipw = FALSE,
trimming_rule = "truncate",
trimming_threshold = 1e-12,
dml_procedure = "dml2",
draw_sample_splitting = TRUE,
apply_cross_fitting = TRUE
)
Arguments
data
(DoubleMLData
)
The DoubleMLData
object providing the data and specifying the
variables of the causal model.
ml_g
(LearnerRegr
,
Learner
, character(1)
)
A learner of the class LearnerRegr
, which is
available from mlr3 or its
extension packages mlr3learners or
mlr3extralearners.
Alternatively, a Learner
object with public field
task_type = "regr"
can be passed, for example of class
GraphLearner
. The learner can possibly
be passed with specified parameters, for example
lrn("regr.cv_glmnet", s = "lambda.min")
.
ml_g
refers to the nuisance function \(g_0(S,D,X) = E[Y|S,D,X]\).
ml_pi
(LearnerClassif
,
Learner
, character(1)
)
A learner of the class LearnerClassif
, which is
available from mlr3 or its
extension packages mlr3learners or
mlr3extralearners.
Alternatively, a Learner
object with public field
task_type = "classif"
can be passed, for example of class
GraphLearner
. The learner can possibly
be passed with specified parameters, for example
lrn("classif.cv_glmnet", s = "lambda.min")
.
ml_pi
refers to the nuisance function \(pi_0(D,X) = Pr[S=1|D,X]\).
ml_m
(LearnerRegr
,
LearnerClassif
, Learner
,
character(1)
)
A learner of the class LearnerClassif
, which is
available from mlr3 or its
extension packages mlr3learners or
mlr3extralearners.
Alternatively, a Learner
object with public field
task_type = "classif"
can be passed, for example of class
GraphLearner
. The learner can possibly
be passed with specified parameters, for example
lrn("classif.cv_glmnet", s = "lambda.min")
.
ml_m
refers to the nuisance function \(m_0(X) = Pr[D=1|X]\).
n_folds
(integer(1)
)
Number of folds. Default is 5
.
n_rep
(integer(1)
)
Number of repetitions for the sample splitting. Default is 1
.
score
(character(1)
, function()
)
A character(1)
("missing-at-random"
or "nonignorable"
) specifying
the score function. Default is "missing-at-random"
.
normalize_ipw
(logical(1)
)
Indicates whether the inverse probability weights are normalized. Default is FALSE
.
trimming_rule
(character(1)
)
A character(1)
("truncate"
is the only choice) specifying the
trimming approach. Default is "truncate"
.
trimming_threshold
(numeric(1)
)
The threshold used for timming. Default is 1e-12
.
dml_procedure
(character(1)
)
A character(1)
("dml1"
or "dml2"
) specifying the double machine
learning algorithm. Default is "dml2"
.
draw_sample_splitting
(logical(1)
)
Indicates whether the sample splitting should be drawn during
initialization of the object. Default is TRUE
.
apply_cross_fitting
(logical(1)
)
Indicates whether cross-fitting should be applied. Default is TRUE
.
Method set_ml_nuisance_params()
Set hyperparameters for the nuisance models of DoubleML models.
Note that in the current implementation, either all parameters have to
be set globally or all parameters have to be provided fold-specific.
Usage
DoubleMLSSM$set_ml_nuisance_params(
learner = NULL,
treat_var = NULL,
params,
set_fold_specific = FALSE
)
Arguments
learner
(character(1)
)
The nuisance model/learner (see method params_names
).
treat_var
(character(1)
)
The treatment varaible (hyperparameters can be set treatment-variable
specific).
params
(named list()
)
A named list()
with estimator parameters. Parameters are used for all
folds by default. Alternatively, parameters can be passed in a
fold-specific way if option fold_specific
is TRUE
. In this case, the
outer list needs to be of length n_rep
and the inner list of length
n_folds
.
set_fold_specific
(logical(1)
)
Indicates if the parameters passed in params
should be passed in
fold-specific way. Default is FALSE
. If TRUE
, the outer list needs
to be of length n_rep
and the inner list of length n_folds
.
Note that in the current implementation, either all parameters have to
be set globally or all parameters have to be provided fold-specific.
Method tune()
Hyperparameter-tuning for DoubleML models.
The hyperparameter-tuning is performed using the tuning methods provided
in the mlr3tuning package. For more
information on tuning in mlr3, we refer to
the section on parameter tuning in the
mlr3 book.
Usage
DoubleMLSSM$tune(
param_set,
tune_settings = list(n_folds_tune = 5, rsmp_tune = mlr3::rsmp("cv", folds = 5), measure
= NULL, terminator = mlr3tuning::trm("evals", n_evals = 20), algorithm =
mlr3tuning::tnr("grid_search"), resolution = 5),
tune_on_folds = FALSE
)
Arguments
param_set
(named list()
)
A named list
with a parameter grid for each nuisance model/learner
(see method learner_names()
). The parameter grid must be an object of
class ParamSet.
tune_settings
(named list()
)
A named list()
with arguments passed to the hyperparameter-tuning with
mlr3tuning to set up
TuningInstance objects.
tune_settings
has entries
terminator
(Terminator)
A Terminator object. Specification of terminator
is required to perform tuning.
algorithm
(Tuner or character(1)
)
A Tuner object (recommended) or key passed to the
respective dictionary to specify the tuning algorithm used in
tnr(). algorithm
is passed as an argument to
tnr(). If algorithm
is not specified by the users,
default is set to "grid_search"
. If set to "grid_search"
, then
additional argument "resolution"
is required.
rsmp_tune
(Resampling or character(1)
)
A Resampling object (recommended) or option passed
to rsmp() to initialize a
Resampling for parameter tuning in mlr3
.
If not specified by the user, default is set to "cv"
(cross-validation).
n_folds_tune
(integer(1)
, optional)
If rsmp_tune = "cv"
, number of folds used for cross-validation.
If not specified by the user, default is set to 5
.
measure
(NULL
, named list()
, optional)
Named list containing the measures used for parameter tuning. Entries in
list must either be Measure objects or keys to be
passed to passed to msr(). The names of the entries must
match the learner names (see method learner_names()
). If set to NULL
,
default measures are used, i.e., "regr.mse"
for continuous outcome
variables and "classif.ce"
for binary outcomes.
resolution
(character(1)
)
The key passed to the respective
dictionary to specify the tuning algorithm used in
tnr(). resolution
is passed as an argument to
tnr().
tune_on_folds
(logical(1)
)
Indicates whether the tuning should be done fold-specific or globally.
Default is FALSE
.
Method clone()
The objects of this class are cloneable with this method.
Usage
DoubleMLSSM$clone(deep = FALSE)
Arguments
deep
Whether to make a deep clone.