Prepares a dataset for ForestSearch, including options for LASSO-based dimension reduction, GRF cuts, forced cuts, and flexible cut strategies. Returns a list with the processed data, subgroup factor names, cut expressions, and LASSO selection results.
get_FSdata(
df.analysis,
use_lasso = FALSE,
use_grf = FALSE,
grf_cuts = NULL,
confounders.name,
cont.cutoff = 4,
conf_force = NULL,
conf.cont_medians = NULL,
conf.cont_medians_force = NULL,
replace_med_grf = TRUE,
defaultcut_names = NULL,
cut_type = "default",
exclude_cuts = NULL,
outcome.name = "tte",
event.name = "event",
details = TRUE
)A named list containing:
Data frame with binary cut-point indicator columns (named
q1, q2, ...) appended to the original analysis data.
Character vector of the internal column names
(q1, q2, ...) corresponding to each candidate factor.
Character vector of candidate factor specifications (continuous cut expressions and categorical variable names).
Character vector of factors retained by LASSO (if
use_lasso = TRUE), or NULL.
Character vector of factors omitted by LASSO (if
use_lasso = TRUE), or NULL.
Data frame containing the data.
Logical. Whether to use LASSO for dimension reduction.
Logical. Whether to use GRF cuts.
Character vector of GRF cut expressions.
Character vector of confounder variable names.
Integer. Cutoff for continuous variable determination.
Character vector of forced cut expressions.
Character vector of continuous confounders to cut at median.
Character vector of additional continuous confounders to force median cut.
Logical. If TRUE, removes median cuts that overlap with GRF cuts.
Character vector of confounders to force default cuts.
Character. "default" or "median" for cut strategy.
Character vector of cut expressions to exclude.
Character. Name of outcome variable.
Character. Name of event indicator variable.
Logical. If TRUE, prints details during execution.