Group partitioning constructs data partitions such that all records with each level in the column or columns specified by the parameter partitionKeyCols occurs together in the same partition.
CreateGroupPartition(validationType, holdoutPct, partitionKeyCols,
reps = NULL, validationPct = NULL)
character. String specifying the type of partition generated, either "TVH" or "CV".
integer. The percentage of data to be used as the holdout subset.
list. List of character strings specifying the name of the variable used in defining the group partition.
integer. The number of cross-validation folds to generate; only applicable when validationType = "CV".
integer. The percentage of data to be used as the validation subset.
An S3 object of class 'partition' including the parameters required by the SetTarget function to generate a group-based partitioning of the modeling dataset.
This function is one of several convenience functions provided to simplify the task
of starting modeling projects with custom partitioning options. The other
functions are CreateRandomPartition
, CreateStratifiedPartition
, and
CreateUserPartition
.
CreateRandomPartition
, CreateStratifiedPartition
,
CreateUserPartition
.
# NOT RUN {
CreateGroupPartition(validationType = "CV",
holdoutPct = 20,
partitionKeyCols = list("groupId"),
reps = 5)
# }
Run the code above in your browser using DataLab