llama (version 0.9.2)

bsFolds: Bootstrapping folds

Description

Take data produced by input and amend it with (optionally) stratified folds determined through bootstrapping.

Usage

bsFolds(data, nfolds = 10L, stratify = FALSE)

Arguments

data

the data to use. The structure returned by input.

nfolds

the number of folds. Defaults to 10.

stratify

whether to stratify the folds. Makes really only sense for classification models. Defaults to FALSE.

Value

train

a list of index sets for training.

test

a list of index sets for testing.

the original members of data. See input.

Details

Partitions the data set into folds. Stratification, if requested, is done by the best algorithm, i.e. the one with the best performance. The distribution of the best algorithms in each fold will be approximately the same. For each fold, the training index set is assembled through .632 bootstrap. The remaining indices are used for testing. There is no guarantee on the sizes of either sets. The sets of indices are added to the original data set and returned.

If the data set has train and test partitions already, they are overwritten.

See Also

cvFolds, trainTest

Examples

Run this code
# NOT RUN {
data(satsolvers)
folds = bsFolds(satsolvers)

# use 5 folds instead of the default 10
folds5 = bsFolds(satsolvers, 5L)

# stratify
foldsU = bsFolds(satsolvers, stratify=TRUE)
# }

Run the code above in your browser using DataLab