Learn R Programming

llama (version 0.7.2)

cvFolds: Cross-validation folds

Description

Take data produced by input and amend it with (optionally) stratified folds for cross-validation.

Usage

cvFolds(data, nfolds = 10, stratify = T)

Arguments

data

the data to use. The structure returned by input.

nfolds

the number of folds. Defaults to 10. If -1 is given, leave-one-out cross-validation folds are produced.

stratify

whether to stratify the folds. Defaults to TRUE.

Value

traina list of data sets for training.
testa list of data sets for testing.
...the original members of data. See input.

Details

Partitions the data set into folds. Stratification, if requested, is done by the best algorithm, i.e. the one with the best performance. The distribution of the best algorithms in each fold will be approximately the same. The folds are assembled into training and test sets by combining $n-1$ folds for training and using the remaining fold for testing. The sets are added to the original data set and returned.

See Also

trainTest

Examples

data(satsolvers)
folds = cvFolds(satsolvers)

# use 5 folds instead of the default 10
folds5 = cvFolds(satsolvers, 5)

# don't stratify
foldsU = cvFolds(satsolvers, stratify=FALSE)

Run the code above in your browser using DataLab