Learn R Programming

daltoolbox (version 1.2.747)

sample_stratified: Stratified sampling

Description

Train/test split and k‑fold partitioning that preserve the target class proportions (strata).

Usage

sample_stratified(attribute)

Value

returns an object of class sample_stratified

Arguments

attribute

attribute target to model building

Examples

Run this code
#using stratified sampling
sample <- sample_stratified("Species")
tt <- train_test(sample, iris)

# distribution of train
table(tt$train$Species)

# preparing dataset into four folds
folds <- k_fold(sample, iris, 4)

# distribution of folds
tbl <- NULL
for (f in folds) {
 tbl <- rbind(tbl, table(f$Species))
}
head(tbl)

Run the code above in your browser using DataLab