Splits input data into training and test sets. Setting the seed makes the split reproducible, so the same sample is retrieved on every call.
get_sample(data, percentage_tr_rows = 0.8, seed = 987)
data: input data source
percentage_tr_rows: proportion of rows assigned to training, from 0.1 to 0.99; default value=0.8 (80 percent of rows used for training)
seed: seed used to generate the random sample; default value=987
Logical (TRUE/FALSE) vector with one element per row of 'data'. TRUE marks a row position that belongs to the training data.
# NOT RUN {
library(funModeling)
## Training and test data. The default percentage of training cases is 80%.
index_sample=get_sample(data=heart_disease, percentage_tr_rows=0.8)
## Generating the samples
data_tr=heart_disease[index_sample,]
## index_sample is logical, so negate it with `!` (a minus sign would not select the test rows)
data_ts=heart_disease[!index_sample,]
# }
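To make the returned value easier to picture, here is a minimal base R sketch of how a seeded logical split of this kind could be built. The helper `sketch_sample` is hypothetical and is not the funModeling source. It only mirrors the documented arguments and return shape, and it uses the built-in `mtcars` data so it runs without extra packages.

```r
# Hypothetical helper (an assumption for illustration, not funModeling code):
# returns a logical vector with TRUE for rows assigned to training.
sketch_sample <- function(data, percentage_tr_rows = 0.8, seed = 987) {
  set.seed(seed)                      # fixes the RNG state so the draw repeats
  n <- nrow(data)
  tr_rows <- sample(n, size = round(n * percentage_tr_rows))
  seq_len(n) %in% tr_rows             # TRUE where the row was drawn for training
}

idx <- sketch_sample(mtcars, percentage_tr_rows = 0.8)
train <- mtcars[idx, ]
test <- mtcars[!idx, ]                # logical index, so negate with `!`
```

Because the index is logical, the training and test sets are complementary by construction: every row lands in exactly one of them. Note that calling `set.seed` inside a function also resets the caller's random number stream, which may or may not be desirable.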