
This function automatically splits a dataframe into train and test datasets. You can set a seed to get the same split every time; if you do not, the default value (0) is used. Set print = FALSE to suppress the printed summary of the split counts.
msplit(df, size = 0.7, seed = 0, print = TRUE)
df: Dataframe.
size: Numeric. Split rate value, between 0 and 1. If set to 1, the train and test sets will be the same.
seed: Integer. Seed for the random split.
print: Boolean. Print summary results?
List with both datasets, summary, and split rate.
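To illustrate what a seeded split of this kind involves, here is a minimal base R sketch. It is not the package's implementation: split_sketch is a hypothetical helper, the element names of its returned list are assumptions made for this example, and mtcars merely stands in for your own dataframe. It only shows how a seed makes the split reproducible and how size = 1 yields identical train and test sets.

```r
# Hypothetical helper, NOT part of the package: a base R sketch of a seeded
# train/test split. The list element names below are assumptions.
split_sketch <- function(df, size = 0.7, seed = 0) {
  stopifnot(is.data.frame(df), is.numeric(size), size > 0, size <= 1)
  set.seed(seed)                          # same seed, same rows every time
  n <- nrow(df)
  n_train <- floor(size * n)              # number of rows assigned to train
  idx <- sample(seq_len(n), n_train)      # random row positions for train
  train <- df[idx, , drop = FALSE]
  if (size == 1) {
    test <- train                         # documented edge case: both sets match
  } else {
    test <- df[setdiff(seq_len(n), idx), , drop = FALSE]
  }
  list(train = train, test = test, split_size = size)
}

a <- split_sketch(mtcars, size = 0.7, seed = 123)
b <- split_sketch(mtcars, size = 0.7, seed = 123)
identical(a$train, b$train)               # TRUE: the seed fixes the split
nrow(a$train) + nrow(a$test) == nrow(mtcars)
```

Note that calling set.seed() inside a function also resets the random number generator for the rest of the session, which may or may not be what you want in your own code.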
Other Machine Learning:
ROC(), conf_mat(), export_results(), gain_lift(), h2o_automl(), h2o_predict_API(), h2o_predict_MOJO(), h2o_predict_binary(), h2o_predict_model(), h2o_selectmodel(), impute(), iter_seeds(), lasso_vars(), model_metrics(), model_preprocess()
Other Tools:
autoline(), bindfiles(), bring_api(), db_download(), db_upload(), export_plot(), export_results(), get_credentials(), h2o_predict_API(), h2o_predict_MOJO(), h2o_predict_binary(), h2o_predict_model(), h2o_selectmodel(), haveInternet(), image_metadata(), importxlsx(), ip_data(), json2vector(), listfiles(), mailSend(), myip(), quiet(), read.file(), statusbar(), tic(), try_require(), updateLares(), zerovar()
data(dft) # Titanic dataset
splits <- msplit(dft, size = 0.7, seed = 123)
names(splits)