This function splits automatically a dataframe into train and test datasets. You can define a seed to get the same results every time, but has a default value. You can prevent it from printing the split counter result.
msplit(df, size = 0.7, seed = 0, print = T)
Dataframe to split
Numeric. Split rate value, between 0 and 1. If set to 1, the train and test set will be the same.
Seed for random split
Print summary results
A list with both datasets, summary, and split rate
Other Machine Learning: ROC
,
clusterKmeans
, conf_mat
,
export_results
, gain_lift
,
h2o_automl
, h2o_predict_API
,
h2o_predict_MOJO
,
h2o_predict_binary
,
h2o_predict_model
,
h2o_selectmodel
, impute
,
iter_seeds
, model_metrics
,
mplot_conf
, mplot_cuts_error
,
mplot_cuts
, mplot_density
,
mplot_full
, mplot_gain
,
mplot_importance
,
mplot_lineal
, mplot_metrics
,
mplot_response
, mplot_roc
,
mplot_splits
Other Tools: autoline
,
bindfiles
, bring_api
,
db_download
, db_upload
,
export_plot
, export_results
,
get_credentials
,
get_currency
,
h2o_predict_API
,
h2o_predict_MOJO
,
h2o_predict_binary
,
h2o_predict_model
,
h2o_selectmodel
, h2o_update
,
haveInternet
, importxlsx
,
ip_country
, iter_seeds
,
json2vector
, listfiles
,
mailSend
, matrixwd
,
myip
, pass
,
quiet
, read.file
,
statusbar
, try_require
,
updateLares
, zerovar