split_by

split_by.data.frame

split_by() splits a data.frame or tbl_df into a training set and a test set.

A collection of tools that support data splitting, predictive modeling, and model evaluation.
A typical use is to split a dataset into a training set and a test set,
and then compare the data distributions of the two sets.
The package also supports developing predictive models and comparing the performance of several models,
which helps in selecting the best model.

Choonghyun Ryu

alookr

Model Classifier for Binary Classification

split_by function

<dl><dt>.data</dt>
<dd>a data.frame or a <code><a href="https://tibble.tidyverse.org/reference/tbl_df-class.html">tbl_df</a></code>.</dd>
<dt>...</dt>
<dd>further arguments passed to or from other methods.</dd>
<dt>target</dt>
<dd>unquoted expression or variable name. The name of the target variable.</dd>
<dt>ratio</dt>
<dd>numeric. The proportion of the data assigned to the training set. Default is 0.7.</dd>
<dt>seed</dt>
<dd>random seed used for splitting.</dd></dl>
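
The arguments above can be combined as in the following minimal sketch. It assumes the alookr package is installed; the toy data frame `df` and its columns `x` and `y` are made up for illustration and are not part of the package.

```r
library(alookr)

# Hypothetical toy data: one numeric predictor and an imbalanced binary target.
set.seed(1)
df <- data.frame(
  x = rnorm(100),
  y = factor(rep(c("no", "yes"), times = c(80, 20)))
)

# Split using the documented arguments: the target variable (unquoted),
# the training-set ratio, and a seed so the split can be reproduced.
sb <- split_by(df, y, ratio = 0.7, seed = 42)

# The result carries the split_df class described on this page.
class(sb)
```

Passing an explicit `seed` is what makes the split repeatable across runs; omitting `ratio` falls back to the documented default of 0.7.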

Arguments

The attributes of the split_df class are as follows:
<ul>
<li>split_seed : integer. random seed used for splitting</li>
<li>target : character. the name of the target variable</li>
<li>binary : logical. whether the target variable is a binary class</li>
<li>minority : character. the name of the minority class</li>
<li>majority : character. the name of the majority class</li>
<li>minority_rate : numeric. the rate of the minority class</li>
<li>majority_rate : numeric. the rate of the majority class</li>
</ul>
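
To make the class-balance attributes concrete, the sketch below computes comparable quantities in base R for a made-up target vector. It only illustrates what binary, minority, majority, and their rates mean. It is not alookr's implementation, and the helper name `class_balance` is hypothetical.

```r
# Hypothetical helper (not part of alookr): summarise the class balance
# of a target vector in the same terms as the attributes listed above.
class_balance <- function(target) {
  counts <- table(target)          # frequency of each class
  rates  <- counts / sum(counts)   # relative frequency of each class
  list(
    binary        = length(counts) == 2,
    minority      = names(counts)[which.min(counts)],
    majority      = names(counts)[which.max(counts)],
    minority_rate = unname(min(rates)),
    majority_rate = unname(max(rates))
  )
}

# Made-up target with 80 "no" and 20 "yes" observations.
target <- factor(rep(c("no", "yes"), times = c(80, 20)))
bal <- class_balance(target)

bal$minority       # "yes"
bal$minority_rate  # 0.2
```

On an object returned by split_by(), the corresponding values should be readable with base R's `attr()`, for example `attr(sb, "minority")`, assuming `sb` is such an object.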

attributes of split_by

Split Data into Train and Test Set — split_by


