train_val_split_method2: Train-Validation Split by Accident Period Method 2
Description
Function for training/validation splitting.
Usage
train_val_split_method2(df, tri.size, val_ratio, test = FALSE)
Value
List containing $train, $valid, $test, which should partition
the input df.
Arguments
df
Claims Triangle and other information. data.frame format of
claims and related information for each cell. Dataframe will have columns
origin and dev as columns 1 and 2 respectively.
tri.size
Triangle size.
val_ratio
Value between 0 and 1 as the approximate size of validaiton
set.
test
Returns the test set if TRUE .
Details
Approximates the validation set by defining the training set as the cells
below the function \(((b^{1/a} - x^{1/a})^a)\). Where \(b\) is equal to
the triangle size and \(a\) is optimised to best fit val_ratio.
The training set is therefore cells outside of this period but within the
upper triangle. The test set is all observations in the lower triangle.
Note that accident period 1 and development period 1 will always be within
the training set.