Split data when patients are in the data multiple times such that the same patient is always either in the train set or the test set (the same patient cannot be in both the test and train set at different times)
subjectSplitter(population, test = 0.3, train = NULL, nfold = 3, seed = NULL)
An object created using createStudyPopulation().
A real number between 0 and 1 indicating the test set fraction of the data
A real number between 0 and 1 indicating the train set fraction of the data. If not set train is equal to 1 - test
An integer >= 1 specifying the number of folds used in cross validation
If set a fixed seed is used, otherwise a random split is performed
A dataframe containing the columns: rowId and index
Returns a dataframe of rowIds and indexes with a -1 index indicating the rowId belongs to the test set and a positive integer index value indicating the rowId's cross valiation fold within the train set.