A dataframe containing all insertions and deletions observed in experimental data (pooled across all samples, Greiff, 2017) This dataframe is a subset of the dataframe used in the application note. The original dataframe which contains 11363603 rows can be downloaded from:
insertions_and_deletion_lengths_df
A data frame with 500000 rows and variables:
np1 insertions
np2 insertions
lengths of V gene deletions
lengths of 5' end D gene deletions
lengths of 3' end D gene deletions
lengths of J gene deletions
https://github.com/GreiffLab/immuneSIM or using the provided function: load_insdel_data()