Split each VCF into SBS, DBS, and ID VCFs (plus VCF-like data frame with left-over rows)
SplitListOfVCFs(
list.of.vcfs,
variant.caller,
max.vaf.diff = 0.02,
num.of.cores = 1,
suppress.discarded.variants.warnings = TRUE
)
A list containing the following objects:
SBS
: List of VCFs with only single base substitutions.
DBS
: List of VCFs with only doublet base substitutions as called
by Mutect.
ID
: List of VCFs with only small insertions and deletions.
discarded.variants
: Non-NULL only if there are variants
that were excluded from the analysis. See the added extra column
discarded.reason
for more details.
List of VCFs as in-memory data.frames.
Name of the variant caller that produces the VCF, can
be either "strelka"
, "mutect"
, "freebayes"
or
"unknown"
. If variant caller is "mutect"
, do not merge
SBSs into DBS.
The maximum difference of VAF, default value is 0.02. If
the absolute difference of VAFs for adjacent SBSs is bigger than
max.vaf.diff
, then these adjacent SBSs are likely to be "merely"
asynchronous single base mutations, opposed to a simultaneous doublet
mutation or variants involving more than two consecutive bases.
The number of cores to use. Not available on Windows
unless num.of.cores = 1
.
Logical. Whether to suppress warning messages showing information about the discarded variants. Default is TRUE.