getPopularMutationCount
determines which sequences occur frequently
for each V gene and returns the mutation count of those sequences.
getPopularMutationCount(
data,
germline_db,
v_call = "v_call",
seq = "sequence_alignment",
gene_min = 0.001,
seq_min = 50,
seq_p_of_max = 1/8,
full_return = FALSE
)
A data frame of genes that have a frequent sequence mutation count above 1.
data.frame
in the Change-O format. See
findNovelAlleles for a list of required
columns.
named list of IMGT-gapped germline sequences.
name of the column in data
with V allele calls.
Default is v_call
.
name of the column in data
with the
aligned, IMGT-numbered, V(D)J nucleotide sequence.
Default is sequence_alignment
.
portion of all unique sequences a gene must constitute to avoid exclusion.
number of copies of the V that must be present for to avoid exclusion.
ror each gene, the fraction of the most common V sequence count that a sequence must meet to avoid exclusion.
if TRUE
, will return all data
columns and
will include sequences with mutation count < 1.
getMutatedPositions can be used to find which positions of a set of sequences are mutated.
getPopularMutationCount(AIRRDb, SampleGermlineIGHV)
Run the code above in your browser using DataLab