FindSynteny(dbFile, tblName = "DNA", identifier = "", useFrames = TRUE, alphabet = c("MF", "ILV", "A", "C", "WYQHP", "G", "TSN", "RK", "DE"), geneticCode = GENETIC_CODE, sepCost = -0.01, gapCost = -0.2, shiftCost = -20, codingCost = -3, maxSep = 5000, maxGap = 5000, minScore = 200, dropScore = -100, maskRepeats = TRUE, verbose = TRUE)
FALSE
then faster but less sensitive to distant homology.
alphabet
equal to AA_STANDARD
.
FindSynteny
searches for ``hits'' between sequences that can be chained into collinear ``blocks'' of synteny. Hits are defined as k-mer exact nucleotide matches or k-mer matches in a reduced amino acid alphabet (if useFrames
is TRUE
). Hits are chained into blocks as long as they are: (1) within the same sequence, (2) within maxSep
and maxGap
distance, and (3) help maintain the score above minScore
. Blocks are extended from their first and last hit until their score drops below dropScore
from the maximum that was reached. This process results in a set of hits and blocks stored in an object of class ``Synteny''.
AlignSynteny
, Synteny-class
db <- system.file("extdata", "Influenza.sqlite", package="DECIPHER")
synteny <- FindSynteny(db, minScore=50)
synteny
pairs(synteny) # scatterplot matrix
Run the code above in your browser using DataLab