Last chance! 50% off unlimited learning
Sale ends in
parSeqSim(protlist, cores = 2, type = "local", submat = "BLOSUM62")
n
list containing n
protein sequences,
each component of the list is a character string, storing one protein sequence.
Unknown sequences should be represented as ''
.2
. Users could use the detectCores()
function
in the parallel
package to see how many cores they could use.'local'
,
could be 'global'
or 'local'
,
where 'global'
represents Needleman-Wunsch global alignment;
'local'
represents Smith-Waterman local alignment.'BLOSUM62'
, could be one of
'BLOSUM45'
, 'BLOSUM50'
, 'BLOSUM62'
, 'BLOSUM80'
, 'BLOSUM100'
,
'PAM30'
, 'PAM40'
, 'PAM70'
, 'PAM120'
, 'PAM250'
.n
x n
similarity matrix.
twoSeqSim
for protein sequence alignment
for two protein sequences. See parGOSim
for
protein similarity calculation based on
Gene Ontology (GO) semantic similarity.
# Be careful when testing this since it involves parallelisation
# and might produce unpredictable results in some environments
require(Biostrings)
require(foreach)
require(doParallel)
s1 = readFASTA(system.file('protseq/P00750.fasta', package = 'protr'))[[1]]
s2 = readFASTA(system.file('protseq/P08218.fasta', package = 'protr'))[[1]]
s3 = readFASTA(system.file('protseq/P10323.fasta', package = 'protr'))[[1]]
s4 = readFASTA(system.file('protseq/P20160.fasta', package = 'protr'))[[1]]
s5 = readFASTA(system.file('protseq/Q9NZP8.fasta', package = 'protr'))[[1]]
plist = list(s1, s2, s3, s4, s5)
psimmat = parSeqSim(plist, cores = 2, type = 'local', submat = 'BLOSUM62')
print(psimmat)
Run the code above in your browser using DataLab