microseq (version 1.2)

muscle: Multiple alignment using MUSCLE

Description

Computes a multiple sequence alignment using the MUSCLE software.

Usage

muscle(in.file, out.file, quiet = FALSE, diags = FALSE, maxiters = 16)

Arguments

in.file
Name of FASTA-file with input sequences.
out.file
Name of file to store the result.
quiet
Logical, quiet=FALSE produces screen output during computations.
diags
Logical, diags=TRUE gives faster but less reliable alignment.
maxiters
Maximum number of iterations.

Value

The result is written to the file specified in out.file.

Details

The software MUSCLE (Edgar, 2004) must be installed and available on the system. To test this, type system("muscle") in the Console; it should produce some sensible output. NOTE: The executable must be named muscle on your system, without version numbers or other additions to the name. For more details on MUSCLE, see http://www.drive5.com/muscle.
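The installation check described above can also be done without launching the program. The following is a minimal sketch using only base R; the helper name muscleAvailable is hypothetical and not part of microseq.

```r
# Hypothetical helper (not part of microseq): returns TRUE if an executable
# with the given name is found on the system PATH.
muscleAvailable <- function(exe = "muscle") {
  # Sys.which() returns an empty string when the executable cannot be found
  nzchar(Sys.which(exe)[[1]])
}

if (muscleAvailable()) {
  message("Found muscle at: ", Sys.which("muscle"))
} else {
  message("No executable named 'muscle' was found on the PATH.")
}
```

This only confirms that something named muscle can be found; it does not verify that it is a working MUSCLE installation.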

By default diags=FALSE, but it can be set to TRUE to increase speed. This should be done only if the sequences are highly similar.

By default maxiters=16. If you have a large number of sequences (a few thousand), or they are very long, this may be too slow for practical use. A good compromise between speed and accuracy is to run only the first two iterations of the algorithm, by setting maxiters=2. On average, this gives accuracy equal to T-Coffee at speeds much faster than CLUSTALW.
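The choice between the default and the faster two-iteration setting could be sketched as follows. This is an illustration only: the helper names countFastaSeqs and chooseMaxiters are hypothetical, the cutoff of 2000 sequences is an assumed value (the text above only says "a few thousand"), and the muscle() call assumes the microseq 1.2 signature shown under Usage.

```r
# Hypothetical helper: count the sequences in a FASTA file by counting
# the header lines, which start with ">"
countFastaSeqs <- function(fasta.file) {
  sum(startsWith(readLines(fasta.file, warn = FALSE), ">"))
}

# Hypothetical helper: use maxiters = 2 for large inputs, otherwise the
# default of 16. The cutoff of 2000 is an assumption, not a value given
# by MUSCLE or microseq.
chooseMaxiters <- function(n.seq, cutoff = 2000) {
  if (n.seq >= cutoff) 2 else 16
}

# Only attempt the alignment if microseq and MUSCLE are both available
# and the installed muscle() accepts a maxiters argument
if (requireNamespace("microseq", quietly = TRUE) &&
    nzchar(Sys.which("muscle")[[1]]) &&
    "maxiters" %in% names(formals(microseq::muscle))) {
  ex.file <- system.file("extdata", "small.fasta", package = "microseq")
  n.seq <- countFastaSeqs(ex.file)
  microseq::muscle(in.file = ex.file, out.file = "deleteMe.fasta",
                   quiet = TRUE, maxiters = chooseMaxiters(n.seq))
}
```

Sequence length matters as well as sequence count, so a fixed cutoff on the number of sequences is only a rough guide.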

References

Edgar, R.C. (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput, Nucleic Acids Res, 32, 1792-1797.

See Also

msaTrim.

Examples

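## Not run:
# Path to the small example FASTA file shipped with microseq
ex.file <- file.path(path.package("microseq"), "extdata", "small.fasta")
# Align the sequences and write the result to deleteMe.fasta
muscle(in.file = ex.file, out.file = "deleteMe.fasta")
## End(Not run)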