Learn R Programming

Bios2cor (version 2.1)

import.msf: Reads a multiple sequence alignment file in MSF format

Description

Reads a Multiple Sequence Alignment (MSA) file in MSF format (.msf extension).

Usage

import.msf(file, aa.to.upper = TRUE, gap.to.dash = TRUE, log.file = NULL)

Arguments

file

a string of characters to indicate the name of the MSA file to be read.

aa.to.upper

a logical value indicating whether amino acids should be converted to upper case (TRUE) or not (FALSE). Default is TRUE.

gap.to.dash

a logical value indicating whether the dot (.) and tilde (\(\sim\)) gap symbols should be converted to the dash (-) character (TRUE) or not (FALSE). Default is TRUE.

log.file

a file containing errors that occurred when trying to read fasta file

Value

A object of class 'align', which is a named list whose elements correspond to sequences, in the form of character vectors.

Details

Initially, Multiple Sequence Format (MSF) was the multiple sequence alignment format of the Wisconsin Package (WP) or GCG (Genetic Computer Group). This package is a suite of over 130 sequence analysis programs for database searching, secondary structure prediction or sequence alignment. Presently, numerous multiple sequence alignment editors (Jalview and GeneDoc for example) can read and write MSF files.

MSF file displays several specificities:

  • a header containing sequence identifiers and characteristics (length, check and weight).

  • a separator symbolized by 2 slashes (//).

  • sequences of identifiers, displayed by consecutive blocks.

See Also

read.alignment function from seqinr package. read.GDoc function from aaMI package (archived).

Examples

Run this code
# NOT RUN {
  #Importing MSA file in MSF format
  aln <- import.msf(system.file("msa/toy_align.msf", package = "Bios2cor"))
# }

Run the code above in your browser using DataLab