A dataframe containing FASTA ids (Id column) and sequences (Seq column).
If 'filter' is specified, the split sequences are stored in additional columns
Part1 and Part2.
Arguments
filename
A string specifying the name of the FASTA file to be imported.
filter
An optional string specifying a sequence motif for sequence filtering.
Only keeps those sequences containing this motif. Also splits the
matched sequences and provides the split parts in two additional columns.
non_standard_chars
A string specifying instructions for handling non-standard
nucleotide or amino acid characters. Options include 'remove',
'ignore' or throw an 'error'. Defaults to 'error'.
alphabet
A string specifying the query and database alphabet:
'nucleotide' or 'protein'. Defaults to 'nucleotide'.