Performs global pairwise alignment between query and target sequences using
VSEARCH, and reports matches based on the specified pairwise identity
threshold (id). Only alignments that meet or exceed the identity
threshold are included in the output.
fastx_input and database can either be file paths to a
FASTA/FASTQ files or FASTA/FASTQ objects. FASTA objects are tibbles that
contain the columns Header and Sequence, see
readFasta. FASTQ objects are tibbles that contain the
columns Header, Sequence, and Quality, see
readFastq.
userfields specifies the fields to include in the output file. Fields
must be given as a character string separated by "+". The default
value of userfields equals
"query+target+id+alnlen+mism+opens+qlo+qhi+tlo+thi+evalue+bits", which
gives a blast-like tab-separated format of twelve fields. See the
'Userfields' section in the VSEARCH manual for more information.
otutabout gives the option to output the results in an OTU
table format with tab-separated columns. When writing to a file, the first
line starts with the string "#OTU ID", followed by a tab-separated list of
all sample identifiers (formatted as "sample=X"). Each subsequent line,
corresponding to an OTU, begins with the OTU identifier and is followed by
tab-separated abundances for that OTU in each sample. If otutabout is
a character string, the output is written to the specified file. If
otutabout is TRUE, the function returns the OTU table as a
tibble, where the first column is named otu_id instead of "#OTU ID".
Pairwise identity (id) is calculated as the number of matching columns
divided by the alignment length minus terminal gaps.
vsearch_options allows users to pass additional command-line arguments
to VSEARCH that are not directly supported by this function. Refer to
the VSEARCH manual for more details.
Visit the VSEARCH
documentation
for information about defining gapopen and gapext.