A dataset containing BLASTP alignment results for proteins from test genomes of bacteria, including alignment metrics.
blastp_dfA data frame with multiple rows and 12 variables:
Character. Protein query identifier.
Character. Subject protein identifier from reference databases.
Numeric. Percentage identity between query and subject sequences.
Numeric. Length of the aligned sequence region.
Numeric. Number of mismatches in the alignment.
Numeric. Number of gap openings in the alignment.
Numeric. Start position in query sequence.
Numeric. End position in query sequence.
Numeric. Start position in subject sequence.
Numeric. End position in subject sequence.
Numeric. Expect value for the alignment.
Numeric. Bit score for the alignment.