A data set containing the GC content and other information about the virus segments from the official Virtool virus database (version 1.4.0). The variables are as follows:
nucleotide_info
A data frame with 7 variables:
The virus name
The virus isolate ID
The virus segment ID
The percentage of A nucleotides in the virus segment
The percentage of C nucleotides in the virus segment
The percentage of T nucleotides in the virus segment
The percentage of G and C nucleotides in the virus segment (GC content)
The length of the virus segment
The version number of the virus database
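As an illustration of how the percentage and length variables described above relate to a raw segment sequence, here is a minimal sketch in Python. It is not the code used to build this data set: the function name `nucleotide_summary` and the dictionary keys are hypothetical, and it assumes percentages are taken over the full segment length, including any ambiguous bases such as N.

```python
from collections import Counter


def nucleotide_summary(sequence: str) -> dict:
    """Summarise base composition of one segment (hypothetical helper)."""
    seq = sequence.upper()
    length = len(seq)
    if length == 0:
        raise ValueError("sequence must not be empty")

    counts = Counter(seq)  # missing bases count as 0

    def pct(*bases: str) -> float:
        # Assumption: the denominator is the full segment length.
        return 100.0 * sum(counts[b] for b in bases) / length

    return {
        "a": pct("A"),
        "c": pct("C"),
        "t": pct("T"),
        "gc": pct("G", "C"),  # GC content: G and C combined
        "length": length,
    }


summary = nucleotide_summary("AACCGGTT")
print(summary)
```

Under these assumptions, GC content is the share of G and C bases combined, so it is never smaller than the C percentage for the same segment.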