Categorical time series (CTS) of protein sequences from different species
data(ProteinSequences)
A tsibble with four columns:
Value: The categorical values of the time series in the dataset.
Series: Integer values identifying the time series each observation belongs to (there are 40 time series in the dataset).
Time: Integer values giving the temporal index of each observation.
Class: Integer values giving the class of each time series.
The column Value is the concatenation of 40 time series,
each taking four categorical values (amino acids). The column Class contains
integers from 1 to 4, indicating that there are 4 different classes in the dataset. Each class is associated with a different
family of viruses. For more information, see lopez2023hard.
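Because Value stores all series stacked one after another, a common first step is to split it back into one sequence per Series. The following is a minimal sketch in base R, not part of the package itself. It uses a small hypothetical stand-in (`toy`) with the same four-column layout, so the values, series lengths, and the helper names `series_list`, `series_class`, and `series_length` are assumptions made for illustration only.

```r
# Hypothetical toy data with the same four columns as ProteinSequences.
# The values below are invented; the real object is loaded with
# data(ProteinSequences).
toy <- data.frame(
  Value  = factor(c("a", "b", "a", "c", "c", "b", "a")),
  Series = c(1L, 1L, 1L, 1L, 2L, 2L, 2L),
  Time   = c(1L, 2L, 3L, 4L, 1L, 2L, 3L),
  Class  = c(1L, 1L, 1L, 1L, 2L, 2L, 2L)
)

# Make sure observations are ordered by series and then by time.
toy <- toy[order(toy$Series, toy$Time), ]

# Recover the individual series from the concatenated Value column.
series_list <- split(toy$Value, toy$Series)

# One class label per series (Class is constant within a series).
series_class <- tapply(toy$Class, toy$Series, function(x) x[1])

# Number of observations in each series.
series_length <- lengths(series_list)
```

Assuming the real object can be handled like an ordinary data frame, the same three steps should apply with `ProteinSequences` in place of `toy`, yielding a list of 40 sequences.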
lopez2023hard