This data set contains information about Escherichia coli, a bacterium of the genus Escherichia that is commonly found in the lower intestine of warm-blooded organisms.
A data frame with 336 rows and 9 columns: 8 variables plus the class.
Accession number for the SWISS-PROT database.
McGeoch's method for signal sequence recognition.
Von Heijne's method for signal sequence recognition.
Von Heijne's Signal Peptidase II consensus sequence score. Binary attribute.
Presence of charge on N-terminus of predicted lipoproteins. Binary attribute.
Score of discriminant analysis of the amino acid content of outer membrane and periplasmic proteins.
Score of the ALOM membrane spanning region prediction program.
Score of ALOM program after excluding putative cleavable signal regions from the sequence.
Class variable. 8 possible states.
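The layout described above (one accession number, seven numeric scores, one class label) can be sketched as a typed record. This is a minimal sketch, not the data set's official loader. It assumes each row is stored as one whitespace-separated line, which the description does not state. The column names (`mcg`, `gvh`, `lip`, `chg`, `aac`, `alm1`, `alm2`, `site`) are hypothetical labels chosen for illustration, and the sample line is a made-up placeholder rather than a real record.

```python
from dataclasses import dataclass


@dataclass
class EcoliRecord:
    # Field names are hypothetical; the description above does not name the columns.
    accession: str  # SWISS-PROT accession number
    mcg: float      # McGeoch's signal sequence score
    gvh: float      # Von Heijne's signal sequence score
    lip: float      # Signal Peptidase II consensus score (binary attribute)
    chg: float      # N-terminus charge of predicted lipoproteins (binary attribute)
    aac: float      # discriminant analysis score of amino acid content
    alm1: float     # ALOM membrane spanning region score
    alm2: float     # ALOM score after excluding cleavable signal regions
    site: str       # class label (one of 8 states)


def parse_record(line: str) -> EcoliRecord:
    """Parse one whitespace-separated row (assumed layout) into a record."""
    parts = line.split()
    if len(parts) != 9:
        raise ValueError(f"expected 9 fields, got {len(parts)}")
    accession, *scores, site = parts
    return EcoliRecord(accession, *(float(s) for s in scores), site)


# Illustrative placeholder line, not taken from the data set.
record = parse_record("P00000 0.5 0.4 0.5 0.5 0.6 0.3 0.2 cp")
print(record.accession, record.site)
```

The two binary attributes are kept as floats here because the description says only that they take two values, not what those values are.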