powered by
This dataset contains protein sequences and their corresponding secondary structures, including beta-sheets (E), helices (H), and coils (_).
protein
A data frame with multiple rows and columns representing protein sequences and their secondary structures.
Sequence: Amino acid sequence (using 3-letter codes).
Sequence
Structure: Secondary structure of the protein (E for beta-sheet, H for helix, _ for coil).
Structure
Parameters: Additional parameters for neural networks (to be ignored).
Parameters
Biophysical_Constants: Biophysical constants (to be ignored).
Biophysical_Constants
# Load the dataset data(protein) # Print the first few rows of the dataset print(head(protein))
Run the code above in your browser using DataLab