protein

This dataset contains protein sequences and their corresponding secondary structures, including beta-sheets (E), helices (H), and coils (_).

datasets

The Truncated Factor Model is a statistical model designed to handle specific data structures in data analysis. This R package focuses on the Sparse Online Principal Component Estimation method, which is used to calculate data such as the loading matrix and specific variance matrix for truncated data, thereby better explaining the relationship between common factors and original variables. Additionally, the R package also provides other equations for comparison with the Sparse Online Principal Component Estimation method.The philosophy of the package is described in thesis. (2023) <doi:10.1007/s00180-022-01270-z>.

Guangbao Guo

Sparse Online Principal Component for Truncated Factor Model

Beibei Wu

protein function

A data frame with multiple rows and 2 columns representing protein sequences and their secondary structures.<dl>
<dt>V1</dt>
<dd>Amino acid sequence (using 3-letter codes).</dd><dt>V2</dt>
<dd>Secondary structure of the protein (E for beta-sheet, H for helix, _ for coil).</dd>
</dl>

Format

Protein Secondary Structure Data — protein

A data frame with multiple rows and 2 columns representing protein sequences and their secondary structures.<dl>
<dt>V1</dt>
<dd>Amino acid sequence (using 3-letter codes).</dd>

<dt>V2</dt>
<dd>Secondary structure of the protein (E for beta-sheet, H for helix, _ for coil).</dd>


</dl>

protein: Protein Secondary Structure Data

Description

Usage

Arguments

Format

Details

Examples