protein

This real data set consists of a dissimilarity matrix derived from the structural
comparison of 213 protein sequences. Each of these proteins is known to belong to one of four
classes of globins: hemoglobin-alpha (HA), hemoglobin-beta (HB), myoglobin (M) and
heterogeneous globins (G).

datasets

Various clustering algorithms that produce a credal partition,
i.e., a set of Dempster-Shafer mass functions representing the membership of objects
to clusters. The mass functions quantify the cluster-membership uncertainty of the
objects. The algorithms are: Evidential c-Means, Relational Evidential c-Means,
Constrained Evidential c-Means, Evidential Clustering, Constrained Evidential
Clustering, Evidential K-nearest-neighbor-based Clustering, Bootstrap Model-Based
Evidential Clustering, Belief Peak Evidential Clustering, Neural-Network-based
Evidential Clustering.

Thierry Denoeux

evclust

Evidential Clustering

protein function

A list with three elements:<dl>
<dt>D</dt>
<dd>The 213x213 dissimilarity matrix.</dd><dt>class</dt>
<dd>A 213-vector containing the class encoded a a factor with four levels:
"G", "HA", "HB", "M".</dd><dt>y</dt>
<dd>A 213-vector containing the class encoded by an integer between 1 and 4.</dd>
</dl>

Format

Protein dataset — protein

A list with three elements:<dl>
<dt>D</dt>
<dd>The 213x213 dissimilarity matrix.</dd>

<dt>class</dt>
<dd>A 213-vector containing the class encoded a a factor with four levels:
"G", "HA", "HB", "M".</dd>

<dt>y</dt>
<dd>A 213-vector containing the class encoded by an integer between 1 and 4.</dd>


</dl>

protein: Protein dataset

Description

Usage

Arguments

Format

References

Examples