Collection of real sub-datasets used in the examples of the HEMDAG package
data(graph)
data(labels)
data(scores)
data(wadj)
data(test.index)The DAG g contained in graph data is an object of class graphNEL. The graph g has 23 nodes and 30 edges and
represents the "ancestors view" of the HPO term Camptodactyly of finger ("HP:0100490").
The matrix L contained in the labels data is a 100 X 23 matrix, whose rows correspondes to genes
(Entrez GeneID) and columns to HPO classes.
\(L[i,j]=1\) means that the gene \(i\) belong to class \(j\), \(L[i,j]=0\) means that the gene \(i\) does not belong to class \(j\).
The classes of the matrix L correspond to the nodes of the graph g.
The matrix S contained in the scores data is a named 100 X 23 flat scores matrix, representing the likelihood
that a given gene belongs to a given class: higher the value higher the likelihood. The classes of the matrix \(S\) correspond
to the nodes of the graph g.
The matrix W contained in the wadj data is a named 100 X 100 symmetric weighted adjacency matrix, whose rows and
columns correspond to genes.The genes names (Entrez GeneID) of the adjacency matrix W correspond to the genes names of the
flat scores matrix S and to genes names of the target multilabel matrix L.
The vector of integer numbers test.index contained in the test.index data refers to the index of the examples of the scores
matrix S to be used in the test set. It is useful only in holdout experiments.