Learn R Programming

LUCIDus (version 2.1.0)

sim2: simulated dataset 2

Description

A simulated dataset for integrated clustering with binary outcome. The data is simulated under cluster number K = 2.

Usage

sim2

Arguments

Format

A matrix of 22 columns, which are

G1 - G10

Genetic features, G1 to G5 are causal genes contributed to clustering, with OR = 2; G6 to G10 are null genes that is not related to clustering

Z1 - Z10

Biomarkers, Z1 to Z5 are causal biomarkers with delta Z = 4 between 2 clusters, Z6 to Z10 are noises with delta Z = 0. All biomarkers are assumed to be independent with each other

Y

Outcome of interest, the odds ratio of the cluster is 2

X

Latent cluster assignment for each observation