This function is used to identify the recitified list of transcription factors for individual target genes after analysing the variable importance scores (where non-essential Tfs are pruned). These list of Tfs are usually different for individual target genes. Hence we maintain this in the form an adjacency matrix where the rownames correspond to all the Tfs and colnames correspond to all the target genes. Each column is a binary vector where all the values corresponding to the rectified Tfs active for that target are 1 while rest of the values are zeros.
get_colids(A, ideal_k, tfs, targets, Ntfs, Ntargets)
Adjacency Matrix A obtained after the GBM and refinement step.
A vector containing the optimal value of k (no of active TFs) for each target gene obtained from select_ideal_k
.
List of names of transcription factors.
List of names of target genes.
Total number of transcription factors used in the experiment.
Total number of target genes used in the experiment.
The function returns an adjacency matrix where the rownames correspond to all the Tfs and colnames correspond to all the target genes. Each column is a binary vector where all the values corresponding to the rectified Tfs active for that target are 1 while rest of the values are zeros.