getTCGA(disease = "GBM", data.type = "RNASeq2", type = "", filter = "Y",
p = getOption("mc.cores", 2L), clinical = FALSE, cvars = "OS")GBM" for glioblastoma multiforme.RNASeq2" for gene level RNA-Seq data from the second pipeline (RNASeqV2).FALSE.OS" for overall survival.NULL if clinical=FALSEdat and clinical data as specified by cvars. Thus, each matrix of size sample x (cvars + gene); NULL if clinical=FALSE or cvars is not a valid name for clinical covariate.dat.disease include "ACC", "BLCA", "BRCA", "CESC", "CHOL", "COAD", "COADREAD", "DLBC", "ESCA", "FPPP", "GBM", "GBMLGG", "HNSC", "KICH", "KIPAN", "KIRC", "KIRP", "LAML", "LGG", "LIHC", "LUAD", "LUSC", "MESO", "OV", "PAAD", "PCPG", "PRAD", "READ", "SARC", "SKCM", "STAD", "TGCT", "THCA", "THYM", "UCEC", "UCS", and "UVM". Values for data.type include "RNASeq2", "RNASeq", "miRNASeq", "CNA_SNP", "CNV_SNP", "CNA_CGH", "Methylation", "Mutation", "mRNA_Array", and "miRNA_Array". Note that not all combinations are permitted; Appendix A of the package vignette outlines all values of disease and data.type accommodated by TCGA2STAT.The type parameter should only be used along with these data.type parameters:
RNASeq - "count" for raw read counts (default); "RPKM" for normalized read counts (reads per kilobase per million mapped reads).
miRNASeq - "count" for raw read counts (default); "rpmmm" for normalized read counts.
Mutation - "somatic" for non-silent somatic mutations (default); "all" for all mutations.
Methylation - "27K" platform (default); "450K" platform.
CNA_CGH - "415K" for CGH Custom Microarray 2x415K (default); "244A" for CGH Microarray.
mRNA_Array - "G450" for Agilent 244K Custom Gene Expression G4502A (default); "U133" for Affymetrix Human Genome U133A 2.0 Array; "Huex" for Affymetrix Human Exon 1.0 ST Array.
The Level III RNA-Seq, miRNA-Seq, mRNA-array, and miRNA-array data imported are at gene level, but not the mutation, copy number alterations/variation (CNA/CNV), and methylation data. Our package processes and aggregates the mutation and CNA/CNV data at the gene level. The mutation data imported are in MAF files, where each file contains mutations found for the particular patient, and the number of mutations differs across patients. We filter the mutation data based on status and variant classification and then aggregate the filtered data at the gene level. The Level III CNA/CNV data imported are in segments; therefore we employ the CNTools package to merge the segmented data into gene-level data. The methylation data imported is at probe level where each probe represents a CpG site. As methylation profiles at different CpG sites within the same gene could vary a lot, it would not be biological meaningful to aggregate the probe-level methylation data into gene-level data. We return the methylation data at probe level.
library(TCGA2STAT)
rsem.ov <- getTCGA(disease="OV", data.type="RNASeq2")
rnaseq.ov <- getTCGA(disease="OV", data.type="RNASeq", type="RPKM")
rnaseq_os.ov <- getTCGA(disease="OV", data.type="RNASeq", type="RPKM", clinical=TRUE)
Run the code above in your browser using DataLab