runJunctionSeqAnalyses
function, and thus for most purposes users should not need to call this
function directly. It may be useful to advanced users performing non-standard
analyses.
estimateEffectSizes(jscs, method.expressionEstimation = c("feature-vs-gene", "feature-vs-otherFeatures"), effect.formula = formula(~ condition + countbin + condition : countbin), geneLevel.formula = formula(~ condition), calculate.geneLevel.expression = TRUE, keep.estimation.fit = FALSE, nCores=1, dispColumn="dispersion", verbose = TRUE)
JunctionSeqCountSet
. Usually initially created by
readJunctionSeqCounts
. Size factors must be
set, usually using functions estimateSizeFactors
and
estimateJunctionSeqDispersions
.
Determines the methodology used to generate feature expression estimates and relative fold changes. By default each feature is modeled separately. Under the default count-vector method, this means that the resultant relative fold changes will be a measure of the relative fold change between the feature and the gene as a whole. Alternatively, the "feature-vs-otherFeatures" method builds a large, complex model containing all features belonging to the gene. The coefficients for each feature are then "balanced" using linear contrasts weighted by the inverse of their variance. In general we have found this method to produce very similar results but less efficiently and less consistently. Additionally, this alternative method "multi-counts" reads that cover more than one feature. This can result in over-weighting of exonic regions with a large number of annotated variations in a small genomic area, as each individual read or read-pair may be counted many times in the model. Under the default option, no read or read-pair is ever counted more than once in a given model.
TRUE
, gene-level expression will be estimated using the same maximum-likelihood method used in other analyses. Default: TRUE
.
TRUE
, save the complete model fits for every gene. This will require a lot of memory, but may be useful for statistical diagnostics. Default: FALSE
.
fData(jscs)
column in which the model dispersion is stored.
data(exampleDataSet,package="JctSeqData"); jscs <- estimateEffectSizes(jscs); ## Not run: # #Full example (from scratch): # # ######################################## # #Set up example data: # decoder.file <- system.file( # "extdata/annoFiles/decoder.bySample.txt", # package="JctSeqData"); # decoder <- read.table(decoder.file, # header=TRUE, # stringsAsFactors=FALSE); # gff.file <- system.file( # "extdata/cts/withNovel.forJunctionSeq.gff.gz", # package="JctSeqData"); # countFiles <- system.file(paste0("extdata/cts/", # decoder$sample.ID, # "/QC.spliceJunctionAndExonCounts.withNovel.forJunctionSeq.txt.gz"), # package="JctSeqData"); # ######################################## # #Advanced Analysis: # # #Make a "design" dataframe: # design <- data.frame(condition = factor(decoder$group.ID)); # #Read the QoRTs counts. # jscs = readJunctionSeqCounts(countfiles = countFiles, # samplenames = decoder$sample.ID, # design = design, # flat.gff.file = gff.file # ); # #Generate the size factors and load them into the JunctionSeqCountSet: # jscs <- estimateJunctionSeqSizeFactors(jscs); # #Estimate feature-specific dispersions: # jscs <- estimateJunctionSeqDispersions(jscs); # #Fit dispersion function and estimate MAP dispersion: # jscs <- fitJunctionSeqDispersionFunction(jscs); # #Test for differential usage: # jscs <- testForDiffUsage(jscs); # #Estimate effect sizes and expression estimates: # jscs <- estimateEffectSizes( jscs); # # ## End(Not run)
Run the code above in your browser using DataCamp Workspace