Learn R Programming

contiBAIT (version 1.0.0)

preprocessStrandTable,StrandFreqMatrix-method: preprocessStrandTable -- remove low quality libraries and contigs before attempting to build a genome

Description

preprocessStrandTable -- remove low quality libraries and contigs before attempting to build a genome

Usage

## S3 method for class 'StrandFreqMatrix':
preprocessStrandTable(strandTable,
  strandTableThreshold = 0.8, filterThreshold = 0.8,
  orderMethod = "libsAndConc", lowQualThreshold = 0.9, verbose = TRUE,
  minLib = 10, ignoreInternalQual = FALSE)

Arguments

strandTable
data.frame containing the strand table to use as input
strandTableThreshold
threshold at which to call a contig WW or CC rather than WC
filterThreshold
maximum number of libraries a contig can be NA or WC in
orderMethod
the method to oder contigs. currently libsAndConc only option. Set to FALSE to not order contigs based on library quality
lowQualThreshold
background threshold at which to toss an entire library
verbose
messages written to terminal
minLib
minimum number of libraries a contig must be present in to be included in the output
ignoreInternalQual
logical that prevents function for making an overall assessment of library quality. Very chimeric assemblies can appear low quality across all libraries.

Value

  • A list of one matrix and three quality data.frames -- 1: a matrix of WW/WC/WW calls for all contigs; 3: the quality of libraries used (based on frequencies outside expected ranges); 4: A data.frame of libraries that are of low quality and therefore excluded from analysis; 5: contigs that are present as WC in more libraries than expected. These are excluded from the strandStateMatrix, but are potentially worth investigating for chimerism.

Examples

Run this code
data("exampleStrandFreq")

strandStates <- preprocessStrandTable(exampleStrandFreq, lowQualThreshold=0.8)

show(strandStates[[1]]) # WW-WC-CC matrix

Run the code above in your browser using DataLab