Learn R Programming

hiReadsProcessor (version 1.8.2)

Functions to process LM-PCR reads from 454/Illumina data

Description

hiReadsProcessor contains set of functions which allow users to process LM-PCR products sequenced using any platform. Given an excel/txt file containing parameters for demultiplexing and sample metadata, the functions automate trimming of adaptors and identification of the genomic product. Genomic products are further processed for QC and abundance quantification.

Copy Link

Version

Version

1.8.2

License

GPL-3

Maintainer

Nirav V Malani

Last Published

February 15th, 2017

Functions in hiReadsProcessor (1.8.2)

addFeature

Add a specific feature/attribute to the sampleInfo object.
extractFeature

Extract a specific feature/attribute of the sampleInfo object.
getIntegrationSites

Obtain integration sites from BLAT output
read.psl

Read PSL file(s) outputted by BLAT
isuSites

Bin values or make ISUs by assigning a unique ID to them within discrete factors.
findLinkers

Find the 3' linkers and add results to SampleInfo object.
addListNameToReads

Prepend name attribute of a list to DNAStringSet
crossOverCheck

Remove values/positions which are overlapping between discrete groups based on their frequency.
findIntegrations

Find the integration sites and add results to SampleInfo object.
pslCols

Return PSL file columns with classes
primerIDAlignSeqs

Align a short pattern with PrimerID to variable length target sequences.
findPrimers

Find the 5' primers and add results to SampleInfo object.
findVector

Find vector DNA in reads and add results to SampleInfo object.
extractSeqs

Extract sequences for a feature in the sampleInfo object.
otuSites

Bin values or make OTUs by assigning a unique ID to them within discrete factors.
doRCtest

Test if pattern aligns better in +/- orientation.
getSonicAbund

Calculate breakpoint/sonic abundance of integration sites in a population
removeReadsWithNs

Remove sequences with ambiguous nucleotides.
write.psl

Write PSL file from dataframe or GRanges
read.seqsFromSector

Read fasta/fastq/sff given the path or sampleInfo object.
chunkize

Breaks an object into chunks of N size.
read.BAMasPSL

Reads a BAM/SAM file and converts it into a PSL like format.
seqProps

Sample Integration Sites Sequencing Data
splitByBarcode

Split DNAStringSet object using first X number of bases defined by a vector.
vpairwiseAlignSeqs

Align a short pattern to variable length target sequences.
blatListedSet

Align a listed DNAStringSet to a reference using gfClient or standalone BLAT.
startgfServer

Start/Stop a gfServer instance
write.listedDNAStringSet

Write a fasta file per sample in parallel
getSectorsForSamples

Get sectors for samples defined in the sampleInfo object.
sampleSummary

Simple summary of a sampleInfo object.
findAndTrimSeq

Find and trim a short pattern sequence from the subject.
findBarcodes

Demultiplex reads by their barcodes
hiReadsProcessor

Functions to process LM-PCR reads from 454/Illumina data
pslToRangedObject

Convert psl dataframe to GRanges
pairwiseAlignSeqs

Align a short pattern to variable length target sequences.
read.SeqFolder

Read contents of a sequencing folder and make a SimpleList object
replicateReads

Replicate sequences from DNAStringSet object using counts identifier or vector
read.blast8

Read blast8 file(s) outputted by BLAT
findAndRemoveVector

Find and trim vector sequence from reads.
pairUpAlignments

Pair up alignments in a GRanges object
dereplicateReads

Removes duplicate sequences from DNAStringSet object.
read.sampleInfo

Read a sample information file and format appropriate metadata.
clusterSites

Cluster/Correct values within a window based on their frequency given discrete factors
psl

PSL file output
findLTRs

Find the 5' LTRs and add results to SampleInfo object.
blatSeqs

Align sequences using BLAT.
splitSeqsToFiles

Split DNA sequences into smaller files.
trimSeqs

Trim sequences from a specific side.
troubleshootLinkers

Compare LTRed/Primed sequences to all linkers.
annotateSites

Find the 5' primers and add results to SampleInfo object.