ChipDataSet
, which
is a container for holding processed sequencing data and the results of
all downstream analyses. All the slots of the created object are filled
during the workflow by applying specific functions to the object directly.constructCDS(peaks, reads, region, TxDb, tssOf = c("gene", "transcript"),
tss.region = c(-2000, 2000), reduce.peaks = FALSE, gapwidth = 1000,
fragment.size, unique = TRUE, swap.strand = FALSE, param = NULL)
GRanges
. Genomic region(s) to
extract reads from. If not supplied, all the reads from a BAM file are
extracted.TxDb
object.Character
. Extract Transcription Start Site (TSS)
regions from either "gene" or "transcript" annotations. Default: "gene".Logical
. Whether to merge neighboring peaks.
Default: FALSE.Numeric
. A minimum distance (in bp) between peaks to
merge. Default: 1000.Numeric
. Extend read length to the fragment size.Logical
. Whether to remove duplicated reads (based on
the genomic coordinates). Default: FALSE.Logical
. Whether to reverse the strand of the read.
Default: FALSE.ScanBamParam
object influencing what
fields and which records (reads) are imported from the Bam file.
Default: NULL.ChipDataSet
.constructCDS
initializes a
ChipDataSet
object, by providing the paths to the input
files and information relevant to the ChIP-seq library preparation
procedure. During the object construction the following steps are
executed:
GRanges
class.length
- the length of a peak (in base pairs).fragments
- total number of fragments overlapping
a peak region.density
- number of fragments per base pair of
the peak length.pileup
- highest fragment pileup in each peak
region.tssOverlap
- overlap (binary, yes/no) of the
peak with the annotated TSS region.As many peak-calling algorithms tend to divide broader peaks into the several narrower closely spaced peaks, it is advised to merge these end-to-end peaks to decrease the number of false positives and prevent unnecessary truncation of transcripts in the downstream analysis.
ChipDataSet
predictTssOverlap
### Load ChipDataSet object
data(cds)
### View a short summary of the object
cds
Run the code above in your browser using DataLab