Package: |
CONDOP |
Type: |
Package |
Version: |
1.0 |
Date: |
2016-01-01 |
License: |
GPL-3 |
pre.proc() pre-process input data for the main function run.CONDOP. run.CONDOP() integrates both genomic and transcriptomic features to develop an operon pair classifier to be used to re.define the operon map for a given bacterial organism and experimental condition.
Input data consists of annotations and count tables.
Annotations: GFF-like, representing gene/feature annotations; DOOR-like, containing operon annotations downloaded from DOOR database; FASTA-like, containing the genome sequence of the target organism.
GFF-like files can be downloaded from the NCBI genomes ftp directory, ftp://ftp.ncbi.nih.gov/genomes. While, DOOR-like files can be downloaded from http://csbl.bmb.uga.edu/DOOR/displayspecies.php. Finally, FASTA-like files can be downloaded from www.ncbi.nlm.nih.gov. Count tables represent RNA-seq expression profiles. A count table must contain two columns: fwd (coverage depth on the forward strand) and rev (coverage depth on the reverse strand). While, each row indicates the number of reads (depth coverage) at each nucleotide position. Different count table should be related to different experimental conditions. The user must use the output of the pre.proc function as first input for the run.CONDOP.