Trint.Dist.Feature

test_seq

Sequence dataset to be transformed into numeric feature vectors. It must be an object of class <code><a rd-options="" href="/link/DNAStringSet?package=EncDNA&version=1.0.2" data-mini-rdoc="EncDNA::DNAStringSet">DNAStringSet</a></code> containing at least two sequences.

This encoding scheme was first adopted by Wei et al. (2013) for the prediction of splice sites, together with MM1 features. In this technique, the distribution of trinucleotides is considered independently for the exon and intron regions of splice site motifs.
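The idea of treating the two regions independently can be illustrated with a minimal base-R sketch. This is not the EncDNA implementation: the helper name `trint_freq_sketch`, the placement of the exon/intron boundary at the sequence midpoint, the use of overlapping trinucleotides, and the normalisation to relative frequencies are all assumptions made for illustration, and the definition used by Wei et al. (2013) and by `Trint.Dist.Feature` may differ. The sketch takes plain character strings rather than a `DNAStringSet`.

```r
# Hypothetical helper, NOT part of EncDNA: relative frequencies of the 64
# trinucleotides, computed separately for the two halves of each sequence.
trint_freq_sketch <- function(seqs) {
  # At least two sequences, all of equal length, each half long enough
  # to contain one trinucleotide.
  stopifnot(length(seqs) >= 2, length(unique(nchar(seqs))) == 1)
  n <- nchar(seqs[1])
  half <- n %/% 2  # assumed exon/intron boundary at the midpoint
  stopifnot(half >= 3)

  # All 64 trinucleotides in lexicographic order (AAA, AAC, ..., TTT).
  bases <- c("A", "C", "G", "T")
  grid <- expand.grid(bases, bases, bases, stringsAsFactors = FALSE)
  trints <- paste0(grid[[3]], grid[[2]], grid[[1]])

  # Relative frequency of each overlapping trinucleotide in one region.
  count_region <- function(s) {
    starts <- seq_len(nchar(s) - 2)
    words <- substring(s, starts, starts + 2)
    as.numeric(table(factor(words, levels = trints))) / length(words)
  }

  # Encode one sequence as 64 upstream + 64 downstream frequencies.
  encode_one <- function(s) {
    c(count_region(substr(s, 1, half)),
      count_region(substr(s, half + 1, n)))
  }

  out <- t(vapply(seqs, encode_one, numeric(128), USE.NAMES = FALSE))
  colnames(out) <- c(paste0("up_", trints), paste0("down_", trints))
  out
}

# Two toy sequences of equal length (12 nt), invented for this example.
toy <- c("ACGTAGGTAAGT", "TTGCAGGTGAGT")
feat <- trint_freq_sketch(toy)
dim(feat)
```

Each row of the result corresponds to one input sequence, with the first 64 columns describing the upstream half and the last 64 the downstream half, so the two regions contribute separate features.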

Tri-nucleotide frequency

We describe fifteen splice site sequence encoding schemes that have been used in earlier studies to map splice site sequences into numeric feature vectors. These schemes can also be used to transform other nucleotide sequences into numeric form, provided the sequences are of equal length. They are intended for computational biologists working on classification (binary or multiclass) or prediction problems involving nucleic acid sequences of equal length.

Prabina Meher

EncDNA

Encoding of Nucleotide Sequences into Numeric Feature Vectors


Trint.Dist.Feature: Tri-nucleotide distribution-based encoding of nucleotide sequences.
