Density.Feature

Sequence dataset to be encoded, must be an object of class <code><a rd-options="" href="/link/DNAStringSet?package=EncDNA&version=1.0.2" data-mini-rdoc="EncDNA::DNAStringSet">DNAStringSet</a></code>.

test_seq

Each nucleotide sequence is encoded into a numeric vector of same length based on the distribution of nucleotides over the sequence. Here, two classes of dataset are not required for encoding, and each sequence is independently encoded instead. This encoding seheme was introduced by Wei et al. (2013) for prediction of donor and acceptor human splice sites along with the <code>MM1.Feature</code>.

Splice sites

Sequence encoding

We describe fifteen different splice site sequence encoding schemes that have been used in earlier studies for mapping of splice site sequences into numeric feature vectors. These encoding schemes will also be helpful for transforming other nucleotide sequences into numeric forms, provided they are of equal length. These encoding schemes will help the computational biologist working in the field of classification (binary or multiclass) or prediction involving nucleic acid sequences of equal length.

Prabina Meher

EncDNA

Encoding of Nucleotide Sequences into Numeric Feature Vectors

Density.Feature function

Sequence dataset to be encoded, must be an object of class <code><a rd-options='' href='DNAStringSet'>DNAStringSet</a></code>.

Density.Feature: Nucleotide sequence encoding with the distribution of trinucleotides.

Description

Usage

Arguments

Value

Details

References

Examples