Learn R Programming

DiffLogo

What is DiffLogo

The DiffLogo tool is a R package for the visualization of differences between multiple motifs for different alphabets. The user supplies a set of motifs each represented as position weight matrices (PWMs) [1]. The DiffLogo package supports the comparison of two motifs by a single DiffLogo and the comparison of multiple motifs by a DiffLogo-table. DiffLogo is based on the idea behind the well-known sequence logo [2], i.e. motifs are visualized position-wise based on two functions. First, the stackHeight function computes the height of each stack. Second, the baseDistribution function breaks down the stack height on the individual characters. The user is able to parametrise the individual functions with arbitrary functions stackHeight and baseDistribution. Default implementations of both functions are provided.

Who should use DiffLogo

DiffLogo is designed for researchers with any biological background and interest in computational biology. It is useful to document findings, share knowledge, and to present the outcome of motif prediction pipelines. It aims at an increase of the quality of comparative publications. In addition, DiffLogo eases the communication between bioinformatics scientists and biologists.

When should DiffLogo be used

DiffLogo is intended for the comparison of similar motifs. These motifs can come from different sources such as

  • different treatments, species, cell lines, or tissues
  • different motif prediction algorithms and configurations

Getting started

Download and install the R package DiffLogo available in the folder 'built'. Load your motifs of interest as PWMs [1] to R. Please find example motifs in the file 'exampleLogos.R' and in the folder 'inst/pwm' (extracted from [3]). Please find the vignette 'DiffLogoBasics.Rnw' of the DiffLogo package for example code.

Use the function diffLogoFromPwm(PWM1, PWM2) to visualize the difference between two motifs.

Use the function diffLogoTable(listOfPWMs) to visualize the pairwise difference between more than two motifs.

References

[1] http://en.wikipedia.org/wiki/Position_weight_matrix [2] Schneider TD, Stephens RM. 1990. Sequence Logos: A New Way to Display Consensus Sequences. Nucleic Acids Res. 18:6097-6100 [3] Eggeling, R., Gohr, A., Keilwagen, J., Mohr, M., Posch, S., Smith, A.D., Grosse, I.: On the value of intra-motifdependencies of human insulator protein ctcf. PLoS ONE 9(1), 85629 (2014). doi:10.1371/journal.pone.0085629

Copy Link

Version

Version

1.2.1

License

GPL (>= 2)

Issues

Pull Requests

Stars

Forks

Maintainer

Hendrik Treutler

Last Published

February 15th, 2017

Functions in DiffLogo (1.2.1)

ASN

ASN alphabet
sumOfAbsProbabilityDifferences

sum of absolute probability differences
shannonDivergence

shannon divergence
diffLogoTable

Draw DiffLogo-table
DNA

DNA alphabet
seqLogo

Draw sequence logo
diffLogoFromPwm

Draw DiffLogo from PWM
probabilities

probabilities
getPwmFromAlignment

Create PWM from alignment
RNA

RNA alphabet
lossOfAbsICDifferences

the change of information content
sumProbabilities

sum of probabilities, i.e. 1.0
diffLogo

Draw DiffLogo
createDiffLogoObject

DiffLogo object
sumOfAbsICDifferences

sum of absolute information content differences
differenceOfICs

normalized information content differences
Alphabet

built alphabet
normalizedDifferenceOfProbabilities

normalized probability differences
informationContent

information content