qdap (version 2.4.3)

kullback_leibler: Kullback Leibler Statistic

Description

A proximity measure between two probability distributions applied to speech.

Usage

kullback_leibler(x, y = NULL)

Arguments

x

A numeric vector, matrix or data frame.

y

A second numeric vector if x is also a vector. Default is NULL.

Value

Returns a matrix of the Kullback Leibler measure between each vector of probabilities.

Details

Uses Kullback & Leibler's (1951) formula: $$D_{KL}(P \parallel Q) = \sum_i P_i \ln\left(\frac{P_i}{Q_i}\right)$$
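
As a rough illustration of the formula, here is a minimal base R sketch. The helper name kl_sketch is hypothetical and is not qdap's implementation; it assumes p and q are probability vectors over the same support that each sum to 1, and that q is nonzero wherever p is nonzero.

```r
# Hypothetical helper (not part of qdap): Kullback-Leibler measure between
# two probability vectors p and q defined over the same outcomes.
kl_sketch <- function(p, q) {
  stopifnot(length(p) == length(q))
  keep <- p > 0  # terms with P_i = 0 contribute 0 by convention
  sum(p[keep] * log(p[keep] / q[keep]))  # log() is the natural log in R
}

p <- c(0.5, 0.5)
q <- c(0.25, 0.75)

kl_sketch(p, q)  # about 0.1438
kl_sketch(q, p)  # about 0.1308
kl_sketch(p, p)  # 0
```

The first two calls give different values, which shows that the measure is not symmetric in P and Q.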

References

Kullback, S., & Leibler, R. A. (1951). On information and sufficiency. Annals of Mathematical Statistics, 22(1), 79-86. doi:10.1214/aoms/1177729694

Examples

# NOT RUN {
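library(qdap)

# Word frequencies by speaker, as a data frame (wfdf) and as a matrix (wfm)
p.df <- wfdf(DATA$state, DATA$person)
p.mat <- wfm(text.var = DATA$state, grouping.var = DATA$person)

# Measure between each pair of speakers
kullback_leibler(p.mat)
(x <- kullback_leibler(p.df))
print(x, digits = 5)

# Measure between two speakers' frequency vectors
kullback_leibler(p.df$greg, p.df$sam)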

## p.df2 <- wfdf(raj$dialogue, raj$person)
## x <- kullback_leibler(p.df2)
# }
