Learn R Programming

OnomasticDiversity (version 0.1)

fHill: Calculate the Hill's diversity numbers

Description

This function obtains the Hill's diversity numbers introduced by M. O. Hill. It is a method for quantifying species biodiversity that can be adapted to the context of onomastic.

Usage

fHill(x, k, n, location, lambda)

Value

A dataframe containing the following components:

location

represents the grouping element, for example the communities / regions.

hill

the value of the Hill's diversity index.

Arguments

x

dataframe of the data values for each species.

k

name of a variable which represents absolute frequency for each species.

n

name of a variable which represents total number of individuals.

location

represents the grouping element.

lambda

free parameter.

Author

Maria Jose Ginzo Villamayor

Details

For a community \(i\), the Hill's diversity numbers are defined by the expression \(J(\lambda) = \left(\sum \limits_{k\in S_i} p_{ki}^\lambda\right)^{\frac{1}{1-\lambda}}\) with the restriction \(\lambda \geq 0\) where \(p_{ki}\) represents the relative frequency of species \(k\) and \(S_i\) are all species at the community, species richness, and \(\lambda\) is a free parameter. (This is equivalent to the exponential of Renyi's generalised entropy). The Renyi entropy of order \(\lambda\), where \(\lambda \geq 0\) and \(\lambda \neq 1\), is defined as \(\mathrm{H}_{\lambda}(X)=\frac{1}{1-\lambda} \log \left(\sum \limits_{i=1}^{n} p_{i}^{\lambda}\right)\) Here, \(X\) is a discrete random variable with possible outcomes in the set \(\mathcal{A}=\left\{x_{1}, x_{2}, \ldots, x_{n}\right\}\) and corresponding probabilities \(p_{i} \doteq \operatorname{Pr}\left(X=x_{i}\right)\) for \(i=1, \ldots, n\). The logarithm is conventionally taken to be base 2, especially in the context of information theory where bits are used. If the probabilities are \(p_{i}=1 / n\) for all \(i=1, \ldots, n\), then all the Renyi entropies of the distribution are equal: \(\mathrm{H}_{\lambda}(X)=\log n\). In general, for all discrete random variables \(X, \mathrm{H}_{\lambda}(X)\) is a non-increasing function in \(\lambda\)..

Particular cases of \(\lambda\) values: \(\lambda = 0, J(0)=S_i\), it corresponds species richness; \(\lambda = 1, J(1)=e^{H_{t}}\), it corresponds the exponential of Shannon's entropy; and \(\lambda = 2, J(2)= D_{S_i}\), it corresponds the 'inverse' Simpson index.

In onomastic context, \(p_{ki}\) denotes the relative frequency of surname \(k\) in region (\(\approx\) community diversity context) \(i\) and \(S_i\) are all surnames in region \(i\).

References

Hill, M. O. (1973). Diversity and Evenness: a unifying notation and its consequences. Ecology, 54, 427--32.

See Also

fCressieRead.

Examples

Run this code
data(surnamesgal14)
result = fHill (x= surnamesgal14, k="number", n="population",
location  = "muni", lambda= 0)
result

data(namesmengal16)
result = fHill (x= namesmengal16, k="number", n="population",
location  = "muni", lambda= 0)
result

data(nameswomengal16)
result = fHill (x= nameswomengal16, k="number", n="population",
location  = "muni", lambda= 0)
result

Run the code above in your browser using DataLab