Computes the parametric version of Rao's index of quadratic entropy (Q) on different classes of numeric matrices using a moving window algorithm.
paRao(x, dist_m="euclidean", window=9, alpha=1,
method="classic", rasterOut=TRUE, lambda=0,
na.tolerance=1.0, rescale=FALSE, diag=TRUE,
simplify=2, np=1,cluster.type="SOCK", debugging=FALSE)
input data may be a matrix, a Spatial Grid Data Frame, a RasterLayer or a list of these objects. In the latter case, if method="classic"
only the first element of the list will be considered.
define the type of distance to be calculated between numerical categories. dist_m
can be a character string which defines the name of the distance to derive such as "euclidean". The distance names allowed are the same as for proxy::dist
. Alternatively, dist_m
can be a function which calculates an user defined distance, (i.e., function(x,y) {return(cos(y-x)-sin(y-x))}
) or a matrix of distances. If method="multidimension"
then only "euclidean", "manhattan", "canberra", "minkowski" and "mahalanobis" can be used. Default value is "euclidean".
the side of the square moving window, it must be a odd numeric value greater than 1 to ensure that the target pixel is in the center of the moving window. Default value is 3. If proxy::dist
is a matrix then the function will assume that this is the distance matrix, and therefore no distance will be derived.
weight for the distance matrix. If alpha = 0
, distances will be averaged with an geometric mean, if alpha=1
with an arithmetic mean, if alpha = 2
with a quadratic mean, alpha = 3
with a cubic mean and so on and so forth. if alpha tends to infinite (i.e., higher than the maximum integer allowed in R) then the maximum of the distances will be taken. "alpha" can be a vector with length greater than 1, in this case Rao's index will be calculated over x for each value in the sequence.
Currently, there are two ways to calculate the parameteric version of Rao's index. If method="classic"
, then the normal parameteric Rao's index will be calculated on a single matrix. If method="multidimension"
(experimental!) a list of matrices must be provided as input. In the latter case, the overall distance matrix will be calculated in a multi- or hyper-dimensional system by using the distance measure defined through the function argument dist_m
. Each pairwise distance is then multiplied by the inverse of the squared number of pixels in the considered moving window, and the Rao's Q is finally derived by applying a summation. Default value is "classic"
Boolean, if TRUE the output will be a RasterLayer object with x as a raster template.
the value of the lambda of Minkowski's distance. Considered only if dist_m = "minkowski"
and method="multidimension". Default value is 0.
Numeric value \((0.0-1.0)\) which indicates the proportion of NA values that will be tolerated to calculate Rao's index in each moving window over x. If the relative proportion of NA's in a moving window is bigger than na.tolerance, then the value of the window will be set as NA, otherwise Rao's index will be calculated considering the non-NA values. Default values is 0.0 (i.e., no tolerance for NA's). Defaut value is 1.0.
Boolean. Considered only if method="multidimension"
. If TRUE, each element of x
is rescaled and centered.
Boolean. If TRUE then the diagonal of the distance matrix is filled with 0's, otherwise with NA's. If diag=TRUE
and alpha=0
, the output matrix will inexorably be composed of 0's.
Number of decimal places to be retained to calculate distances in Rao's index. Only if x is floats.
the number of processes (cores) which will be spawned. Default value is 2.
the type of cluster which will be created. The options are "MPI" (which calls "makeMPIcluster"), "FORK" and "SOCK" (which call "makeCluster"). Default type is "SOCK".
a boolean variable set to FALSE by default. If TRUE, additional messages will be printed. For debugging only.
A list of matrices of dimension dim(x)
with length equal to the length of alpha. If rasterOut=TRUE
, then the output is a list of RasterLayer objects.
The parametric Rao's Index (\(Q\)) is an extenstion of Rao's Index which considers a generalised mean between distances. The generalised formula for the parametric Rao's index is \(Q_a = (\sum_{i,j=1}^{N}{1/N^2}\times{d^{\alpha}_{j,j}})^{1/{\alpha}}\). Where N is the number of numerical categories, i and j are pair of numerical categories in the same moving window and alpha is a weight given to distances. In the "multidimension" Rao's index, first distances among categories are calculated considering more than one layer, then the pairwise distance between each pair of numerical categories is multiplied to the square of the size of the moving window (this is somewhat the same as to calculate the variance of the multidimensional distance [1].). The theoretical minimum of Rao's Q is 0, when all categories in a window have distance 0. If the distance chosen to calculate Rao's Index ranges between 0 and 1, the maximum value of Rao's Index equals the Simpson Index of Diversity \(1-1/S_i\) where S is the number of categories in window i (given alpha=1).
[1] Rocchini, D., M. Marcantonio, and C. Ricotta (2017). Measuring Rao<U+2019>s Q diversity index from remote sensing: An open source solution. Ecological Indicators. 72: 234<U+2013>238.
# NOT RUN {
#Minimal example; compute classic Rao's index
a <- matrix(c(10,10,10,20,20,20,20,30,30),ncol=3,nrow=3)
out <- paRao(x=a,window=3,dist_m="euclidean",na.tolerance=0,alpha=1)
#Compute Rao's index with a vector of alpha values
out <- paRao(x=a,window=3,dist_m="euclidean",na.tolerance=0,alpha=1:5)
#Compute multidimensional Rao's index with alpha = 1
a <- matrix(c(10,10,10,20,20,20,20,30,30),ncol=3,nrow=3)
b <- matrix(c(100,100,100,200,200,200,200,300,300),ncol=3,nrow=3)
out <- paRao(x=list(a,b),window=3,dist_m="euclidean",
na.tolerance=0,alpha=1,method="multidimensional")
# }
Run the code above in your browser using DataLab