- dts
A matrix for univariate functional data (of size n
observations by p
domain
points) or a 3-dimensional array for multivariate functional data (of size n
observations by p
domain points by d
dimension). Only the outlyingness transformation ("O(X)(t)")
supports multivariate functional data so the sequence of transformation must always start with outlyingness ("O(X)(t)")
whenever a multivariate functional data is parsed to dts
.
- sequence
A character vector usually of length between 1 and 6 containing any of the strings: "T0", "D0", "T1", "T2",
"D1", "D2"
and "O"
(in any order). These sequence of strings specifies the sequence of transformations to be applied
on the data and their meanings are described as follows:
"T0"
and "D0"
Functional boxplot applied on raw data (no transformation is applied).
"T1"
Apply vertical alignment on data, i.e. subtract from each curve its expectation over the domain of evaluation.
"T2"
Apply normalization on data, i.e. divide each curve by its L-2 norm.
"D1"
and "D2"
Apply one order of differencing on data.
"O"
Find the pointwise outlyingness of data. For multivariate functional data, this transformation replaces the multivariate
functional data with a univariate functional data of pointwise outlyingness.
Examples of sequences of transformations include: "T0"
, c("T0", "T1", "D1")
, c("T0", "T1", "T2")
,
c("T0", "D1", "D2")
and c("T0", "T1", "T2", "D1", "D2")
. See Details for their meaning.
depth_method
A character value specifying depth/outlyingness method to use in the functional boxplot applied after each stage of transformation.
Note that the same depth/outlyingness method is used in the functional boxplot applied after each transformation in the sequence. The following methods
are currently supported:
- "mbd":
The modified band depth with bands defined by 2 functions.
Uses the algorithm of Sun et al. (2012).
"tvd"
The total variation depth of Huang and Sun (2019).
"extremal"
The extremal depth of Narisetty and Nair (2016).
"dirout"
Uses the robust distance of the mean and variation of directional outlyingness (dir_out
)
defined in Dai and Genton (2018). Since this method is a measure of outlyingness of a function the negative of the
computed robust distance is used in ordering the functions.
"linfinity"
The L-infinity depth defined in Long and Huang (2015) is used in ordering functions.
"bd"
Uses the band depth with bands defined by 2 functions according to the algorithm of Sun et al. (2012)
erld
Uses the extreme rank length depth defined in Myllymäki et al. (2017) and mentioned in Dai et al. (2020).
"dq"
Uses the directional quantile (DQ) defined in Myllymäki et al. (2017) and mentioned in Dai et al. (2020).
Since DQ is a measure of outlyingness, the negative of the DQ values is used in ordering the functions.
save_data
A logical. If TRUE, the intermediate transformed data are returned in a list.
emp_factor
The empirical factor for functional boxplot. Defaults to 1.5.
central_region
A value between 0 and 1 indicating the central region probability for functional_boxplot. Defaults to 0.5.
erld_type
If depth_method = "erld"
, the type of ordering to use in computing the extreme rank length depth (ERLD).
Can be one of "two_sided"
, "one_sided_left"
or "one_sided_right"
. A "two_sided"
ordering is used by
default if erld_type
is not specified and depth_method = "erld"
. The "one_sided_right"
ERLD is especially useful for
ordering functions of outlyingness (the output of the "O"
transformation) since it considers only large values as extreme.
See extreme_rank_length for details.
dq_quantiles
If depth_method = "dq"
, a numeric vector of length 2 specifying the probabilities
of upper and lower quantiles. Defaults to c(0.025, 0.975)
for the upper and lower 2.5% quantiles.
See directional_quantile for details.
n_projections
An integer indicating the number of random projections to use in computing the point-wise outlyingness if a 3-d array
is specified in dts
i.e. (multivariate functional data), and the transformation "O"
is part of the sequence of transformations
parsed to sequence
. Defaults to 200L.
seed
The random seed to set when generating the random directions in the computation of the point-wise outlyingness. Defaults to NULL.
in which case a seed is not set.