TRC starts from bivariate Spearman correlations and obtains
a positive definite covariance matrix by back-transforming robust
univariate medians and mads of the eigenspace. TRC can cope
with missing values by a regression imputation using the a robust
regression on the best predictor and it takes sampling weights
into account.
TRC(
data,
weights,
overlap = 3,
mincor = 0,
robust.regression = "rank",
gamma = 0.5,
prob.quantile = 0.75,
alpha = 0.05,
md.type = "m",
monitor = FALSE
)TRC returns a list whose first component output is a
sublist with the following components:
sample.sizeNumber of observations
number.of.variablesNumber of variables
number.of.missing.itemsNumber of missing values
significance.level1 - alpha
computation.timeElapsed computation time
mediansComponentwise medians
madsComponentwise mads
centerLocation estimate
scatterCovariance estimate
robust.regressionInput parameter
md.typeInput parameter
cutpointThe default threshold MD-value for the cut-off of outliers
The further components returned by TRC are:
outindIndicator of outliers
distMahalanobis distances (with missing values)
a data frame or matrix with the data.
sampling weights.
minimum number of jointly observed values for calculating the rank correlation.
minimal absolute correlation to impute.
type of regression: "irls" is iteratively
reweighted least squares M-estimator, "rank" is based on the rank
correlations.
minimal number of jointly observed values to impute.
if mads are 0, try this quantile of absolute deviations.
(1 - alpha) Quantile of F-distribution is used for cut-off.
type of Mahalanobis distance when missing values occur:
"m" marginal (default), "c" conditional.
if TRUE, verbose output.
Beat Hulliger
TRC is similar to a one-step OGK estimator where the starting
covariances are obtained from rank correlations and an ad hoc missing
value imputation plus weighting is provided.
Béguin, C. and Hulliger, B. (2004) Multivariate outlier detection in incomplete survey data: the epidemic algorithm and transformed rank correlations, JRSS-A, 167, Part 2, pp. 275-294.
data(bushfirem, bushfire.weights)
det.res <- TRC(bushfirem, weights = bushfire.weights)
PlotMD(det.res$dist, ncol(bushfirem))
print(det.res)
Run the code above in your browser using DataLab