Performs maximum trimmed likelihood estimation by an exact algorithm (full enumeratiom of all k-trimmed subsets)
fulltle(Idt, CovCase=1:4, SelCrit=c("BIC","AIC"), alpha=0.75,
use.correction=TRUE, getalpha="TwoStep",
rawMD2Dist=c("ChiSq","HardRockeAsF","HardRockeAdjF"),
MD2Dist=c("ChiSq","CerioliBetaF"),
eta=0.025,multiCmpCor=c("never","always","iterstep"),
outlin=c("MidPandLogR","MidP","LogR"), reweighted=TRUE,
otpType=c("OnlyEst","SetMD2andEst"), force=FALSE, …)
An IData object representing interval-valued entities.
Configuration of the variance-covariance matrix: a set of integers between 1 and 4.
The model selection criterion.
Numeric parameter controlling the size of the subsets over which the trimmed likelihood is maximized; roughly alpha*Idt@NIVar observations are used for computing the trimmed likelihood. Allowed values are between 0.5 and 1.
whether to use finite sample correction factors; defaults to TRUE
.
Argument specifying if the ‘alpha’ parameter (roughly the percentage of the sample used for computing the trimmed likelihood) should be estimated from the data, or if the value of the argument ‘alpha’ should be used instead. When set to “TwoStep”, ‘alpha’ is estimated by a two-step procedure with the value of argument ‘alpha’ specifying the size of the samples used in the first step. Otherwise, with the value of argument ‘alpha’ is used directly.
The assumed reference distribution of the raw MCD squared distances, which is used to find to cutoffs defining the observations kept in one-step reweighted MCD estimates. Alternatives are ‘ChiSq’, ‘HardRockeAsF’ and ‘HardRockeAdjF’, respectivelly for the usual Qui-squared, and the asymptotic and adjusted scaled F distributions proposed by Hardin and Rocke (2005).
The assumed reference distributions used to find cutoffs defining the observations assumed as outliers. Alternatives are “ChiSq” and “CerioliBetaF” respectivelly for the usual Qui-squared, and the Beta and F distributions proposed by Cerioli (2010).
Nominal size of the null hypothesis that a given observation is not an outlier. Defines the raw MCD Mahalanobis distances cutoff used to choose the observations kept in the reweightening step.
Whether a multicomparison correction of the nominal size (eta) for the outliers tests should be performed. Alternatives are: ‘never’ -- ignoring the multicomparisons and testing all entities at ‘eta’. ‘always’ -- testing all n entitites at 1.- (1.-‘eta’^(1/n); and ‘iterstep’ -- as sugested by Cerioli (2010), make an initial set of tests using the nominal size 1.- (1-‘eta’^(1/n), and if no outliers were detected stop. Otherwise, make a second step testing for outliers at ‘eta’.
The type of outliers to be consideres. “MidPandLogR” if outliers may be present in both MidPpoints and LogRanges, “MidP” if outliers are only present in MidPpoints, or “LogR” if outliers are only present in LogRanges.
should a (Re)weighted estimate of the covariance matrix be used in the computation of the trimmed likelihood or just a “raw” covariance estimate; default is (Re)weighting.
The amount of output returned by fulltle. Current options are “OnlyEst” (default) where only an ‘IdtE’ object with the fulltle estimates is returned and “SetMD2andEst” which returns a list with an ‘IdtE’ object of fulltle estimates, a vector with the final trimmed subset elements used to compute these estimates and the corresponding robust squared Mahalanobis distances.
A boolean flag indicating whether, for moderate or large data sets the algorithm should proceed anyway, regardless of an expected long excution time, due to exponential explosions in the number of different subsets that need to be avaluated by fulltle
Further arguments to be passed to internal functions of ‘fulltle’.
If argument ‘otpType’ is set to “OnlyEst”, an object of class ‘IdtE’ with the fulltle estimates, their log-likelihood values, and the value of the comparison criterion used to select the covariance configurations.
If argument ‘otpType’ is set to “SetMD2andEst” a list with the following components:
An object of class ‘IdtE’ with the fulltle estimates, their log-likelihood values, and the value of the comparison criterion used to select the covariance configurations.
A vector with the final trimmed subset elements used to compute the fasttle estimates.
A vector with the robust squared Mahalanobis distances used to select the trimmed subset.
A vector of length two containing the consistency correction factor and the finite sample correction factor of the final estimate of the covariance matrix.
A matrix with the raw MCD estimator used to compute the robust squared Mahalanobis distances of RobMD2.
A vector of length two containing the consistency correction factor and the finite sample correction factor of the raw estimate of the covariance matrix.
signature(Idt = "IData")
Performs maximum trimmed likelihood estimation for interval-valued data using an exact algorithm, and assuming a Gaussian distribution, and considering alternative variance-covariance matrix configurations
Brito, P., Duarte Silva, A. P. (2012), Modelling Interval Data with Normal and Skew-Normal Distributions. Journal of Applied Statistics 39(1), 3--20.
Cerioli, A. (2010), Multivariate Outlier Detection with High-Breakdown Estimators. Journal of the American Statistical Association 105 (489), 147--156.
Hadi, A. S. and Luceno, A. (1997), Maximum trimmed likelihood estimators: a unified approach, examples, and algorithms. Computational Statistics and Data Analysis 25(3), 251--272.
Hardin, J. and Rocke, A. (2005), The Distribution of Robust Distances. Journal of Computational and Graphical Statistics 14, 910--927.
# Create an Interval-Data object containing the intervals for characteristics
# of 27 cars models.
CarsIdt <- IData(Cars[1:8],VarNames=c("Price","EngineCapacity","TopSpeed","Acceleration"))
#Display the first and last observations
# Estimate normal distributuion parameters robustly by full maximum trimmed likelihood,
CarsTE <- fulltle(CarsIdt)
cat("Cars data -- normal maximum trimmed likelhiood estimation results:\n")
print(CarsTE)
Run the code above in your browser using DataLab