Learn R Programming

Hidetify - High dimensional Influence Measure

What is Hidetify?

Hidetify is a high dimensional influence measure for identifying influential observations in high dimensional linear regression (number of features equal or greater than the number of samples). This package comes with the paper (Multiple detection of influential observations in high dimensional linear regression) describing the details of the procedure.

The package provides two main features.

Single Outlier Detection

The module shidetify apply a single outlier detection method to the data. Caution is required in the application of this method. Indeed, there is often a risk that the data may be affected by the adverse effects of swamping and masking. Unless you are sure of what you are doing, it is suggested that you use the mhidetify module.

Multiple Outlier Detection

The module mhidetify applies a group deletion procedure to mitigate the dual phenomenon of masking and swamping effects. It consists of three steps. The first stage applies an ultra conservative score to mitigate the swamping effect, the second stage uses the clean sample generated in the previous stage and applies an aggressive score to attenuate the masking phenomenon. Finally, the last step is concerned with the validation of the influential set generated by the two previous steps. The procedure is repeated iteratively until convergence is achieved.

Copy Link

Version

Install

install.packages('hidetify')

Monthly Downloads

1

Version

0.0.1

License

GPL-3

Maintainer

Amadou Barry

Last Published

August 20th, 2021

Functions in hidetify (0.0.1)

rcpp_HIM_sdetect

Arma Single Detection Statistic
mhidetify

Multiple detection asymmetric influential measure for high dimensional linear regression.
rcpp_shidetify

Arma Asymmetric Single Detection Statistic
rcpp_setdiff

Arma Set Difference of Subsets
ease_masking

Compute the max of the sum of a sequence of asymmetric influence measure.
arma_sample

Arma Samples and Permutations
hidetify

Identify the influential observations in high dimensional regression
ease_swamping

Compute the min of the min of a sequence of asymmetric influence measure
shidetify

Single detection asymmetric influential measure for high dimensional linear regression.
sim_hidetify_data

sim_hidetify_data for the hidetify package
vhidetify

Compute the single influence measure to validate the estimated influential set.
rcpp_mask_swamp_stat

Arma Masking and Swamping Statistics
rcpp_asymHIM_sdetect

Arma Asymmetric Multiple Detection Statistic