Learn R Programming

stray (version 0.1.1)

Anomaly Detection in High Dimensional and Temporal Data

Description

This is a modification of 'HDoutliers' package. The 'HDoutliers' algorithm is a powerful unsupervised algorithm for detecting anomalies in high-dimensional data, with a strong theoretical foundation. However, it suffers from some limitations that significantly hinder its performance level, under certain circumstances. This package implements the algorithm proposed in Talagala, Hyndman and Smith-Miles (2019) for detecting anomalies in high-dimensional data that addresses these limitations of 'HDoutliers' algorithm. We define an anomaly as an observation that deviates markedly from the majority with a large distance gap. An approach based on extreme value theory is used for the anomalous threshold calculation.

Copy Link

Version

Install

install.packages('stray')

Monthly Downloads

229

Version

0.1.1

License

GPL-2

Maintainer

Priyanga Dilini Talagala

Last Published

June 29th, 2020

Functions in stray (0.1.1)

data_b

A bimodal dataset with a micro cluster
stray

stray: A package for robust anomaly detection in data streams with concept drift
use_KNN

Find outliers using kNN distance with maximum gap
wheel1

wheel data set with inlier and outlier.
data_e

A bimodal dataset with an inlier
find_HDoutliers

Detect Anomalies in High Dimensional Data.
find_threshold

Find Outlier Threshold
display_HDoutliers

Display outliers with a scatterplot
data_f

A dataset with an outlier
data_c

A dataset with local anomalies and micro clusters
data_d

A wheel dataset with two inliers
ped_data

Dataset with pedestrian counts
data_a

A dataset with an outlier