⚠️ There's a newer version (0.1.1) of this package.

stray (version 0.1.0)

Anomaly Detection in High Dimensional and Temporal Data

Description

This package is a modification of the 'HDoutliers' package. The 'HDoutliers' algorithm is a powerful unsupervised algorithm for detecting anomalies in high-dimensional data, with a strong theoretical foundation. However, it has limitations that significantly hinder its performance under certain circumstances. This package implements the algorithm proposed in Talagala, Hyndman and Smith-Miles (2019), which detects anomalies in high-dimensional data while addressing these limitations of the 'HDoutliers' algorithm. An anomaly is defined as an observation that deviates markedly from the majority, with a large distance gap. The anomaly threshold is calculated using an approach based on extreme value theory.
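
The "large distance gap" idea in the description can be illustrated with a small base R sketch. This is not the code used inside 'stray': the helper name `knn_gap_score`, the toy data, and the cut-off used to inspect the result are all assumptions made for illustration, and the extreme value theory threshold is not reproduced here. For each point, the sketch takes the distances to its k nearest neighbours, finds the largest jump between successive distances, and uses the distance just after that jump as the score.

```r
# Illustrative sketch only; 'knn_gap_score' is a hypothetical helper,
# not a function exported by 'stray'.
knn_gap_score <- function(x, k = 3) {
  stopifnot(k >= 2)
  x <- as.matrix(x)
  d <- as.matrix(dist(x))        # pairwise Euclidean distances
  diag(d) <- Inf                 # ignore each point's distance to itself
  apply(d, 1, function(row) {
    nn <- sort(row)[seq_len(k)]  # distances to the k nearest neighbours
    gaps <- diff(nn)             # jumps between successive neighbour distances
    nn[which.max(gaps) + 1]      # distance reached just after the largest jump
  })
}

# Toy data: a 5 x 5 grid of typical points plus a micro cluster of two
# points far away from the grid (rows 26 and 27).
x <- rbind(as.matrix(expand.grid(0:4, 0:4)), c(20, 20), c(20.5, 20))
scores <- knn_gap_score(x, k = 3)

# Rows whose score is far above the rest (the cut-off of 10 is arbitrary).
which(scores > 10)
```

Because the score looks past the nearest neighbour to where the largest jump occurs, the two points in the micro cluster still receive high scores even though they sit close to each other. A plain nearest-neighbour distance would give both of them a small score.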

Install

install.packages('stray')

Monthly Downloads

229

Version

0.1.0

License

GPL-2

Maintainer

Priyanga Dilini Talagala

Last Published

December 17th, 2019

Functions in stray (0.1.0)

data_d

A wheel dataset with two inliers

use_KNN

Find outliers using kNN distance with maximum gap

stray

stray: A package for robust anomaly detection in data streams with concept drift

wheel1

wheel data set with inlier and outlier.

find_HDoutliers

Detect Anomalies in High Dimensional Data.

display_HDoutliers

Display outliers with a scatterplot

data_a

A dataset with an outlier

data_b

A bimodal dataset with a micro cluster

find_theshold

Find Outlier Threshold

ped_data

Dataset with pedestrian counts

data_e

A bimodal dataset with an inlier

data_f

A dataset with an outlier

data_c

A dataset with local anomalies and micro clusters
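
A minimal usage sketch for the functions listed above, assuming the package is installed and that `find_HDoutliers` can be called on a numeric vector with its default arguments. The simulated data and the variable names are assumptions made for illustration; the call is skipped when 'stray' is not available.

```r
# Illustrative sketch; the data below are simulated, not shipped with 'stray'.
stray_available <- requireNamespace("stray", quietly = TRUE)

if (stray_available) {
  set.seed(1234)
  # Two well-separated clusters with a single point placed between them.
  x <- c(rnorm(1000, mean = -6), 0, rnorm(1000, mean = 6))

  # Run the detector with its default settings and inspect what it returns.
  out <- stray::find_HDoutliers(x)
  str(out)
}
```

The result can then be passed, together with the data, to `display_HDoutliers` to plot the flagged points; that step is left out here because it needs a graphics device.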