Ramaswamy et al. proposed the k-nearest neighbors outlier detection method (kNNo).
Each point's anomaly score is the distance to its kth nearest neighbor in the
data set. Then, all points are ranked based on this distance. The higher an example's
score is, the more anomalous it is.
Usage
do_knno(data, k, top_n)
Arguments
data
Data observations.
k
Number of neighbors of a point that we are interested in.
top_n
Total number of outliers we are interested in.
Value
Vector of outliers.
References
Ramaswamy, S., Rastogi, R. and Shim, K.
Efficient Algorithms for Mining Outliers from Large Data Sets.
SIGMOD'00 Proceedings of the 2000 ACM SIGMOD international conference
on Management of data, 2000, 427-438.