
forestFloor (version 1.5)

box.outliers: Box Outliers

Description

Squeeze all outliers onto standard-deviation limits and/or normalize to the [0;1] scale.

Usage

box.outliers(x, limit = 1.5, normalize = TRUE)

Arguments

x
numeric vector, matrix, array, data.frame
limit
limit in standard deviations (SD); any value deviating more than limit SDs from the mean is treated as an outlier
normalize
TRUE/FALSE; should the output range be normalized to [0;1]?

Value

  • matrix(n x p) of normalized values

Details

Can be used to box high-dimensional data, hence the name box.outliers. box.outliers is used internally in the forestFloor package to compute colour gradients without assigning unique colours to a few outliers. It is a box because the borders are univariate/non-interacting: each variable is limited on its own, independently of the others.
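
The effect on colour gradients can be illustrated with a minimal sketch. The helper `box_sketch` below is hypothetical and written only for this illustration; it is not the package's implementation and it ignores the `normalize` argument. It compares a gradient computed from boxed values with one computed from a plain min-max rescaling when a single extreme outlier is present.

```r
# Hypothetical helper for illustration only (not forestFloor's code):
# standardise, clamp at +/- limit SD, then map [-limit, limit] onto [0, 1].
box_sketch <- function(x, limit = 1.5) {
  z <- scale(x)              # centre and scale each column
  z[z >  limit] <-  limit    # squeeze high outliers onto the upper border
  z[z < -limit] <- -limit    # squeeze low outliers onto the lower border
  (z + limit) / (2 * limit)
}

set.seed(1)
x <- c(rnorm(200), 25)                      # 200 ordinary values, one extreme outlier

boxed <- as.numeric(box_sketch(x))          # outlier sits on the border
naive <- (x - min(x)) / (max(x) - min(x))   # plain min-max rescaling

# Blue-to-red gradients; both inputs lie in [0, 1]
cols_boxed <- rgb(boxed, 0, 1 - boxed)
cols_naive <- rgb(naive, 0, 1 - naive)

# Share of the gradient used by the 200 ordinary observations
spread_boxed <- diff(range(boxed[1:200]))
spread_naive <- diff(range(naive[1:200]))
```

In this sketch the min-max version spends most of the gradient on the gap between the ordinary values and the outlier, so `spread_naive` is small, whereas the boxed version leaves most of the gradient to the ordinary observations.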

See Also

scale()

Examples

#Example:
box.outliers = function (x, limit = 1.5)  {
  x = scale(x)
  x[ x > limit] =  limit
  x[-x > limit] = -limit
  x = x - min(x)
  x = x/(limit * 2)
  return(x)
}
n=1000 #some observations
p = 5  #some dimensions
X = data.frame(replicate(p,rnorm(n))) # a dataset
Xboxed = box.outliers(X, limit = 1.5) # applying normalization
plot(Xboxed[,1],Xboxed[,2],col="#00000088") #plot output for first two dimensions
