R Code: Outlier Function

I needed a quick and simple way to identify outliers, so I made a function for R that will identify outliers in a vector. Obviously, outliers can be identified in many ways.  This is a simple >1.5 x the IQR approach. The function could easily be modified to identify extreme outliers by changing the IQR multiplication factor to 3.0 from 1.5, for example.

# Make the function to identify extreme outliers
outlier.f <- function(x){
  low=as.numeric(quantile(x)[2] - IQR(x)*1.5)
  high=as.numeric(IQR(x)*1.5 + quantile(x)[4])
  list(lower.limit=low, upper.limit=high,
    lower=which(x<low), upper=which(x>high))

This returns a list with four entries: the lower and upper limits (data values) for outliers and the position number of those values above of below the limits. Short working example:

# Make a vector with some obvious outliers
foo <- c(rnorm(80),rnorm(10,mean=-5),rnorm(10,mean=5))
# Identify outliers

[1] -3.718559
[1] 3.549603
[1] 81 82 83 84 85 86 88 89 90
[1] 91 92 93 94 95 96 97 98 99 100
This entry was posted in R and tagged , . Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s