This is a data set consisting of measurements of water salinity (i.e., its salt
concentration) and river discharge taken in North Carolina's
Pamlico Sound; This dataset was listed by Ruppert and Carroll
(1980). In Carrol and Ruppert (1985) the physical background of
the data is described. They indicated that observations 5 and 16
correspond to periods of very heavy discharge and showed that the
discrepant observation 5 was masked by observations 3 and 16 i.e.
only after deletion of these observations it was possible to
identify the influential observation 5.
This data set is a prime example of the masking effect.
Usage
data(salinity)
Arguments
source
P. J. Rousseeuw and A. M. Leroy (1987)
Robust Regression and Outlier Detection. Wiley, p.82, table 5.