Tutorial: Introduction to Robust Statistics with R |
Classical statistics relies on parametric models. Typically, assumptions are made on the structural and the stochastic parts of the model and optimal procedures are derived under these assumptions. Standard examples are least squares estimators in linear models and their extensions, maximum likelihood estimators, and the corresponding likelihood-based tests.
Robust statistics deals with deviations from the stochastic assumptions and their dangers for classical estimators and tests and develops statistical procedures which are still reliable and reasonably efficient in the presence of such deviations. It can be viewed as a statistical theory dealing with approximate parametric models by providing a reasonable compromise between the rigidity of a strict parametric approach and the potential difficulties of interpretation of a fully nonparametric analysis.
This tutorial will give a brief introduction to robust statistics by reviewing some basic general concepts and tools and by showing how they can be used in data analysis, especially in the framework of linear and generalized linear models, to provide an alternative complementary statistical analysis with additional useful information.
We will use R functions mainly from packages 'MASS' and CRAN packages 'robustbase' and 'robust'. There will be handouts and R scripts available to the course participants.