Publicações

Evandt, O., Coleman, S.Y., Ramalhoto, M.F. e van Lottum, C. (2004), “A Little Known Robust Estimator of the Correlation Coefficient and Its Use in a Robust Graphical Test for Bivariate Normality with Applications in the Aluminium Industry”, Quality and Reliability Engineering International, Vol. 20, Issue 5, pp. 433-546

Industrial and business data often contain outliers. The reasons why outliers occur can be unclear procedures for production tasks or measurement, operators who do not follow procedures, failures in production equipment or measurement equipment, the wrong type of raw material, failure in raw material, registration errors or the fact that the response is influenced by many other factors as well as the available explanatory variables. Often there is no identifiable cause for the outliers and they are considered to be an intrinsic part of the dataset. Since data are often considered pairwise, and more methods for analysing pairwise data are available if the data-generating process can be modelled by a bivariate normal distribution, there is a need for a straightforward test of bivariate normality that is robust against outliers. This paper looks at a graphical test, based on probability plotting, for assessing whether it is reasonable to assume that a bivariate dataset stems from an approximately bivariate normal distribution, where the possibility for outliers is taken into account. The robust graphical (Robug) test uses a little-known estimator of the correlation coefficient, which is demonstrated to be robust against outliers. The graphical test is illustrated using data from our practical work. First the little-known robust estimator of the correlation parameter in the bivariate normal distribution is compared with the traditional estimator, the product moment correlation coefficient, often called Pearson's r, and Spearman's rank correlation coefficient and Kendall's tau. The little-known estimator is a transformation of Kendall's tau. The comparison is partly based on theory, and partly on the simulation of observations from the bivariate normal distribution. Our conclusions are that when outliers are not an issue, Pearson's r, Spearman's coefficient and the transformation of Kendall's tau do not perform very differently in terms of bias, standard deviation and root mean square error, while Kendall's tau is too biased to be used for the purpose in question. Concerning robustness to outliers, Pearson's r is inferior to the other estimators. It seems likely that the transformation of Kendall's tau, which is far less well-known than Pearson's r and Spearman's rank correlation coefficient, is at least as good as Spearman's coefficient when the possibility of outliers must be taken into consideration.

Pode descarregar esta publicacao de: http://onlinelibrary.wiley.com/doi/10.1002/qre.658/pdf

Se nao conseguiu obter uma copia desta publicação: Peça uma cópia desta publicação



Para informação sobre todas as publicações do CENTEC pode descarregar: Lista Completa de Publicações do CENTEC