Artigo Acesso aberto

The Treatment of Ties in Some Nonparametric Tests

1955; Institute of Mathematical Statistics; Volume: 26; Issue: 3 Linguagem: Inglês

10.1214/aoms/1177728485

ISSN

2168-8990

Autores

Joseph Putter,

Tópico(s)

Statistical Mechanics and Entropy

Resumo

distributions, and that tied observations can therefore be ignored or treated in any convenient way, without affecting the performance characteristic of the test. In practice, however, this assumption is not a realistic one, and the distributionis involved are in general to be regarded as discontinuous, either because of intrinsic reasons (integer-valued or otherwise discrete random variables) or because of limitations on the precision of measurements. Therefore, usually, ties will occur with positive probability, and the way they are treated does affect the performance characteristic of the test. The problem of ties has therefore to be considered, in particular with a view to preserving the nonparametric character of the test, and to making sure of setting it up on the desired level of significance. The usual practice in attacking the problem has been to consider the conditional distributions of the statistics concerned given that the number of observations in each tied group is a fixed constant. This, however, was never explicitly made clear, and these conditional distributions, as well as their variances and other characteristics, are referred to as distributions (or variances, etc.) when ties are present. In this category belong Kendall's work on ties in rank correlation theory, and Kruskal's theorem concerning a generalized Wilcoxon test (see Section 8). In this paper, we attack the problem from the standpoint of the ties being random variables. Our main concern is the comparison between the randomized and the way of treating the ties. In Sections 3 and 4 we consider the one-sided sign test, and show that randomization reduces both the exact power and the asymptotic efficiency of the test. In Sections 5-8 we consider the Wilcoxon test. For small samples the nonrandomized treatment of ties presents practical difficulties, but the asymptotic (large sample) problem can be handled. Again, it is shown that randomization results in reduced efficiency. 2. Notation and theorems used. We shall use the notation DZ(a, b) for normal random variables (with mean a and variance b), and (3(n, p) for binomials. The

Referência(s)