✦ LIBER ✦

An exact test of the accuracy of binary classification models based on the probability distribution of the average rank

✍ Scribed by Jerrold H. May; Luis G. Vargas

Publisher: Elsevier Science
Year: 2009
Tongue: English
Weight: 843 KB
Volume: 50
Category: Article
ISSN: 0895-7177
DOI: 10.1016/j.mcm.2009.04.002

No coin nor oath required. For personal study only.

✦ Synopsis

We propose a new way to evaluate the discriminatory power of models that generate a continuous value as the basis for performing a binary classification task. Our hypothesis test uses the average rank of the k successes in the sample of size n, based on those continuous values. We derive the probability mass function for the average rank from the coefficients of a Gaussian polynomial distribution that results from randomly sampling k distinct positive integers, all n or less. The significance level of the test is found by counting the number of arrangements that produce average ranks more extreme than the one observed. Recursive relationships can be used to calculate the values necessary to compute the p-value. For large values of k and n, for which exact computation might be prohibitive, we present numerical results which indicate that the critical values of the distribution are nearly linear in n for a fixed k and that the coefficients of the linear relationships are nonlinear functions of k and the desired percentile. We develop regression models for those relationships to approximate the number of arrangements in order to make the test practical for large values of k and n.

📜 SIMILAR VOLUMES

An exact probability distribution on the

An exact probability distribution on the connectivity of random graphs

✍ Robert F Ling 📂 Article 📅 1975 🏛 Elsevier Science 🌐 English ⚖ 429 KB

Evaluation of pattern classifiers — Test

Evaluation of pattern classifiers — Testing the significance of classification efficiency using an exact probability technique

✍ Edgard Nyssen 📂 Article 📅 1996 🏛 Elsevier Science 🌐 English ⚖ 328 KB

Tests based on L-statistics to test the

Tests based on L-statistics to test the equality in dispersion of two probability distributions

✍ Javier Rojo; Jinping Wang 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 461 KB

Hypothesis testing based on goodness-of-

Hypothesis testing based on goodness-of-fit in the moving average time series model

✍ Charles R. Nelson; Gary S. Shea 📂 Article 📅 1979 🏛 Elsevier Science 🌐 English ⚖ 334 KB

An image compression method based on mul

An image compression method based on multiple models for the probabilities of patterns

✍ Yung-Kuan Chan; Ching-Lin Wang 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 456 KB

## Abstract This article proposes an image compression method based on multiple models for the probabilities of patterns (MMPP method) to encode a gray‐level image __f__. First, the MMPP method employs a median edge detector (MED) to reduce the entropy of __f__. The intensities of two adjacent pixe

Mathematical modelling of an infilled RC

Mathematical modelling of an infilled RC frame structure based on the results of pseudo-dynamic tests

✍ Matjaž Dolšek; Peter Fajfar 📂 Article 📅 2002 🏛 John Wiley and Sons 🌐 English ⚖ 262 KB