𝔖 Bobbio Scriptorium

✦ LIBER ✦

[ACM Press the 23rd international conference - Pittsburgh, Pennsylvania (2006.06.25-2006.06.29)] Proceedings of the 23rd international conference on Machine learning - ICML '06 - The relationship between Precision-Recall and ROC curves

✍ Scribed by Davis, Jesse; Goadrich, Mark

Book ID: 118003254
Publisher: ACM Press
Year: 2006
Weight: 170 KB
Volume: 0
Category: Article
ISBN-13: 9781595933836
DOI: 10.1145/1143844.1143874

✦ Synopsis

Receiver Operator Characteristic (ROC) curves are commonly used to present results for binary decision problems in machine learning. However, when dealing with highly skewed datasets, Precision-Recall (PR) curves give a more informative picture of an algorithm's performance. We show that a deep connection exists between ROC space and PR space, such that a curve dominates in ROC space if and only if it dominates in PR space. A corollary is the notion of an achievable PR curve, which has properties much like the convex hull in ROC space; we show an efficient algorithm for computing this curve. Finally, we note that the differences between the two types of curves are significant for algorithm design. For example, in PR space it is incorrect to linearly interpolate between points. Furthermore, algorithms that optimize the area under the ROC curve are not guaranteed to optimize the area under the PR curve.
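The claim about interpolation can be illustrated with a minimal sketch. It is not taken from this record: it assumes the usual definitions recall = TP / (total positives) and precision = TP / (TP + FP), and it assumes that moving between two operating points means moving linearly in the (TP, FP) counts, as one would along a ROC segment. The function names and the example counts are hypothetical.

```python
def pr_point(tp, fp, total_pos):
    """Map confusion-matrix counts to a (recall, precision) pair."""
    recall = tp / total_pos
    # Convention (an assumption): precision is 1.0 when nothing is predicted positive.
    precision = tp / (tp + fp) if (tp + fp) > 0 else 1.0
    return recall, precision


def interpolate_pr(tp_a, fp_a, tp_b, fp_b, total_pos):
    """Interpolate between operating points A and B.

    The walk is linear in (TP, FP) counts; each step is then mapped
    into PR space, where the resulting path is generally not a straight line.
    """
    if tp_b <= tp_a:
        raise ValueError("expected tp_b > tp_a")
    fp_per_tp = (fp_b - fp_a) / (tp_b - tp_a)  # false positives gained per true positive
    points = []
    for step in range(tp_b - tp_a + 1):
        tp = tp_a + step
        fp = fp_a + fp_per_tp * step
        points.append(pr_point(tp, fp, total_pos))
    return points


# Illustrative counts: 20 positives, A = (TP=5, FP=5), B = (TP=10, FP=30).
curve = interpolate_pr(5, 5, 10, 30, total_pos=20)

# Naive straight-line estimate of precision at the recall of the second point.
(r_a, p_a), (r_b, p_b) = curve[0], curve[-1]
r_1, p_1 = curve[1]
naive_p_1 = p_a + (p_b - p_a) * (r_1 - r_a) / (r_b - r_a)

print("count-based precision:", p_1)
print("straight-line precision:", naive_p_1)
```

With these counts, the straight-line estimate in PR space comes out higher than the precision obtained from the interpolated counts, so linear interpolation would overstate performance between the two points.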

📜 SIMILAR VOLUMES

[ACM Press the 23rd international conference - Pittsburgh, Pennsylvania (2006.06.25-2006.06.29)] Proceedings of the 23rd international conference on Machine learning - ICML '06 - The relationship between Precision-Recall and ROC curves

✍ Davis, Jesse; Goadrich, Mark 📂 Article 📅 2006 🏛 ACM Press ⚖ 170 KB

[ACM Press the 23rd international conference - Pittsburgh, Pennsylvania (2006.06.25-2006.06.29)] Proceedings of the 23rd international conference on Machine learning - ICML '06 - Dynamic topic models

✍ Blei, David M.; Lafferty, John D. 📂 Article 📅 2006 🏛 ACM Press ⚖ 406 KB

[ACM Press the 23rd international conference - Pittsburgh, Pennsylvania (2006.06.25-2006.06.29)] Proceedings of the 23rd international conference on Machine learning - ICML '06 - Robust probabilistic projections

✍ Archambeau, Cédric; Delannay, Nicolas; Verleysen, Michel 📂 Article 📅 2006 🏛 ACM Press ⚖ 264 KB

Principal components and canonical correlations are at the root of many exploratory data mining techniques and provide standard pre-processing tools in machine learning. Lately, probabilistic reformulations of these methods have been proposed. They are based on a Gaussian density model and are ther

[ACM Press the 23rd international conference - Pittsburgh, Pennsylvania (2006.06.25-2006.06.29)] Proceedings of the 23rd international conference on Machine learning - ICML '06 - An empirical comparison of supervised learning algorithms

✍ Caruana, Rich; Niculescu-Mizil, Alexandru 📂 Article 📅 2006 🏛 ACM Press ⚖ 157 KB

A number of supervised learning methods have been introduced in the last decade. Unfortunately, the last comprehensive empirical evaluation of supervised learning was the Statlog Project in the early 90's. We present a large-scale empirical comparison between ten supervised learning methods: SVMs, n