<p>The ubiquitous challenge of learning and decision-making from rank data arises in situations where intelligent systems collect preference and behavior data from humans, learn from the data, and then use the data to help humans make efficient, effective, and timely decisions. Often, such data are
Learning and Decision-Making from Rank Data
✍ Scribed by Lirong Xia
- Publisher
- Springer
- Year
- 2019
- Tongue
- English
- Leaves
- 151
- Series
- Synthesis Lectures on Artificial Intelligence and Machine Learning
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
✦ Synopsis
The ubiquitous challenge of learning and decision-making from rank data arises in situations where intelligent systems collect preference and behavior data from humans, learn from the data, and then use the data to help humans make efficient, effective, and timely decisions. Often, such data are represented by rankings.
This book surveys some recent progress toward addressing the challenge from the considerations of statistics, computation, and socio-economics. We will cover classical statistical models for rank data, including random utility models, distance-based models, and mixture models. We will discuss and compare classical and state-of-the-art algorithms, such as algorithms based on Minorize-Majorization (MM), Expectation-Maximization (EM), Generalized Method-of-Moments (GMM), rank breaking, and tensor decomposition. We will also introduce principled Bayesian preference elicitation frameworks for collecting rank data. Finally, we will examine socio-economic aspects of statistically desirable decision-making mechanisms, such as Bayesian estimators.
This book can be useful in three ways: (1) for theoreticians in statistics and machine learning to better understand the considerations and caveats of learning from rank data, compared to learning from other types of data, especially cardinal data; (2) for practitioners to apply algorithms covered by the book for sampling, learning, and aggregation; and (3) as a textbook for graduate students or advanced undergraduate students to learn about the field.
This book requires that the reader has basic knowledge in probability, statistics, and algorithms. Knowledge in social choice would also help but is not required.
✦ Table of Contents
Cover
Copyright Page
Title Page
Contents
Preface
Acknowledgments
Introduction
The Research Problem
Overview of the Book
Statistical Models for Rank Data
Basics of Statistical Modeling
Modeling Partial Orders as Events
Random Utility Models
The Plackett–Luce Model
Properties of Random Utility Models (RUMs)
Sampling from Random Utility Models
Connection to Discrete Choice Models
Distance-Based Models
Mallows' Model
Repeated Insertion Model: Efficient Sampling from Mallows
Condorcet's Model
Datasets and Model Fitness
Bibliographical Notes
Parameter Estimation Algorithms
Algorithms for the Plackett–Luce Model
The Minorize-Maximization (MM) Algorithm
The Luce Spectral Ranking (LSR) Algorithm
Generalized Method-of-Moments (GMM) Algorithm
Algorithms for General Random Utility Models
The Expectation-Maximization (EM) Algorithm
EM for RUMs: Monte Carlo E-Step by Gibbs Sampling
EM for RUMs: M-Step
GMM for RUMs with Location Families
Algorithms for Distance-Based Models
Bibliographical Notes
The Rank-Breaking Framework
Rank-Breaking for Random Utility Models
Breaking + GMM for Plackett–Luce
Uniqueness of Outcome of Algorithm 4.11
Characterization of Consistent Breakings for Plackett–Luce
Computational and Statistical Efficiency of Algorithms for Plackett–Luce
Rank-Breaking for General Random Utility Models with Location Families
Rank-Breaking + Composite Marginal Likelihood (RBCML)
Weighted Breakings
Composite Marginal Likelihood Methods (CML)
The RBCML Framework
Consistency and Asymptotic Normality of RBCML
RBCML for Plackett–Luce
RBCML for RUMs with Location Families
The Adaptive RBCML Algorithm
Experiments
Bibliographical Notes
Mixture Models for Rank Data
Mixture Models
Identifiability of Mixture Models
An EM Algorithm for Learning Mixture Models
Learning Mixtures of Plackett–Luce
Algorithms for Mixtures of Plackett–Luce
Learning Mixtures of General RUMs with Location Families
Learning Mixtures of Mallows
Bibliographical Notes
Bayesian Preference Elicitation
The Bayesian Preference Elicitation Problem
Plackett–Luce Model with Features
Computing Expected Information Gain
Bayesian Preference Elicitation for Personal Choice
Information Criteria
Approximation Techniques for Personal Choice
Bayesian Preference Elicitation for Social Choice
Approximating Posterior Distributions
Ranked-Top- k Questions
Social Choice by Randomized Voting
Experimental Results
Estimating the Cost Function
Comparing Information Criteria
Bibliographical Notes
Socially Desirable Group Decision-Making from Rank Data
Statistical Decision-Theoretic Framework
Measuring Decision Mechanisms: Bayesian Loss and Frequentist Loss
Socio-Economic Criteria: Social Choice Axioms
Minimax Estimators in Neutral Frameworks
Socially Desirable Bayesian Estimators
An Impossibility Theorem on Strict Condorcet Criterion
Satisfiability of Other Axioms
An Automated Design Framework
Data Generation
Hypothesis Space
Optimization
Bibliographical Notes
Future Directions
Bibliography
Author's Biography
📜 SIMILAR VOLUMES
<p><strong>Connect data and instruction to improve practice</strong></p><p>Gathering data and using it to inform instruction is a requirement for many schools, yet educators are not necessarily formally trained in how to do it. This book helps bridge the gap between classroom practice and the princi
<p><span>Data is the indispensable fuel that drives the decision making of everything from governments, to major corporations, to sports teams. Its value is almost beyond measure. But what if that data is either unavailable or problematic to access? That’s where synthetic data comes in. This book wi
<p><span>Data is the indispensable fuel that drives the decision making of everything from governments, to major corporations, to sports teams. Its value is almost beyond measure. But what if that data is either unavailable or problematic to access? That’s where synthetic data comes in. This book wi
<p>Christoph Samitsch investigates whether decision-making efficiency is being influenced by the quality of data and information. Results of the research provide evidence that defined data quality dimensions have an effect on decision-making performance as well as the time it takes to make a decisi