<p><b>Person re-identification is the problem of associating observations of targets in different non-overlapping cameras.</b> Most of the existing learning-based methods have resulted in improved performance on standard re-identification benchmarks, but at the cost of time-consuming and tediously l
Person Re-identification With Limited Supervision (Synthesis Lectures on Computer Vision)
โ Scribed by Rameswar Panda, Amit K. Roy-chowdhury
- Publisher
- Morgan & Claypool
- Year
- 2021
- Tongue
- English
- Leaves
- 100
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
Person re-identification is the problem of associating observations of targets in different non-overlapping cameras. Most of the existing learning-based methods have resulted in improved performance on standard re-identification benchmarks, but at the cost of time-consuming and tediously labeled data. Motivated by this, learning person re-identification models with limited to no supervision has drawn a great deal of attention in recent years.
In this book, we provide an overview of some of the literature in person re-identification, and then move on to focus on some specific problems in the context of person re-identification with limited supervision in multi-camera environments. We expect this to lead to interesting problems for researchers to consider in the future, beyond the conventional fully supervised setup that has been the framework for a lot of work in person re-identification.
Chapter 1 starts with an overview of the problems in person re-identification and the major research directions. We provide an overview of the prior works that align most closely with the limited supervision theme of this book. Chapter 2 demonstrates how global camera network constraints in the form of consistency can be utilized for improving the accuracy of camera pair-wise person re-identification models and also selecting a minimal subset of image pairs for labeling without compromising accuracy. Chapter 3 presents two methods that hold the potential for developing highly scalable systems for video person re-identification with limited supervision. In the one-shot setting where only one tracklet per identity is labeled, the objective is to utilize this small labeled set along with a larger unlabeled set of tracklets to obtain a re-identification model. Another setting is completely unsupervised without requiring any identity labels. The temporal consistency in the videos allows us to infer about matching objects across the cameras with higher confidence, even with limited to no supervision. Chapter 4 investigates person re-identification in dynamic camera networks. Specifically, we consider a novel problem that has received very little attention in the community but is critically important for many applications where a new camera is added to an existing group observing a set of targets. We propose two possible solutions for on-boarding new camera(s) dynamically to an existing network using transfer learning with limited additional supervision. Finally, Chapter 5 concludes the book by highlighting the major directions for future research.
โฆ Table of Contents
Preface
Person Re-identification: An Overview
Introduction
An Overview of Related Work
Supervised Person Re-identification
Unsupervised Person Re-identification
Semi-Supervised and One-Shot Person Re-identification
Active Learning for Person Re-identification
Open World Person Re-identification
Organization of the Book
Supervised Re-identification: Optimizing the Annotation Effort
Network Consistent Re-identification
Estimating Globally Consistent Associations
Examples of Globally Consistent Re-identification
Optimal Subset Selection for Annotation
Estimating the Optimal Subset
Example Results on Optimal Labeling Set
Conclusion
Towards Unsupervised Person Re-identification
Temporal Coherence for Self-Supervised, One-Shot Video Re-identification
Framework for One-Shot Video Re-identification
Example Results for One-Shot Video Re-identification
Global Network Constraints for Unsupervised Video Re-identification
Unsupervised Video Re-identification
Example Results on Unsupervised Video Re-identification
Conclusion
Re-identification in Dynamic Camera Networks
On-boarding New Cameras: Transferring from the Best Source Camera
Transferring from the Best Source Camera
Example Results on Camera On-Boarding
On-Boarding New Cameras Without Access to Source Camera Data
Hypothesis Transfer Learning for On-Boarding New Cameras
Example Results on Camera On-Boarding Without Source Data
Conclusion
Future Research Directions
Knowledge Transfer Across Networks
Learning in Mobile Camera Networks
Human-in-the-Loop Re-identification
Adversarial Robust Re-identification
Efficient Model Deployment
What Does the Future Hold?
Bibliography
Authors' Biographies
Blank Page
๐ SIMILAR VOLUMES
Visual pattern analysis is a fundamental tool in mining data for knowledge. Computational representations for patterns and texture allow us to summarize, store, compare, and label in order to learn about the physical world. Our ability to capture visual imagery with cameras and sensors has resulted
<p>Background subtraction is a widely used concept for detection of moving objects in videos. In the last two decades there has been a lot of development in designing algorithms for background subtraction, as well as wide use of these algorithms in various important applications, such as visual surv
<p><span>In the human quest for scientific knowledge, empirical evidence is collected by visual perception. Tracking with computer vision takes on the important role to reveal complex patterns of motion that exist in the world we live in. Multi-object tracking algorithms provide new information on h
<p>The first book of its kind dedicated to the challenge of person re-identification, this text provides an in-depth, multidisciplinary discussion of recent developments and state-of-the-art methods. Features: introduces examples of robust feature representations, reviews salient feature weighting a
"Demand continues to grow worldwide, from both government and commerce, for technologies capable of automatically selecting and identifying object and human behaviour. This accessible text/reference presents a comprehensive and unified treatment of visual analysis of behaviour from computational-mod