<span>Design and develop end-to-end, production-grade computer vision projects for real-world industry problems. This book discusses computer vision algorithms and their applications using PyTorch.<br>The book begins with the fundamentals of computer vision: convolutional neural nets, RESNET, YOLO,
Computer Vision Projects with PyTorch: Design and Develop Production-Grade Models
ā Scribed by Akshay Kulkarni, Adarsha Shivananda, Nitin Ranjan Sharma
- Publisher
- Apress
- Year
- 2022
- Tongue
- English
- Leaves
- 355
- Category
- Library
No coin nor oath required. For personal study only.
⦠Synopsis
Design and develop end-to-end, production-grade computer vision projects for real-world industry problems. This book discusses computer vision algorithms and their applications using PyTorch.
The book begins with the fundamentals of computer vision: convolutional neural nets, RESNET, YOLO, data augmentation, and other regularization techniques used in the industry. And then it gives you a quick overview of the PyTorch libraries used in the book. After that, it takes you through the implementation of image classification problems, object detection techniques, and transfer learning while training and running inference. The book covers image segmentation and an anomaly detection model. And it discusses the fundamentals of video processing for computer vision tasks putting images into videos. The book concludes with an explanation of the complete model building process for deep learning frameworks using optimized techniques with highlights on model AI explainability.
After reading this book, you will be able to build your own computer vision projects using transfer learning and PyTorch.
What You Will Learn
- Solve problems in computer vision with PyTorch.
- Implement transfer learning and perform image classification, object detection, image segmentation, and other computer vision applications
- Design and develop production-grade computer vision projects for real-world industry problems
- Interpret computer vision models and solve business problems
Who This Book Is For
Data scientists and machine learning engineers interested in building computer vision projects and solving business problems
⦠Table of Contents
Table of Contents
About the Authors
About the Technical Reviewer
Introduction
Chapter 1: The Building Blocks ofĀ Computer Vision
What Is Computer Vision
Applications
Classification
Object Detection andĀ Localization
Image Segmentation
Anomaly Detection
Video Analysis
Channels
Convolutional Neural Networks
Receptive Field
Local Receptive Field
Global Receptive Field
Pooling
Max Pooling
Average Pooling
Global Average Pooling
Calculation: Feature Map andĀ Receptive Fields
Kernel
Stride
Pooling
Padding
Input andĀ Output
Calculation ofĀ Receptive Field
Understanding theĀ CNN Architecture Type
Understanding Types ofĀ Architecture
AlexNet
VGG
ResNet
Inception Architectures
Working withĀ Deep Learning Model Techniques
Batch Normalization
Dropouts
Data Augmentation Techniques
Introduction toĀ PyTorch
Installation
Basic Start
Summary
Chapter 2: Image Classification
Topics toĀ Cover
Defining theĀ Problem
Overview ofĀ theĀ Approach
Creating anĀ Image Classification Pipeline
First Basic Model
Data
Data Exploration
Data Loader
Define theĀ Model
The Training Process
The Second Variation ofĀ Model
The Third Variation ofĀ theĀ Model
The Fourth Variation ofĀ theĀ Model
Summary
Chapter 3: Building anĀ Object Detection Model
Object Detection Using Boosted Cascade
R-CNN
The Region Proposal Network
Fast Region-Based Convolutional Neural Network
How theĀ Region Proposal Network Works
The Anchor Generation Layer
The Region Proposal Layer
Mask R-CNN
Prerequisites
YOLO
YOLO V2/V3
Project Code Snippets
Step 1: Getting Annotated Data
Step 2: Fixing theĀ Configuration File andĀ Training
The Model File
Summary
Chapter 4: Building anĀ Image Segmentation Model
Image Segmentation
Pretrained Support fromĀ PyTorch
Semantic Segmentation
Instance Segmentation
Fine-Tuning theĀ Model
Summary
Chapter 5: Image-Based Search andĀ Recommendation System
Problem Statement
Approach andĀ Methodology
Implementation
The Dataset
Installing andĀ Importing Libraries
Importing andĀ Understanding theĀ Data
Feature Engineering
ResNet18
Calculating Similarity andĀ Ranking
Visualizing theĀ Recommendations
Taking Image Input fromĀ Users andĀ Recommending Similar Products
Summary
Chapter 6: Pose Estimation
Top-Down Approach
Bottom-Up Approach
OpenPose
Branch-1
Branch-2
HRNet (High-Resolution Net)
Higher HRNet
PoseNet
How Does PoseNet Work?
Single Person Pose Estimation
Multi-Person Pose Estimation
Pros andĀ Cons ofĀ PoseNet
Applications ofĀ Pose Estimation
Test Cases Performed Retail Store Videos
Implementation
Step 1: Identify theĀ List ofĀ Human Keypoints toĀ Track
Step 2: Identify theĀ Possible Connections Between theĀ Keypoints
Step 3: Load theĀ Pretrained Model fromĀ theĀ PyTorch Library
Step 4: Input Image Preprocessing andĀ Modeling
Step 5: Build Custom Functions toĀ Plot theĀ Output
Step 6: Plot theĀ Output onĀ theĀ Input Image
Summary
Chapter 7: Image Anomaly Detection
Anomaly Detection
Approach 1: Using aĀ Pretrained Classification Model
Step 1: Import theĀ Required Libraries
Step 2: Create theĀ Seed and Deterministic Functions
Step 3: Set theĀ Hyperparameter
Step 4: Import theĀ Dataset
Step 5: Image Preprocessing Stage
Step 6: Load theĀ Pretrained Model
Step 7: Freeze theĀ Model
Step 8: Train theĀ Model
Step 9: Evaluate theĀ Model
Approach 2: Using Autoencoder
Step 1: Prepare theĀ Dataset Object
Step 2: Build theĀ Autoencoder Network
Step 3: Train theĀ Autoencoder Network
Step 4: Calculate theĀ Reconstruction Loss Based onĀ theĀ Original Data
Step 5: Select theĀ Most Anomalous Digit Based onĀ theĀ Error Metric Score
Output
Summary
Chapter 8: Image Super-Resolution
Up-Scaling Using theĀ Nearest Neighbor Concept
Understanding Bilinear Up-Scaling
Variational Autoencoders
Generative Adversarial Networks
The Model Code
Model Development
Imports
Running theĀ Application
Summary
Chapter 9: Video Analytics
Problem Statement
Approach
Implementation
Data
Uploading theĀ Required Videos toĀ Google Colab
Convert theĀ Video toĀ aĀ Series ofĀ Images
Image Extraction
Data Preparation
Identify theĀ Hotspots inĀ aĀ Retail Store
Importing Images
Getting Crowd Counts
Security andĀ Surveillance
Identify theĀ Demographics (Age andĀ Gender)
Summary
Chapter 10: Explainable AI forĀ Computer Vision
Grad-CAM
Grad-CAM++
NBDT
Step 1
Step 2
Steps 3 andĀ 4
Grad-CAM andĀ Grad-CAM++ Implementation
Grad-CAM andĀ Grad-CAM++ Implementation onĀ aĀ Single Image
NBDT Implementation onĀ aĀ Single Image
Summary
Index
š SIMILAR VOLUMES
<span>Design and develop end-to-end, production-grade computer vision projects for real-world industry problems. This book discusses computer vision algorithms and their applications using PyTorch.<br>The book begins with the fundamentals of computer vision: convolutional neural nets, RESNET, YOLO,
The use of computers has opened up remarkable opportunities for innovative design, improved productivity, and greater efficiency in the use of materials. Uniquely, this book focuses on the practical use of computers for clothing pattern design and product development. Readers are introduced to the
With this hands-on guide to 3D deep learning, developers working with 3D computer vision will be able to put their knowledge to work and get up and running in no time. Complete with step-by-step explanations of essential concepts and practical examples, this book lets you explore and gain a thoro
<p><span>Visualize and build deep learning models with 3D data using PyTorch3D and other Python frameworks to conquer real-world application challenges with ease</span></p><p><span><br></span></p><p><span>Key Features: </span></p><ul><li><span><span>Understand 3D data processing with rendering, PyTo