š”– Scriptorium
✦   LIBER   ✦

šŸ“

Natural Language Processing Projects: Build Next-Generation NLP Applications Using AI Techniques

āœ Scribed by Akshay Kulkarni, Adarsha Shivananda, Anoosh Kulkarni


Publisher
Apress
Year
2021
Tongue
English
Leaves
327
Edition
1
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Leverage machine learning and deep learning techniques to build fully-fledged natural language processing (NLP) projects. Projects throughout this book grow in complexity and showcase methodologies, optimizing tips, and tricks to solve various business problems. You will use modern Python libraries and algorithms to build end-to-end NLP projects.Ā 

The book starts with an overview of natural language processing (NLP) and artificial intelligence to provide a quick refresher on algorithms. Next, it covers end-to-end NLP projects beginning with traditional algorithms and projects such as customer review sentiment and emotion detection, topic modeling, and document clustering. From there, it delves into e-commerce related projects such as product categorization using the description of the product, a search engine to retrieve the relevant content, and a content-based recommendation system to enhance user experience. Moving forward, it explains how to build systems to find similar sentences using contextualĀ embedding, summarizing huge documents using recurrent neural networks (RNN), automatic word suggestion using long short-term memory networks (LSTM), and how to build a chatbot using transfer learning. It concludes with an exploration of next-generation AI and algorithms in the research space.Ā 


By the end of this book, you will have the knowledge needed to solve various business problems using NLP techniques.



What You Will Learn

  • Implement full-fledged intelligent NLP applications with Python
  • Translate real-world business problem on text data with NLP techniques
  • Leverage machine learning and deep learning techniques to perform smart language processing
  • Gain hands-on experience implementing end-to-end search engine information retrieval, text summarization, chatbots, text generation, document clustering and product classification, and more


Who This Book Is For

Data scientists, machine learning engineers, and deep learning professionals looking to build natural language applications using Python

✦ Table of Contents


Table of Contents
About the Authors
About the Technical Reviewer
Acknowledgments
Chapter 1: Natural Language Processing andĀ Artificial Intelligence Overview
Machine Learning
Supervised Learning
Unsupervised Learning
Reinforcement Learning
Supervised Learning
Regression
Classification
Linear Regression
Logistic Regression
Classification
Random Forest
Support-Vector Machines
Nonlinear Data: TheĀ Kernel Trick
Neural Networks
Deep Neural Networks
Convolutional Neural Networks
RNN
Unsupervised Learning
Clustering
k-means Clustering
Hierarchical Clustering
Dimensionality Reduction
Reinforcement Learning
NLP Concepts
Unstructured Data
Natural Language Processing
Text Preprocessing
Text toĀ Features
One-Hot Encoding (OHE)
Count Vectorizer
Term Frequency–Inverse Document Frequency (TF-IDF)
Word Embeddings
Applications
Text Classification
Text Summarization
Text Generation
The AI Life Cycle
Understanding andĀ Defining theĀ Business Problem
Translating theĀ Business Problem into anĀ ML Problem andĀ Creating aĀ Solution Approach/Architecture Design
Data Collection andĀ Understanding
Data Cleaning andĀ Preprocessing
Exploratory Data Analysis
Feature Engineering andĀ Selection
Model Building, Tuning, andĀ Selection
Model Testing andĀ Validation
Prediction andĀ Insights
Deployment or Productionization andĀ Model Maintenance
Business Context
Chapter 2: Product360: Sentiment andĀ Emotion Detector
Problem Statement
Approach Formulation
Steps toĀ Solve This Problem
Building anĀ Emotion Classifier Model
Data forĀ Emotion Classifiers
Data Cleaning andĀ Preprocessing
Label Encoding
Train-Test Split
Feature Engineering
Model Building Phase
Multinomial Naive Bayes
Linear Classifier/Logistic Regression
Support-Vector Machine
Random Forest
Model Evaluation andĀ Comparison Summary
Confusion Matrix forĀ theĀ Selected Model
Real-time Data Extraction
Twitter API
Predicting theĀ Emotion
Predicting theĀ Sentiment
Visualization andĀ Insights
Emotion Analysis
Emotion andĀ Sentiment Analysis
Automated Reporting
Summary
Chapter 3: TED Talks Segmentation andĀ Topics Extraction Using Machine Learning
Problem Statement
Approach Formulation
Data Collection
Understanding theĀ Data
Data Cleaning andĀ Preprocessing
Feature Engineering
Count Vectors
TF-IDF Vectors
Word Embeddings
Model Building Phase
K-means Clustering
Elbow Method
Silhouette Coefficient
Count Vectors asĀ Features
Elbow Method
Silhouette
TF-IDF asĀ Features
Elbow Method
Silhouette
Word Embeddings asĀ Features
Elbow Method
Silhouette
Building Clustering Model
Cluster Visualization
Topic Modeling
Topic Modeling forĀ Cluster 1
Topic Modeling forĀ Cluster 0
Conclusion
Chapter 4: Enhancing E-commerce Using an Advanced Search Engine and Recommendation System
Problem Statement
Approach
Content-Based Filtering
Environment Setup
Understanding theĀ Data
Exploratory Data Analysis
Data Preprocessing
Text Preprocessing
Model Building
Content-based Recommendation System
Product Search Engine
Implementation
Advanced Search Engine Using PyTerrier andĀ Sentence-BERT
Data Preprocessing
Building theĀ Search Engine
Multilingual Search Engine Using Deep Text Search
Summary
Chapter 5: Creating a Résumé Parsing, Screening and Shortlisting System
Context
Methodology andĀ Approach
Implementation
Installing andĀ Importing Required Libraries
Reading Résumés and Job Descriptions
Text Processing
Text toĀ Features
Feature Reduction
Model Building
Extracting Entities
Ranking
Visualization
Conclusion
Chapter 6: Creating anĀ E-commerce Product Categorization Model Using Deep Learning
Problem Statement
Methodology andĀ Approach
Environment Setup
Understanding theĀ Data
Exploratory Data Analysis
Data Preprocessing
Text Preprocessing
Feature Engineering
Train-Test Split
Model Building
ANN
Long Short-Term Memory: Recurrent Neural Networks
Convolutional Neural Networks
Evaluating theĀ Model
Hyperparameter Tuning
Results
Summary
Chapter 7: Predicting Duplicate Questions inĀ Quora
Problem Statement
Approach
Unsupervised Learning
Supervised Learning
Data set
Implementation: Unsupervised Learning
Data Preparation
A.Ā Building Vectors Using doc2vec
B.Ā Sentence Transformers Using theĀ BERT Model
C.Ā GPT
Finding Similar questions
Implementation: Supervised Learning
Understanding theĀ Data
Preprocessing theĀ Data
Text toĀ Feature
Model Building
Evaluation
Predictions onĀ New Sentence Pairs
Conclusion
Chapter 8: Named-Entity Recognition Using CRF andĀ BERT
Problem Statement
Methodology andĀ Approach
Implementation
Data
Train Data Preparation
Test Data Preparation
Model Building
Conditional Random Fields (CRF)
Simple Feature Mapping
Feature Mapping by Adding More Features
BERT Transformer
Next Steps
Summary
Chapter 9: Building aĀ Chatbot Using Transfer Learning
Approach
Rule-based Chatbots
Generative or Smart Chatbots
Chatbot 1: QA System Using Similarity Scores
Chatbot 2: Context-based Chatbot Using aĀ Pretrained Model
Hugging Face Transformers
Chatbot 3: Pretrained Chatbot Using RNN
Future Scope
RASA
Microsoft Bot Framework
Conclusion
Chapter 10: News Headline Summarization
Approach
Extractive Summarization
Abstractive Summarization
Environment Setup
Understanding theĀ Data
Text Preprocessing
Model Building
BART: Simple-Transformer Pretrained Model
T5 Pretrained Model
Evaluation Metrics forĀ Summarization
Future Scope
Conclusion
Chapter 11: Text Generation: Next Word Prediction
Problem Statement
Approach: Understanding Language Modeling
Implementation
Model 1: Word-to-Word Text Generation
Model 2: Sentence by Sentence
Model 3: Sequence ofĀ Input Words andĀ Output Word
GPT-2 (Advanced Pretrained Model)
Autocomplete/Suggestion
Fast Autocomplete
Conclusion
Chapter 12: Conclusion and Future Trends
AutoNLP
Multilingual NLP
Conversational AI
Industry-Specific Pretrained Models
Image Captioning
Index


šŸ“œ SIMILAR VOLUMES


Natural Language Processing Projects: Bu
āœ Akshay Kulkarni, Adarsha Shivananda, Anoosh Kulkarni šŸ“‚ Library šŸ“… 2021 šŸ› Apress 🌐 English

<p>Leverage machine learning and deep learning techniques to build fully-fledged natural language processing (NLP) projects. Projects throughout this book grow in complexity and showcase methodologies, optimizing tips, and tricks to solve various business problems. You will use modern Python librari

Natural Language Processing Projects: Bu
āœ Akshay Kulkarni, Adarsha Shivananda, Anoosh Kulkarni šŸ“‚ Library šŸ“… 2021 šŸ› Apress 🌐 English

<p><span>Leverage machine learning and deep learning techniques to build fully-fledged natural language processing (NLP) projects. Projects throughout this book grow in complexity and showcase methodologies, optimizing tips, and tricks to solve various business problems. You will use modern Python l

Hands-On Natural Language Processing wit
āœ Thomas Dop šŸ“‚ Library šŸ“… 2020 šŸ› Packt Publishing Ltd 🌐 English

Become a proficient NLP data scientist by developing deep learning models for NLP and extract valuable insights from structured and unstructured data Key Features Get to grips with word embeddings, semantics, labeling, and high-level word representations using practical examples Learn modern approac

Hands-On Natural Language Processing wit
āœ Thomas Dop šŸ“‚ Library šŸ› Packt Publishing 🌐 English

<p><span>Become a proficient NLP data scientist by developing deep learning models for NLP and extract valuable insights from structured and unstructured data</span></p><h4><span>Key Features</span></h4><ul><li><span><span>Get to grips with word embeddings, semantics, labeling, and high-level word r

Hands-On Natural Language Processing wit
āœ Thomas Dop šŸ“‚ Library šŸ› Packt Publishing 🌐 English

<p><span>Become a proficient NLP data scientist by developing deep learning models for NLP and extract valuable insights from structured and unstructured data</span></p><h4><span>Key Features</span></h4><ul><li><span><span>Get to grips with word embeddings, semantics, labeling, and high-level word r

Advanced Applications of Generative Ai a
āœ Ahmed J. Obaid, Bharat Bhushan, Muthmainnah S., S. Suman Rajest šŸ“‚ Library šŸ“… 2023 šŸ› IGI Global 🌐 English

The rapid advancements in Artificial Intelligence (AI), specifically in Natural Language Processing (NLP) and Generative AI, pose a challenge for academic scholars. Staying current with the latest techniques and applications in these fields is difficult due to their dynamic nature, while the lack of