Large Language Model-Based Solutions : HOW TO DELIVER VALUE WITH COST-EFFECTIVE GENERATIVE AI APPLICATIONS

✍ Scribed by Shreyas Subramanian

Publisher: WILEY
Year: 2024
Tongue: English
Leaves: 224
Category: Library

No coin nor oath required. For personal study only.

✦ Synopsis

Learn to build cost-effective apps using Large Language Models

In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine tuning.

The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find:

Effective strategies to address the challenge of the high computational cost associated with LLMs

...

✦ Table of Contents

Cover
Table of Contents
Title Page
Introduction
GenAI APPLICATIONS AND LARGE LANGUAGE MODELS
IMPORTANCE OF COST OPTIMIZATION
MICRO CASE STUDIES
WHO IS THIS BOOK FOR?
SUMMARY
1 Introduction
OVERVIEW OF GenAI APPLICATIONS AND LARGE LANGUAGE MODELS
PATHS TO PRODUCTIONIZING GenAI APPLICATIONS
THE IMPORTANCE OF COST OPTIMIZATION
SUMMARY
2 Tuning Techniques for Cost Optimization
FINE‐TUNING AND CUSTOMIZABILITY
PARAMETER‐EFFICIENT FINE‐TUNING METHODS
COST AND PERFORMANCE IMPLICATIONS OF PEFT METHODS
SUMMARY
3 Inference Techniques for Cost Optimization
INTRODUCTION TO INFERENCE TECHNIQUES
PROMPT ENGINEERING
CACHING WITH VECTOR STORES
CHAINS FOR LONG DOCUMENTS
SUMMARIZATION
BATCH PROMPTING FOR EFFICIENT INFERENCE
MODEL OPTIMIZATION METHODS
PARAMETER‐EFFICIENT FINE‐TUNING METHODS
COST AND PERFORMANCE IMPLICATIONS
SUMMARY
REFERENCES
4 Model Selection and Alternatives
INTRODUCTION TO MODEL SELECTION
MOTIVATING EXAMPLE: THE TALE OF TWO MODELS
THE ROLE OF COMPACT AND NIMBLE MODELS
EXAMPLES OF SUCCESSFUL SMALLER MODELS
DOMAIN‐SPECIFIC MODELS
THE POWER OF PROMPTING WITH GENERAL‐PURPOSE MODELS
SUMMARY
5 Infrastructure and Deployment Tuning Strategies
INTRODUCTION TO TUNING STRATEGIES
HARDWARE UTILIZATION AND BATCH TUNING
INFERENCE ACCELERATION TOOLS
MONITORING AND OBSERVABILITY
SUMMARY
CONCLUSION
BALANCING PERFORMANCE AND COST
FUTURE TRENDS IN GenAI APPLICATIONS
SUMMARY
INDEX
Copyright
Dedication
ABOUT THE AUTHOR
ABOUT THE TECHNICAL EDITOR
End User License Agreement

📜 SIMILAR VOLUMES

Large Language Model-Based Solutions: Ho

📁 Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications

✍ Shreyas Subramanian 📂 Library 📅 2024 🏛 WILEY 🌐 English

Learn to build cost-effective apps using Large Language Models InLarge Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scient

Large Language Model-Based Solutions: Ho

📁 Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications (Tech Today)

✍ Shreyas Subramanian 📂 Library 📅 2024 🏛 Wiley 🌐 English

Learn to build cost-effective apps using Large Language ModelsIn Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, del

Learn Python Generative AI: Journey from

📁 Learn Python Generative AI: Journey from autoencoders to transformers to large language models

✍ Zonunfeli Ralte, Indrajit Kar 📂 Library 📅 2024 🏛 BPB Publications 🌐 English

Learn to unleash the power of AI creativity KEY FEATURES ● Understand the core concepts related to generative AI. ● Different types of generative models and their applications. ● Learn how to design generative AI neural networks using Python and TensorFlow. DESCRIPTION This book researches the intri

Learn Python Generative AI: Journey from

📁 Learn Python Generative AI: Journey from autoencoders to transformers to large language models

✍ Zonunfeli Ralte, Indrajit Kar 📂 Library 📅 2024 🏛 BPB Publications 🌐 English

Productionizing AI: How to Deliver AI B2

📁 Productionizing AI: How to Deliver AI B2B Solutions with Cloud and Python

✍ Barry Walsh 📂 Library 📅 2022 🏛 Apress 🌐 English

This book is a guide to productionizing AI solutions using best-of-breed cloud services with workarounds to lower costs. Supplemented with step-by-step instructions covering data import through wrangling to partitioning and modeling through to inference and deployment, and augmented with pl

Productionizing AI: How to Deliver AI B2

📁 Productionizing AI: How to Deliver AI B2B Solutions with Cloud and Python

✍ Barry Walsh 📂 Library 📅 2022 🏛 Apress 🌐 English