Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications

Scribed by Shreyas Subramanian


Publisher: WILEY
Year: 2024
Tongue: English
Leaves: 224
Category: Library

✦ Synopsis


Learn to build cost-effective apps using Large Language Models

In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine-tuning.

The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find:

  • Effective strategies to address the challenge of the high computational cost associated with LLMs
  • ...

    ✦ Table of Contents


    Cover
    Table of Contents
    Title Page
    Introduction
    GenAI APPLICATIONS AND LARGE LANGUAGE MODELS
    IMPORTANCE OF COST OPTIMIZATION
    MICRO CASE STUDIES
    WHO IS THIS BOOK FOR?
    SUMMARY
    1 Introduction
    OVERVIEW OF GenAI APPLICATIONS AND LARGE LANGUAGE MODELS
    PATHS TO PRODUCTIONIZING GenAI APPLICATIONS
    THE IMPORTANCE OF COST OPTIMIZATION
    SUMMARY
    2 Tuning Techniques for Cost Optimization
    FINE-TUNING AND CUSTOMIZABILITY
    PARAMETER-EFFICIENT FINE-TUNING METHODS
    COST AND PERFORMANCE IMPLICATIONS OF PEFT METHODS
    SUMMARY
    3 Inference Techniques for Cost Optimization
    INTRODUCTION TO INFERENCE TECHNIQUES
    PROMPT ENGINEERING
    CACHING WITH VECTOR STORES
    CHAINS FOR LONG DOCUMENTS
    SUMMARIZATION
    BATCH PROMPTING FOR EFFICIENT INFERENCE
    MODEL OPTIMIZATION METHODS
    PARAMETER-EFFICIENT FINE-TUNING METHODS
    COST AND PERFORMANCE IMPLICATIONS
    SUMMARY
    REFERENCES
    4 Model Selection and Alternatives
    INTRODUCTION TO MODEL SELECTION
    MOTIVATING EXAMPLE: THE TALE OF TWO MODELS
    THE ROLE OF COMPACT AND NIMBLE MODELS
    EXAMPLES OF SUCCESSFUL SMALLER MODELS
    DOMAIN-SPECIFIC MODELS
    THE POWER OF PROMPTING WITH GENERAL-PURPOSE MODELS
    SUMMARY
    5 Infrastructure and Deployment Tuning Strategies
    INTRODUCTION TO TUNING STRATEGIES
    HARDWARE UTILIZATION AND BATCH TUNING
    INFERENCE ACCELERATION TOOLS
    MONITORING AND OBSERVABILITY
    SUMMARY
    CONCLUSION
    BALANCING PERFORMANCE AND COST
    FUTURE TRENDS IN GenAI APPLICATIONS
    SUMMARY
    INDEX
    Copyright
    Dedication
    ABOUT THE AUTHOR
    ABOUT THE TECHNICAL EDITOR
    End User License Agreement


    SIMILAR VOLUMES


    Large Language Model-Based Solutions: Ho
    Shreyas Subramanian · Library · 2024 · WILEY · English

    Learn to build cost-effective apps using Large Language Models. In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scient

    Learn Python Generative AI: Journey from
    Zonunfeli Ralte, Indrajit Kar · Library · 2024 · BPB Publications · English

    Learn to unleash the power of AI creativity KEY FEATURES ● Understand the core concepts related to generative AI. ● Different types of generative models and their applications. ● Learn how to design generative AI neural networks using Python and TensorFlow. DESCRIPTION This book researches the intri

    Productionizing AI: How to Deliver AI B2
    Barry Walsh · Library · 2022 · Apress · English

    This book is a guide to productionizing AI solutions using best-of-breed cloud services with workarounds to lower costs. Supplemented with step-by-step instructions covering data import through wrangling to partitioning and modeling through to inference and deployment, and augmented with pl
