๐”– Scriptorium
โœฆ   LIBER   โœฆ

๐Ÿ“

Build a Large Language Model (From Scratch)

โœ Scribed by welcome.html


Tongue
English
Series
MEAP
Edition
v06
Category
Library

โฌ‡  Acquire This Volume

No coin nor oath required. For personal study only.

โœฆ Synopsis


In this book, I invite you to embark on an educational journey with me to
learn how to build Large Language Models (LLMs) from the ground up.
Together, we'll delve deep into the LLM training pipeline, starting from data
loading and culminating in finetuning LLMs on custom datasets.
For many years, I've been deeply immersed in the world of deep learning,
coding LLMs, and have found great joy in explaining complex concepts
thoroughly. This book has been a long-standing idea in my mind, and I'm
thrilled to finally have the opportunity to write it and share it with you. Those
of you familiar with my work, especially from my blog, have likely seen
glimpses of my approach to coding from scratch. This method has resonated
well with many readers, and I hope it will be equally effective for you.
I've designed the book to emphasize hands-on learning, primarily using
PyTorch and without relying on pre-existing libraries. With this approach,
coupled with numerous figures and illustrations, I aim to provide you with a
thorough understanding of how LLMs work, their limitations, and
customization methods. Moreover, we'll explore commonly used workflows
and paradigms in pretraining and fine-tuning LLMs, offering insights into
their development and customization.
The book is structured with detailed step-by-step introductions, ensuring no
critical detail is overlooked. To gain the most from this book, you should
have a background in Python programming. Prior experience in deep learning
and a foundational understanding of PyTorch, or familiarity with other deep
learning frameworks like TensorFlow, will be beneficial

โœฆ Table of Contents


Build a Large Language Model (From Scratch)
1. welcome
2. 1_Understanding_Large_Language_Models
3. 2_Working_with_Text_Data
4. 3_Coding_Attention_Mechanisms
5. 4_Implementing_a_GPT_model_from_Scratch_To_Generate_Text
6. 5_Pretraining_on_Unlabeled_Data
7. Appendix_A._Introduction_to_PyTorch
8. Appendix_B._References_and_Further_Reading
9. Appendix_C._Exercise_Solutions
10. Appendix_D._Adding_Bells_and_Whistles_to_the_Training_Loop


๐Ÿ“œ SIMILAR VOLUMES


Build a Large Language Model (From Scrat
โœ Sebastian Raschka ๐Ÿ“‚ Library ๐Ÿ“… 2024 ๐Ÿ› Manning Publications Co. ๐ŸŒ English

Learn how to create, train, and tweak large language models (LLMs) by building one from the ground up! In Build a Large Language Model (from Scratch), youโ€™ll discover how LLMs work from the inside out. In this insightful book, bestselling author Sebastian Raschka guides you step by step through cre

Build a Large Language Model (From Scrat
โœ Sebastian Raschka ๐Ÿ“‚ Library ๐Ÿ“… 2023 ๐Ÿ› Manning Publications ๐ŸŒ English

Build a Large Language Model (from Scratch) is a one-of-a-kind guide to building your own working LLM. In it, machine learning expert and author Sebastian Raschka reveals how LLMs work under the hood, tearing the lid off the Generative AI black box. The book is filled with practical insights into co

Writing a C Compiler: Build a Real Progr
โœ Nora Sandler ๐Ÿ“‚ Library ๐Ÿ“… 2023 ๐Ÿ› No Starch Press ๐ŸŒ English

<span>A hands-on, example-filled guide to the theory and practice of writing a C compiler: a computer program that translates code written by programmers into code the computer can read.</span><span><br><br>An approachable, hands-on tutorial to writing a C compiler: a computer program that translate