Learn how to create, train, and tweak large language models (LLMs) by building one from the ground up! In Build a Large Language Model (from Scratch), youโll discover how LLMs work from the inside out. In this insightful book, bestselling author Sebastian Raschka guides you step by step through cre
Build a Large Language Model (From Scratch)
โ Scribed by welcome.html
- Tongue
- English
- Series
- MEAP
- Edition
- v06
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
In this book, I invite you to embark on an educational journey with me to
learn how to build Large Language Models (LLMs) from the ground up.
Together, we'll delve deep into the LLM training pipeline, starting from data
loading and culminating in finetuning LLMs on custom datasets.
For many years, I've been deeply immersed in the world of deep learning,
coding LLMs, and have found great joy in explaining complex concepts
thoroughly. This book has been a long-standing idea in my mind, and I'm
thrilled to finally have the opportunity to write it and share it with you. Those
of you familiar with my work, especially from my blog, have likely seen
glimpses of my approach to coding from scratch. This method has resonated
well with many readers, and I hope it will be equally effective for you.
I've designed the book to emphasize hands-on learning, primarily using
PyTorch and without relying on pre-existing libraries. With this approach,
coupled with numerous figures and illustrations, I aim to provide you with a
thorough understanding of how LLMs work, their limitations, and
customization methods. Moreover, we'll explore commonly used workflows
and paradigms in pretraining and fine-tuning LLMs, offering insights into
their development and customization.
The book is structured with detailed step-by-step introductions, ensuring no
critical detail is overlooked. To gain the most from this book, you should
have a background in Python programming. Prior experience in deep learning
and a foundational understanding of PyTorch, or familiarity with other deep
learning frameworks like TensorFlow, will be beneficial
โฆ Table of Contents
Build a Large Language Model (From Scratch)
1. welcome
2. 1_Understanding_Large_Language_Models
3. 2_Working_with_Text_Data
4. 3_Coding_Attention_Mechanisms
5. 4_Implementing_a_GPT_model_from_Scratch_To_Generate_Text
6. 5_Pretraining_on_Unlabeled_Data
7. Appendix_A._Introduction_to_PyTorch
8. Appendix_B._References_and_Further_Reading
9. Appendix_C._Exercise_Solutions
10. Appendix_D._Adding_Bells_and_Whistles_to_the_Training_Loop
๐ SIMILAR VOLUMES
Build a Large Language Model (from Scratch) is a one-of-a-kind guide to building your own working LLM. In it, machine learning expert and author Sebastian Raschka reveals how LLMs work under the hood, tearing the lid off the Generative AI black box. The book is filled with practical insights into co
<span>A hands-on, example-filled guide to the theory and practice of writing a C compiler: a computer program that translates code written by programmers into code the computer can read.</span><span><br><br>An approachable, hands-on tutorial to writing a C compiler: a computer program that translate