Building Large Language Models from Scratch


Author: Dilyan Grigorov

E-book, 2026

English

88,30 €

Read in an Adobe DRM-compatible e-book reader. This e-book is copy-protected with Adobe DRM, which affects the platforms on which you can read it.

This book is a complete, hands-on guide to designing, training, and deploying your own Large Language Models (LLMs), from the foundations of tokenization to the advanced stages of fine-tuning and reinforcement learning. Written for developers, data scientists, and AI practitioners, it bridges core principles and state-of-the-art techniques, offering a rare, transparent look at how modern transformers truly work beneath the surface.

Starting from the essentials, you’ll learn how to set up your environment with Python and PyTorch, manage datasets, and implement critical fundamentals such as tensors, embeddings, and gradient descent. You’ll then progress through the architectural heart of modern models, covering RMS normalization, rotary positional embeddings (RoPE), scaled dot-product attention, Grouped Query Attention (GQA), Mixture of Experts (MoE), and SwiGLU activations, each explored in depth and built step by step in code. As you advance, the book introduces custom CUDA kernel integration, teaching you how to optimize key components for speed and memory efficiency at the GPU level, an essential skill for scaling real-world LLMs.

You’ll also gain mastery over the phases of training that define today’s leading models:

Pretraining - Building general linguistic and semantic understanding.
Midtraining - Expanding domain-specific capabilities and adaptability.
Supervised Fine-Tuning (SFT) - Aligning behavior with curated, task-driven data.
Reinforcement Learning from Human Feedback (RLHF) - Refining responses through reward-based optimization for human alignment.

The final chapters guide you through dataset preparation, filtering, deduplication, and training optimization, culminating in model evaluation and real-world prompting with a custom TokenGenerator for text generation and inference.

By the end of this book, you’ll have the knowledge and confidence to architect, train, and deploy your own transformer-based models, equipped with both the theoretical depth and practical expertise to innovate in the rapidly evolving world of AI.

What You’ll Learn

How to configure and optimize your development environment using PyTorch.
The mechanics of tokenization, embeddings, normalization, and attention mechanisms.
How to implement transformer components like RMSNorm, RoPE, GQA, MoE, and SwiGLU from scratch.
How to integrate custom CUDA kernels to accelerate transformer computations.
The full LLM training pipeline: pretraining, midtraining, supervised fine-tuning, and RLHF.
Techniques for dataset preparation, deduplication, model debugging, and GPU memory management.
How to train, evaluate, and deploy a complete GPT-like architecture for real-world tasks.

Who this book is for:

Software developers, data scientists, machine learning engineers and AI enthusiasts looking to build their models from scratch.

Subtitle: Design, Train, and Deploy LLMs with PyTorch
Author: Dilyan Grigorov
ISBN: 9798868822971
Language: English
Publication date: 4 May 2026
Publisher: APRESS
Format: Epub - Adobe DRM
Read e-books on: mobile device/tablet, e-reader, computer

Delivery

Digital e-books and audiobooks are delivered immediately after purchase and are then available in your digital library. For physical products, we offer free economy delivery to private customers on orders of 29 euros or more. We offer delivery to a pickup point, directly to your mailbox or door, to a parcel locker, or as home delivery. The available delivery options are shown at checkout. Delivery time may vary depending on the delivery method, the address, and the delivery times of the products in your order. The estimated total delivery time is shown at checkout.

Payment

You can pay with MobilePay, by card (Visa or Mastercard), with Apple Pay, or with Klarna. Klarna's payment options are card payment, online bank payment, invoice, or instalments. By clicking "Maksa tilaus" (Pay for order), you accept the general terms of Kustom and Adlibris. You can read about how your personal data is processed in the Adlibris privacy policy and the Kustom privacy notice.

Returns

At Adlibris you always have a 28-day right of return, counted from the moment you receive your product. The return fee is 5,90 euros. The right of withdrawal does not apply to products that are delivered electronically and are available for download from the moment the order is confirmed. More information on returns, refunds, and complaints can be found in our terms of purchase.
