Transformers and Large Language Models
Regulärer Preis:
62,99 €
Sofort verfügbar
Transformers and Large Language Models, Apress
A Hands-On Guide to RAG and Agentic AI
Von Ahmed Fawzy Gad, im heise shop in digitaler Fassung erhältlich
Produktinformationen "Transformers and Large Language Models"
This book is a hands-on guide to understanding the foundations, architectures,
and real-world applications of transformers and large language models in modern
AI.
The book begins by laying the foundations of generative AI architectures,
tokenization, encoding, and classical modeling techniques. Initial chapters
address the evolution from feed-forward networks and recurrent neural networks
to long short-term memory (LSTM), setting the stage for the revolutionary
transformer architecture. The core of the book focuses on transformers,
introducing the encoder-decoder framework, attention mechanisms, positional
encodings, and the internal workings of multi-head attention, normalization, and
multi-layer perceptrons. Readers gain insight into advanced techniques such as
rotary positional embeddings (RoPE), mixture of experts (MoE), and knowledge
distillation, alongside practical training strategies like self-supervised
learning, fine-tuning, and reinforcement learning with human feedback. Popular
models from OpenAI, DeepSeek, and other vendors are examined to highlight the
evolution of the LLM landscape. Building on these foundations, the text explores
methods for model customization, including parameter-efficient fine-tuning
(LoRA, adapters), text generation strategies, prompt engineering, and
quantization. Retrieval-Augmented Generation (RAG) is introduced as a critical
innovation for grounding LLMs in external knowledge, with detailed evaluation
techniques for retrieval and generation. Finally, the book ventures into Agentic
AI, demonstrating protocols like Model Context Protocol (MCP) and Agent-to-Agent
(A2A) interactions with practical coding examples.
In conclusion, this book serves as both a practical guide, equipping readers
with the technical depth and applied strategies needed to design, fine-tune, and
deploy cutting-edge transformers and large language models for real-world
applications.
What we will learn:
Ø Understand the foundations of AI, ML pipelines, tokenization,
encoding, and early neural architectures.
Ø Explore transformers in depth—encoder-decoder design,
attention mechanisms, and advanced embedding methods.
Ø Learn modern LLM advancements like RoPE, MoE, SLMs, fine-tuning
strategies, and evaluation techniques.
Ø Master practical customization through prompt engineering, PEFT
methods, quantization, and text generation.
nWho this book is for:
Data scientists, ML engineers, AI researchers, and developers exploring
Transformers and large language models.
Artikel-Details
- Anbieter:
- Apress
- Autor:
- Ahmed Fawzy Gad
- Artikelnummer:
- 9798868827853
- Veröffentlicht:
- 30.06.26
Barrierefreiheit
This PDF has been created in accordance with the PDF/UA-1 standard to enhance accessibility, including screen reader support, described non-text content (images, graphs), bookmarks for easy navigation
- entspricht den Vorgaben der PDF / UA 1 (05)
- keine Vorlesefunktionen des Lesesystems deaktiviert (bis auf) (10)
- navigierbares Inhaltsverzeichnis (11)
- logische Lesereihenfolge eingehalten (13)
- kurze Alternativtexte (z.B für Abbildungen) vorhanden (14)
- Inhalt auch ohne Farbwahrnehmung verständlich dargestellt (25)
- hoher Kontrast zwischen Text und Hintergrund (26)
- Navigation über vor-/zurück-Elemente (29)
- alle zum Verständnis notwendigen Inhalte über Screenreader zugänglich (52)
- Kontakt zum Herausgeber für weitere Informationen zur Barrierefreiheit (99)