Introduction to LLM
This page provides an easy-to-understand guide on LLMs (Large Language Models) from basics to applications for AI enthusiasts.
Chapter 10 — Long-Horizon Task Memory
Tenth post of the LLM Primer IV walkthrough. Short-term memory through windows and ReAct scratchpads, long-term memory through episodic vectors and semantic stores, and the compaction techniques that keep an agent productive over hours and days.
2026-04-08Chapter 9 — The RAG Evaluation Triad
Ninth post of the LLM Primer III walkthrough. A RAG system can fail in three different places and the failures look identical from the outside — the Evaluation Triad of Context Relevance, Groundedness, and Answer Relevance is the small vocabulary that prevents fixing one bug while measuring another.
2026-03-26Chapter 8 — Data Anonymization in the RAG Pipeline
Eighth post of the LLM Primer III walkthrough. Pre-generation versus post-generation anonymisation, the three technique families — masking, synthetic replacement, differential privacy — and the utility-privacy tradeoff that determines whether the system remains useful at all.
2026-03-25Chapter 3 — Advanced Chunking Frameworks
Third post of the LLM Primer III walkthrough. The chunking spectrum from fixed-size to structure-aware, the overlap myth, the context cliff that destroys retrieval quietly, and the contextual-retrieval and late-chunking techniques that have reshaped the frontier.
2026-03-20Chapter 2 — Intelligent Document Parsing
Second post of the LLM Primer III walkthrough. Why a PDF is not a text file, what layout-aware parsers actually preserve, the current tool landscape (LlamaParse, Docling, Unstructured, Marker-PDF, Firecrawl, DeepSeek-OCR), and the multimodal track that retrieves over page images directly.
2026-03-19LLM Primer III — Series Introduction & Index
Kicking off the chapter-by-chapter walkthrough of Book III in the LLM Primer series — Enhancing Enterprise AI with RAG. Why retrieval-augmented generation looks simple from the outside and is a stack of disciplines underneath, who this book is for, and the schedule for the eleven posts that follow, March 18 through March 28.
2026-03-17Chapter 14 — Practical Knowledge for Engineers
Twelfth post — the closing chapter of the LLM Primer II walkthrough. How to keep deepening your understanding after the book ends, the tools and libraries that turn the math into shipping work, and the bridge to the other books in the LLM Primer series.
2026-03-16Chapter 7 — Beyond Next-Token Prediction: Embeddings, Retrieval, and Multimodality
Chapter 7 of the LLM Primer I series. The capabilities that turn a next-token predictor into something much more — embeddings, semantic search, retrieval-augmented generation, and the move into multimodal inputs. How RAG actually keeps an LLM grounded in real documents instead of confabulating.
2026-02-24