Introduction to LLM
This page provides an easy-to-understand guide on LLMs (Large Language Models) from basics to applications for AI enthusiasts.
Chapter 11 — Continuous Updates and Pipeline Optimization
Eleventh and final post of the LLM Primer III walkthrough. CDC and incremental indexing keep the corpus fresh, semantic caching and model tiering keep latency down, and a four-stage feedback loop closes the gap between what production tells the team and what the team actually changes — plus a bridge to Volume IV on Model Context Protocol.
2026-03-28Chapter 10 — Leading Evaluation Frameworks
Tenth post of the LLM Primer III walkthrough. A field guide to the frameworks that turn the Evaluation Triad into something a team can actually run — RAGAS, TruLens, DeepEval on one side, Braintrust, LangSmith, Phoenix, Galileo, Opik on the other, and the Evaluation Gap none of them has yet closed.
2026-03-27Chapter 9 — The RAG Evaluation Triad
Ninth post of the LLM Primer III walkthrough. A RAG system can fail in three different places and the failures look identical from the outside — the Evaluation Triad of Context Relevance, Groundedness, and Answer Relevance is the small vocabulary that prevents fixing one bug while measuring another.
2026-03-26Chapter 8 — Data Anonymization in the RAG Pipeline
Eighth post of the LLM Primer III walkthrough. Pre-generation versus post-generation anonymisation, the three technique families — masking, synthetic replacement, differential privacy — and the utility-privacy tradeoff that determines whether the system remains useful at all.
2026-03-25Chapter 7 — Implementing Access Control
Seventh post of the LLM Primer III walkthrough. Document-level ACLs as the foundation, RBAC with Microsoft Purview sensitivity labels, ReBAC with Zanzibar and SpiceDB, and the pre-filter versus post-filter discipline that runs underneath all of them.
2026-03-24Chapter 6 — RAG Threat Models and Vulnerabilities
Sixth post of the LLM Primer III walkthrough. The expanded attack surface of retrieval — corpus poisoning, adversarial chunks, indirect prompt injection, embedding inversion, and the confused-deputy problem in agentic RAG. Concrete attacks, each demonstrated, each reproducible.
2026-03-23Chapter 5 — Architecting the Retrieval Pipeline
Fifth post of the LLM Primer III walkthrough. Why a single vector search is not a pipeline — hybrid retrieval, reciprocal rank fusion, cross-encoder reranking, and query-side rewriting and HyDE — assembled into the production architecture that mature RAG systems converge on.
2026-03-22Chapter 4 — Selecting the Right Vector Database
Fourth post of the LLM Primer III walkthrough. The architectural split between purpose-built vector databases and Postgres-style extensions, the managed leaders (Pinecone, Vertex), the open-source field (Qdrant, Milvus, Weaviate), the embedded options, and the three operational axes — residency, ops, cost — that decide the real choice.
2026-03-21Chapter 3 — Advanced Chunking Frameworks
Third post of the LLM Primer III walkthrough. The chunking spectrum from fixed-size to structure-aware, the overlap myth, the context cliff that destroys retrieval quietly, and the contextual-retrieval and late-chunking techniques that have reshaped the frontier.
2026-03-20Chapter 2 — Intelligent Document Parsing
Second post of the LLM Primer III walkthrough. Why a PDF is not a text file, what layout-aware parsers actually preserve, the current tool landscape (LlamaParse, Docling, Unstructured, Marker-PDF, Firecrawl, DeepSeek-OCR), and the multimodal track that retrieves over page images directly.
2026-03-19Chapter 1 — The Evolution of RAG Architecture
First post of the LLM Primer III walkthrough. The four architectural postures of RAG — Naive, Advanced, Modular, Agentic — read as a story about handing more agency to the LLM one decision at a time, and the honest answer to when fine-tuning is the better tool than retrieval.
2026-03-18LLM Primer III — Series Introduction & Index
Kicking off the chapter-by-chapter walkthrough of Book III in the LLM Primer series — Enhancing Enterprise AI with RAG. Why retrieval-augmented generation looks simple from the outside and is a stack of disciplines underneath, who this book is for, and the schedule for the eleven posts that follow, March 18 through March 28.
2026-03-17Chapter 11 — Evaluation, Calibration, and Inference
Eleventh post of the LLM Primer II walkthrough. Perplexity, calibration, the error bars that every benchmark score should carry, and the mathematics of measuring hallucination — the chapter where we ask how anyone can measure a machine that can say anything.
2026-03-13