Introduction to LLM
This page provides an easy-to-understand guide on LLMs (Large Language Models) from basics to applications for AI enthusiasts.
Chapter 14 — Benchmarking, Testing, and Performance
Fifteenth and final post of the LLM Primer IV walkthrough. The MCP-Universe Benchmark on real servers, the two systemic failure modes it exposed, the ten-times throughput gap between session-per-request and shared session pools, and the bridge to Volume V.
2026-04-12Chapter 13 — Frameworks and Cloud Integration
Fourteenth post of the LLM Primer IV walkthrough. Strands with Bedrock, the AWS state-layer pattern, the Microsoft Agent Framework, LangChain, Semantic Kernel — and the three production integration shapes teams keep arriving at independently.
2026-04-11Chapter 5 — Transport Protocols and Discovery
Fifth post of the LLM Primer IV walkthrough. The three transports MCP supports, the .well-known discovery layer with Server Cards, and the boring operational concerns — CORS, origin validation, caching — that decide whether a server is a cooperative network citizen or a liability.
2026-04-03Chapter 4 — Client Primitives: Agentic Behaviors and Control
Fourth post of the LLM Primer IV walkthrough. Sampling, Roots, and Elicitation are the three small, controlled holes MCP punches through the host-server wall — each a capability granted back, each a risk accepted on the user's behalf.
2026-04-02Chapter 2 — Unveiling the Model Context Protocol (MCP)
Second post of the LLM Primer IV walkthrough. What MCP actually standardizes, the three-role split of Host, Client, and Server, why dynamic discovery and bidirectional messaging differ from REST in the cases that matter, and the session lifecycle that opens with capability negotiation.
2026-03-31LLM Primer IV — Series Introduction & Index
Kicking off the chapter-by-chapter walkthrough of Book IV in the LLM Primer series — Designing AI Cognition with MCP. Why agents need a protocol layer to scale past demoware, who this book is for, and the schedule for the fourteen posts that follow, March 30 through April 12.
2026-03-29Chapter 5 — Architecting the Retrieval Pipeline
Fifth post of the LLM Primer III walkthrough. Why a single vector search is not a pipeline — hybrid retrieval, reciprocal rank fusion, cross-encoder reranking, and query-side rewriting and HyDE — assembled into the production architecture that mature RAG systems converge on.
2026-03-22