Introduction to LLM

This page provides an easy-to-understand guide on LLMs (Large Language Models) from basics to applications for AI enthusiasts.


Total of 7 articles available. | Currently on page 1 of 1.

Chapter 14 — Benchmarking, Testing, and Performance

Fifteenth and final post of the LLM Primer IV walkthrough. The MCP-Universe Benchmark on real servers, the two systemic failure modes it exposed, the ten-times throughput gap between session-per-request and shared session pools, and the bridge to Volume V.

2026-04-12

Chapter 13 — Frameworks and Cloud Integration

Fourteenth post of the LLM Primer IV walkthrough. Strands with Bedrock, the AWS state-layer pattern, the Microsoft Agent Framework, LangChain, Semantic Kernel — and the three production integration shapes teams keep arriving at independently.

2026-04-11

Chapter 5 — Transport Protocols and Discovery

Fifth post of the LLM Primer IV walkthrough. The three transports MCP supports, the .well-known discovery layer with Server Cards, and the boring operational concerns — CORS, origin validation, caching — that decide whether a server is a cooperative network citizen or a liability.

2026-04-03

Chapter 4 — Client Primitives: Agentic Behaviors and Control

Fourth post of the LLM Primer IV walkthrough. Sampling, Roots, and Elicitation are the three small, controlled holes MCP punches through the host-server wall — each a capability granted back, each a risk accepted on the user's behalf.

2026-04-02

Chapter 2 — Unveiling the Model Context Protocol (MCP)

Second post of the LLM Primer IV walkthrough. What MCP actually standardizes, the three-role split of Host, Client, and Server, why dynamic discovery and bidirectional messaging differ from REST in the cases that matter, and the session lifecycle that opens with capability negotiation.

2026-03-31

LLM Primer IV — Series Introduction & Index

Kicking off the chapter-by-chapter walkthrough of Book IV in the LLM Primer series — Designing AI Cognition with MCP. Why agents need a protocol layer to scale past demoware, who this book is for, and the schedule for the fourteen posts that follow, March 30 through April 12.

2026-03-29

Chapter 5 — Architecting the Retrieval Pipeline

Fifth post of the LLM Primer III walkthrough. Why a single vector search is not a pipeline — hybrid retrieval, reciprocal rank fusion, cross-encoder reranking, and query-side rewriting and HyDE — assembled into the production architecture that mature RAG systems converge on.

2026-03-22