Introduction to LLM
This page offers an accessible guide to large language models (LLMs), from the basics to applications, for AI enthusiasts.
Chapter 13 — Frameworks and Cloud Integration
Fourteenth post of the LLM Primer IV walkthrough. Strands with Bedrock, the AWS state-layer pattern, the Microsoft Agent Framework, LangChain, Semantic Kernel — and the three production integration shapes teams keep arriving at independently.
2026-04-11
Chapter 11 — Attack Surfaces and Protocol Vulnerabilities
Eleventh post of the LLM Primer IV walkthrough. The classical attacks adapted to MCP — Confused Deputy, Token Passthrough, Session Hijacking — the protocol-level flaws around capability escalation and unauthenticated sampling, and the implicit trust propagation that makes context poisoning a structural problem rather than a hygiene one.
2026-04-09
Chapter 10 — Long-Horizon Task Memory
Tenth post of the LLM Primer IV walkthrough. Short-term memory through windows and ReAct scratchpads, long-term memory through episodic vectors and semantic stores, and the compaction techniques that keep an agent productive over hours and days.
2026-04-08
Chapter 4 — Client Primitives: Agentic Behaviors and Control
Fourth post of the LLM Primer IV walkthrough. Sampling, Roots, and Elicitation are the three small, controlled holes MCP punches through the host-server wall — each a capability granted back, each a risk accepted on the user's behalf.
2026-04-02
Chapter 2 — Unveiling the Model Context Protocol (MCP)
Second post of the LLM Primer IV walkthrough. What MCP actually standardizes, the three-role split of Host, Client, and Server, why dynamic discovery and bidirectional messaging differ from REST in the cases that matter, and the session lifecycle that opens with capability negotiation.
2026-03-31
Chapter 1 — The AI Integration Crisis and the Rise of Agentic Architecture
First post of the LLM Primer IV walkthrough. Why monolithic agents fray as system prompts grow, the N times M integration problem hiding underneath, and the move from prompt engineering to context engineering that MCP was built to enable.
2026-03-30
Chapter 11 — Continuous Updates and Pipeline Optimization
Eleventh and final post of the LLM Primer III walkthrough. CDC and incremental indexing keep the corpus fresh, semantic caching and model tiering keep latency down, and a four-stage feedback loop closes the gap between what production tells the team and what the team actually changes — plus a bridge to Volume IV on Model Context Protocol.
2026-03-28
Chapter 8 — Data Anonymization in the RAG Pipeline
Eighth post of the LLM Primer III walkthrough. Pre-generation versus post-generation anonymization, the three technique families (masking, synthetic replacement, differential privacy), and the utility-privacy tradeoff that determines whether the system remains useful at all.
2026-03-25
Chapter 6 — RAG Threat Models and Vulnerabilities
Sixth post of the LLM Primer III walkthrough. The expanded attack surface of retrieval — corpus poisoning, adversarial chunks, indirect prompt injection, embedding inversion, and the confused-deputy problem in agentic RAG. Concrete attacks, each demonstrated, each reproducible.
2026-03-23
Chapter 4 — Selecting the Right Vector Database
Fourth post of the LLM Primer III walkthrough. The architectural split between purpose-built vector databases and Postgres-style extensions, the managed leaders (Pinecone, Vertex), the open-source field (Qdrant, Milvus, Weaviate), the embedded options, and the three operational axes — residency, ops, cost — that decide the real choice.
2026-03-21
Chapter 1 — The Evolution of RAG Architecture
First post of the LLM Primer III walkthrough. The four architectural postures of RAG — Naive, Advanced, Modular, Agentic — read as a story about handing more agency to the LLM one decision at a time, and the honest answer to when fine-tuning is the better tool than retrieval.
2026-03-18
Chapter 14 — Practical Knowledge for Engineers
Twelfth post — the closing chapter of the LLM Primer II walkthrough. How to keep deepening your understanding after the book ends, the tools and libraries that turn the math into shipping work, and the bridge to the other books in the LLM Primer series.
2026-03-16
Chapter 6 — Fine-Tuning & Adaptation: From Raw Model to Helpful Assistant
Chapter 6 of the LLM Primer I series. The full adaptation stack — from cheap prompt-based steering to parameter-efficient fine-tuning to full alignment with RLHF and its modern successors like DPO. Why post-training is now where closed-model APIs actually differentiate.
2026-02-23
Understanding LLMs – A Mathematical Approach to the Engine Behind AI
A preview from Chapter 7.4: Discover why large language models inherit bias, the real-world risks, strategies for mitigation, and the growing role of AI governance.
2025-09-01
7.3 Integrating Multimodal Models
A preview from Chapter 7.3: Discover how multimodal models fuse text, images, audio, and video to unlock richer AI capabilities beyond text-only LLMs.
2024-10-09
7.0 Future Outlook and Challenges
A preview from Chapter 7: Explore the future of large language models—ethics, efficiency, multimodal AI, and responsible governance beyond scaling.
2024-10-06
6.2 Simple Python Experiments with LLMs
A preview from Chapter 6.2: Learn how to run large language models with Hugging Face, OpenAI, Google Cloud, and Azure using just Python and a few lines of code.
2024-10-05
6.1 Introducing Open-Source Tools and APIs
A preview from Chapter 6.1: Explore Hugging Face, OpenAI, Google Cloud Vertex AI, and Azure Cognitive Services—leading tools to bring LLMs into your projects.
2024-10-04
6.0 Hands-On with LLMs
A preview from Chapter 6: Learn how to run large language models yourself with open-source libraries, cloud APIs, and Python—making LLMs accessible to everyone.
2024-10-02
4.4 How LLMs Write Code: The Rise of AI-Powered Programming Assistants
Explore how large language models (LLMs) generate and complete code from natural-language prompts, and what it means for the future of software development.
2024-09-27
2.3 Key LLM Models: BERT, GPT, and T5 Explained
Discover the main differences between BERT, GPT, and T5 among large language models (LLMs). Learn about their distinguishing features, their applications, and the NLP tasks each is suited to.
2024-09-10
1.1 Understanding Large Language Models (LLMs): Definition, Training, and Scalability Explained
Explore the fundamentals of large language models (LLMs), including their structure, training techniques such as pre-training and fine-tuning, and the importance of scalability. Discover how LLMs such as GPT and BERT perform NLP tasks like text generation and translation.
2024-09-03
A Guide to LLMs (Large Language Models): Understanding the Foundations of Generative AI
Learn about large language models (LLMs), including GPT, BERT, and T5, their functionality, training processes, and practical applications in NLP. This guide provides insights for engineers interested in leveraging LLMs in various fields.
2024-09-01