Introduction to LLM

This page provides an easy-to-understand guide on LLMs (Large Language Models) from basics to applications for AI enthusiasts.

Total of 2 articles available. | Currently on page 1 of 1.

Chapter 2 — Probability, Tokens, and Text: The Game of Next-Word Guessing

Chapter 2 of the LLM Primer I series. How LLMs convert text into tokens, why language modeling is fundamentally a probability problem, and how the old n-gram approach gave way to neural models that can generalize. Includes plain-English explanations of perplexity and why every token boundary matters.

2026-02-19

3.1 LLM Training: Dataset Selection and Preprocessing Techniques

Learn about dataset selection and preprocessing techniques for training Large Language Models (LLMs). Explore steps like noise removal, tokenization, normalization, and data balancing for optimized model performance.

2024-09-12