AI Foundations Guide
From biological intelligence to the frontier — 27 chapters.
Part I — Intelligence in Nature
-
01
From Reflex to Reason
Single-cell organisms, nervous systems, the evolutionary ladder from stimulus-response to planning.
-
02
The Brain
Neurons, synapses, brain structures, and how biology implements learning.
-
03
From Neuroscience to AI
McCulloch-Pitts, Hebb, the perceptron, XOR, the AI winter. Connectionism vs symbolicism.
Part II — The Math Behind It
-
04
Mathematical Foundations
Linear algebra, calculus, probability, and information theory — each tied to where it appears in AI.
Part III — Building Artificial Neural Networks
-
05
Neurons to Networks
Weights, activations, the math of a single neuron, layers, and deep networks.
-
06
The Training Loop
Forward pass, loss functions, backpropagation, gradient descent, overfitting, regularization.
-
07
GPUs and the Hardware Revolution
Why parallel matrix math on graphics cards was the unlock. The CUDA moment.
-
08
Architectures
CNNs, RNNs, LSTMs, autoencoders, GANs — what each solved and what it couldn't.
Part IV — How LLMs Work
-
09
The Transformer
Attention, self-attention, multi-head attention, positional encoding. Why this architecture won.
-
10
Tokenization
Text to numbers. BPE, vocabulary, subword units. What the model actually sees.
-
11
Pre-training
Next-token prediction, the data, the scale. What emerges and why nobody expected it.
-
12
What the Model "Knows"
Embeddings, latent space, superposition. Why it's compression, not growth.
Part V — Shaping and Using a Model
-
13
Fine-tuning
SFT, RLHF, instruction tuning, alignment. What changes in the weights.
-
14
LoRA and Efficient Methods
Low-rank adaptation, QLoRA, adapters. Why you don't need to retrain everything.
-
15
Inference
Autoregressive decoding, temperature, sampling, KV cache, context windows.
Part VI — The Field Today
-
16
The Communities
cs.AI vs cs.ML vs cs.IR vs cs.CL. Conferences, journals, how to read the landscape.
-
17
The Labs
OpenAI, Anthropic, DeepMind, Meta AI, Mistral. The open vs closed debate.
-
18
The Hardware Stack
Nvidia, AMD, TSMC, Google TPUs, HBM. What each layer does and why Nvidia dominates.
-
19
Hyperscalers and Enterprise AI
Azure, AWS, GCP. How AI gets deployed at scale.
-
20
The Applied Landscape
RAG, tool use, MCP, agentic engineering, multi-agent systems.
Part VII — The Frontier
-
21
Beyond LLMs
Reinforcement learning, world models, JEPA, neuroevolution.
-
22
Continual Learning
Catastrophic forgetting and why models can't just keep learning.
-
23
Intrinsic Motivation and Curiosity
Prediction error as drive, open-ended learning. The constraint-as-feature insight.
-
24
Quantum Computing and AI
What quantum gives you, what it doesn't. Honest state of the field.
-
25
Neural Interfaces
BCIs, Neuralink, convergence of biological and artificial intelligence.
-
26
The Gap
What exists vs what's missing. Growing architecture, survival drive, persistent learning.
Part VIII — Your System
-
27
Connecting It Back
What this means for what you're building. Hardware, research opportunity, next steps.