MDJ Studios

Field notes on agentic AI, engineering, and teaching

Deep dives on building production AI agents with the Claude Agent SDK and MCP, full-stack engineering with Next.js and TypeScript, and the craft of teaching technical subjects.

Subscribe via RSS

An attention diagram showing how the word 'sat' attends to 'cat' and 'mat' most strongly in the sentence 'The cat sat on the mat.'
5 min read

Transformers, as I teach them

Strip away the math and what's left is surprisingly clean. Here's the mental model I use when I teach transformers, the architecture behind every modern LLM, to students who've never written a line of machine learning.

  • LLMs
  • Teaching
  • Machine Learning