Public AI discourse swings between mystification and dismissal. Both are wrong because both skip the mechanism. The mechanism is the only part worth your time: how the system is built, what objective it was optimised against, what hardware it runs on, what representations it learned, where it falls over.
This course is engineered to teach those mechanisms in the order that makes them stick, and to keep them stuck six months later. Lesson 0 is the doorway. You read it once at the start so the rest of the course knows where you're standing.
The course is a workshop you walk through in your head. Seven rooms, an external staircase, a roof. Each room belongs to a phase. Each phase teaches one layer of the modern AI stack.
The bench is where you sit and think before you build. The whiteboard wall is where you sketch the maths you'll need. The server bay is the noisy room with the racks; where hardware lives. The drafting table is where neural architectures get drawn out in chronological order, each one a response to the previous one's limits. The foundry is where models get made: training, scaling, fine-tuning. The lab bench is the wiring rig where production systems get assembled. The staircase is where you learn to read primary research. The roof is where you survey the frontier.
Seventy-eight numbered concept lessons across those rooms, plus this lesson at the doorway, plus a synthesis lesson at the end of each phase, plus a calibration assessment between phases. Roughly eight months at two to three lessons a week.
The order matters. You can't read the frontier without research literacy. You can't have research literacy without the architectures it talks about. You can't have the architectures without the maths and the hardware that shaped them. You can't have any of that without a clear definition of what an intelligence system actually is. The whole structure is downstream of where you'd want to start if you were learning the field carefully from scratch.
The hardest part of learning modern AI is not the mathematics. It is the public discourse, which swings between two failure modes. One frames every new model as a step toward something quasi-mystical. The other dismisses the whole field as statistical autocomplete. Both are wrong because both skip the mechanism.
The mechanism is what this course teaches. Why does this architecture take the shape it does? Which constraint was it responding to? What does the training objective actually reward? Where does the model fail, and why does that failure trace back to the mechanism rather than the headline?
A reader who finishes the course should be able to look at a new architecture or training scheme and ask, quickly and accurately, what hardware fact, what data fact, and what objective fact produced it. That's the habit the course is engineered to install.
Hardware sits underneath every choice in this course. Most AI material treats compute as a budget line and silicon as someone else's problem. That framing produces a misleading picture: modern AI took the shape it has because matrix multiplies are fast on specific silicon, memory bandwidth caps what models can be served, and quantisation makes the spectrum traversable.
So the course is built bottom-up. The maths supports the architectures. The architectures fit the silicon. The training scales because the hardware allows it. The deployment lives within the memory and latency budgets the substrate enforces. By the time you reach the frontier, you can read a new system as a response to a stack of constraints rather than as a mystery.
AI runs across the full compute spectrum. The same mechanisms (representation, optimisation, geometry) appear on microcontrollers with kilobytes of RAM, on phones, on workstations, on home labs, on server clusters, on hyperscale data centers. The principles stay; the constraint set shifts at each tier.
The course teaches AI as a constraint-aware engineering discipline. A recurring question, applied wherever it earns its place, is how does this change under severe constraints? That question lands differently at each tier, and the answers are part of the field.
The course is engineered for long-term retention rather than short-term information exposure. The apparatus that delivers that is load-bearing, not garnish.
You'll see five things working together. A memory palace: the workshop above, with each lesson anchored to a physical object in a room you walk in your head. The route turns lesson sequence into spatial memory; walk the route weekly, and the order becomes muscle. Retrieval practice: three open-ended questions at the end of each lesson, answered without looking, then checked. The brain consolidates what it has to reach for; re-reading alone doesn't trigger that consolidation. Glossary tooltips: dashed-underlined terms in the lesson text (like the ones you've been hovering over in this lesson) expand to definitions. The glossary accumulates across phases as a single living reference. Synthesis lessons: at the end of each phase, a compression lesson that reconnects everything in the room before you leave it. Calibration assessments: a short self-test of mechanism (not trivia) before you start the next phase. If it doesn't stick yet, you go back rather than forward.
Two structural devices recur throughout. Recurring core laws: five short statements that thread the whole course. You'll meet each one where it first lands, and see it called back in every synthesis lesson.
Progressive diagrams: the first system loop you'll see in Lesson 1 will evolve through every phase, the same shape instantiated at higher layers.
A parallel build track sits alongside the lessons. Fifteen core milestones (numpy first, framework second), plus optional extensions to tier-0 microcontroller and tier-3 distributed inference. Coding skill is not a hard prerequisite for conceptual progress. The builds are depth-by-choice; they make the mechanisms physical for learners who want that.
This is a slower course than the ones that fit in a weekend. Eight months, two to three lessons a week, weekly palace walks, daily flashcard review. What you learn this way will still be there a year from now, when whatever framework is hot today is no longer hot.
You're being taught to think like a systems engineer who happens to work in AI: ask which constraint produced this, ask what the tradeoff was, ask how this would change at a different tier of the spectrum. Those habits transfer across whatever frameworks come and go.
Lesson 1 walks you to the first station on the bench and asks: what is an intelligence system, actually? You'll get a working definition: inputs, internal state, outputs, learning signal. From there the course unrolls, layer by layer, in the order constraints actually produced. The doorway is here.
Figure 0.1 is the schematic pinned beside the doorway. Eight phases laid out as rooms in a single connected building. One dashed path links them in the order you'll walk. The central workbench is the calibration stop you return to between phases. Stations are dots on the room walls; synthesis lessons are the closing walk through each room.
Lesson 1 walks you to the first station on the bench, the reading lamp, where the course's most basic question gets a careful answer: what is an intelligence system, actually?