How to build an autonomous, interoperable, production-ready agentic AI platform. 19 sessions. Three protocol layers. One integrated platform.
Click any session to explore topics and assignments
Understand neural networks, attention mechanisms, and positional encoding that power modern AI. You can't debug what you don't understand.
Chrome extension that interacts with an LLM API.
Learn tokenization, scaling laws, RLHF alignment, and the current state of reasoning and multi-modal models.
Enhanced Chrome extension using Gemini Flash / Claude Haiku with streaming.
Build Python and Node.js skills, then create your first goal-directed agent with a working web UI.
Full-stack agent with a working web UI — backend in Python, frontend in Node.js/React. Agent takes a goal and executes it with at least one tool.
Master the Model Context Protocol for tool registration, discovery, and invocation across servers.
Build a custom MCP server wrapping a real-world API (weather, stocks, email, calendar — each student picks different). Connect it to your agent from Session 3.
Implement Chain-of-Thought, ReACT patterns, and self-validating task decomposition for intelligent agents.
Multi-step agent that decomposes a complex goal, plans tool usage, executes, and self-validates results.
Design the 4-layer cognitive pipeline with strategy profiles and adaptive retry loops.
4-module agent with user preference input, strategy selection, and adaptive retry across different problem types.
Build 3-tier memory (preferences, episodic, factual) with hybrid retrieval and semantic chunking.
Agent with 3-tier memory that learns preferences, recalls past workflows, and retrieves facts. Demonstrate persistence across sessions.
Coordinate multiple agents using directed acyclic graphs with parallel execution and session persistence.
Multi-agent DAG executor with 3+ agent types supporting parallel execution, session persistence, and resumption after interruption.
Automate web browsing with Playwright, vision-capable navigation, and multi-source research pipelines.
Agent that autonomously researches a topic across 5+ sources, extracts structured data, compares results, and generates a synthesis report.
Control desktop applications using screen understanding, accessibility trees, and OS-level automation.
Agent that operates a desktop application to complete a real task using vision + accessibility tree, not just scripted clicks.
Connect agents to WhatsApp, Slack, Discord, voice, and 20+ channels through a unified adapter pattern.
Each student/group picks a different channel to integrate. Build a complete adapter that connects to your agent pipeline with message ingress, formatting, and reply routing.
Implement circuit breakers, JSON repair, and Docker-based sandboxing for safe agent execution.
Container-isolated agent execution with circuit breaker. Demonstrate that a misbehaving agent cannot access the host system.
Enable cross-vendor agent collaboration using Google's Agent2Agent protocol with capability discovery and task delegation.
Build an A2A-compliant agent discovered and invoked by other students' agents. Demonstrate cross-agent task delegation to at least 2 other agents.
Build agents that generate dynamic, interactive UIs at runtime using declarative and event-based protocols.
Agent that generates dynamic, interactive UIs — e.g., custom dashboards or comparison tables with interactive filters using A2UI or AG-UI protocol.
Implement intelligent multi-model routing with cost tracking, budget controls, and OpenTelemetry instrumentation.
Intelligent model router with cost dashboard. Auto-select between 3+ models based on task complexity and demonstrate cost savings vs. always-using-frontier.
Shift from reactive to proactive agents that monitor event streams, evaluate relevance, and act autonomously.
Agent monitoring real event streams (GitHub webhooks, email, or custom) acting autonomously for at least 1 hour with a human-reviewable audit log.
Build coding agents with System 2 reasoning, codebase navigation, and markdown-driven skill injection.
Coding agent that reads a codebase, identifies a bug, generates a fix, runs tests, and iterates until tests pass using System 2 reasoning and SKILL.md.
Design custom eval harnesses, run GAIA/SWE-bench benchmarks, and prepare capstone proposals.
Custom eval harness with 20+ test cases, automated scoring, regression detection, and a report comparing two configurations. Plus: capstone proposal draft.
Integrate all three protocol layers (MCP + A2A + A2UI) into a production-ready agentic platform.
Finalize capstone with GitHub Projects plan, or submit a PR to Arcturus 2.0 implementing a course concept. 2-minute lightning preview of capstone idea.
Student presentations and the beginning of the 4-week capstone execution window.
By Session 19, every student will be able to: