The Agent as a System
When most people hear the phrase 'AI agent,' they picture a very smart chatbot — a language model that answers questions. That picture is incomplete. A modern AI agent is a system: a collection of distinct components that work together to perceive a goal, plan a course of action, invoke external capabilities, remember what it has done, and iterate until the goal is met. The language model sits at the center of that system, but the system is far more than the model alone.
Consider a concrete example. You ask an AI research agent: 'Find the three most-cited papers on transformer attention published in the last two years, summarize each one, and save the summaries to a file.' To complete that task the system must search a database, read full-text documents, synthesize information across multiple sources, write formatted summaries, and call a file-write API. No single forward pass through a language model can do all of that. Instead, a loop of reasoning, tool calls, and memory operations runs repeatedly until every sub-task is complete.
A language model is a function that maps a prompt to a completion. An agent is a system that uses a language model as its reasoning core but wraps it in planning, tool access, and memory to take multi-step action in the world. Conflating the two is the single most common misconception in AI discussions.
Five Components of the Modern Agent Stack
Researchers and engineers have converged on a rough consensus about what a general-purpose agent contains. The five canonical components are: (1) the LLM reasoning core, which does the language understanding and generation; (2) the planner, which decomposes a high-level goal into an ordered sequence of concrete actions; (3) the tool layer, which connects the agent to external capabilities like search, code execution, databases, and APIs; (4) the memory system, which stores context, past actions, retrieved facts, and long-term knowledge; and (5) the orchestration loop, which drives the entire cycle — prompting the model, executing the action it chooses, feeding results back in, and repeating until done. Each of these five components has its own architecture, failure modes, and design tradeoffs. Modules H1-02 through H1-06 examine each in turn.
Match each agent component to its primary responsibility.
Terms
Definitions
Drag terms onto their definitions, or click a term then click a definition to match.
Why the Systems View Matters
Understanding agents as systems rather than as clever prompts is practically important because it changes where you look when something goes wrong and what you change when you need improvement. If a research agent produces outdated summaries, the fault might be in the retrieval tool (wrong database, stale index), the memory system (results from a previous task bleeding in), the planner (wrong decomposition of the search step), or the model itself (poor synthesis). Treating the whole thing as a black-box model makes diagnosis impossible. Treating it as a system with distinct components makes every failure traceable.
This systems perspective also clarifies capability boundaries. An LLM on its own has no ability to browse the web, execute code, or write a file — it can only generate text. An agent with a web-search tool and a code-execution sandbox can autonomously research and verify. The same model, radically different capability, achieved entirely through architecture rather than through changes to the model weights. This is why agent engineering has become a distinct and valuable discipline alongside model training.
Agents are also called autonomous agents, LLM agents, or AI agents interchangeably in the literature. The orchestration loop is sometimes called the agent loop or the ReAct loop. The tool layer is sometimes called the action space or the function-call layer. These terms all refer to the same structural ideas.
A company replaces their language model with a newer version but keeps the same planner, tools, and orchestration loop. Which statement is most accurate?
An agent tasked with booking a flight searches for flights, selects one, fills in a payment form, and confirms the booking. Which agent component is primarily responsible for the sequence 'search, select, fill, confirm'?
Agent Anatomy Sketch
- Choose a real or hypothetical AI agent use-case — for example, an agent that monitors your email, drafts replies, and archives threads; or one that tracks a stock portfolio and alerts you to anomalies.
- Step 1: Write one sentence describing what the agent's goal is.
- Step 2: For each of the five components (LLM core, planner, tools, memory, orchestration loop), write one sentence describing what that component does specifically in your chosen agent.
- Step 3: Identify one place where your agent could fail. Which component does the failure belong to, and why?
- Step 4: Compare your sketch with a partner. Did they identify a failure mode you missed?
- Objective: practice mapping the five-component architecture onto a real use-case before you study each component in depth.