The AI Agent Stack Map
Every layer of a modern AI agent system, mapped. Eight layers, from the foundation model at the bottom to deployment at the top. This page is updated quarterly and designed to be the canonical reference you can link to when explaining the agent stack.
1. Foundation Models
The large language model that powers an agent's reasoning and generation.
Examples
- Claude Opus 4.7 — Anthropic
- GPT-5 — OpenAI
- Gemini 2.5 — Google
- Llama 4 — Meta
- DeepSeek R1 — DeepSeek
2. Agent Framework
Orchestration libraries that coordinate model calls, tool use, memory, and multi-agent collaboration.
Examples
- Claude Agent SDK — Anthropic
- LangGraph — LangChain
- CrewAI — CrewAI
- AutoGen — Microsoft
- Vercel AI SDK — Vercel
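Whatever the vendor, the core pattern these frameworks share is a loop: call the model, execute any tool it requests, feed the result back, and repeat until the model answers. The sketch below is framework-agnostic; `call_model` and the tool registry are stand-ins, not any vendor's API.

```python
# Minimal agent orchestration loop (framework-agnostic sketch).
# `call_model` is a stub that requests one tool call, then answers.

def call_model(messages):
    if any(m["role"] == "tool" for m in messages):
        return {"type": "answer", "text": "2 + 3 = 5"}
    return {"type": "tool_call", "name": "add", "args": {"a": 2, "b": 3}}

TOOLS = {"add": lambda a, b: a + b}

def run_agent(user_prompt, max_steps=5):
    messages = [{"role": "user", "content": user_prompt}]
    for _ in range(max_steps):
        reply = call_model(messages)
        if reply["type"] == "answer":
            return reply["text"]
        # Execute the requested tool and feed the result back to the model.
        result = TOOLS[reply["name"]](**reply["args"])
        messages.append({"role": "tool", "content": str(result)})
    raise RuntimeError("agent exceeded max_steps")
```

Real frameworks add streaming, retries, and multi-agent routing around this loop, but the control flow is the same.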
3. Tool Use Layer
The protocol and runtime that let agents call external tools. MCP (Model Context Protocol) is the emerging standard.
Examples
- MCP — Anthropic (open)
- OpenAI Function Calling — OpenAI
- Custom tool schemas — Various
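Across all three options, a tool is declared as a JSON-Schema description the model can read, plus a runtime check on the arguments the model proposes. The sketch below is illustrative only; exact field names differ between MCP and OpenAI function calling, and this validator checks required and unknown keys, not types.

```python
# Illustrative tool definition in the JSON-Schema style shared by MCP
# and function calling (field names vary by protocol; this is a sketch).
weather_tool = {
    "name": "get_weather",
    "description": "Return current weather for a city.",
    "input_schema": {
        "type": "object",
        "properties": {
            "city": {"type": "string"},
            "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
        },
        "required": ["city"],
    },
}

def validate_args(tool, args):
    """Check a model-proposed call against the schema (keys only, not types)."""
    schema = tool["input_schema"]
    missing = [k for k in schema["required"] if k not in args]
    unknown = [k for k in args if k not in schema["properties"]]
    return not missing and not unknown
```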
4. Skills & Capabilities
Packaged agent capabilities — browsing, code execution, file ops, memory, vector search — installable as skill bundles.
Examples
- Web Browsing — agentmag
- Code Execution — agentmag
- Memory & Retrieval — agentmag
- Vector Search — agentmag
- Terminal — agentmag
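No standard bundle format exists yet, so the manifest fields and registry below are purely hypothetical; they only show the shape of "installable" capabilities: a manifest declaring tools and permissions, and a registry that flattens installed skills into one tool list for the agent.

```python
# Hypothetical skill-bundle manifest and loader. The fields here are
# illustrative only; no standard format is implied.
SKILL_MANIFEST = {
    "name": "web-browsing",
    "version": "1.0.0",
    "tools": ["fetch_page", "extract_links"],
    "permissions": ["network"],
}

class SkillRegistry:
    def __init__(self):
        self.skills = {}

    def install(self, manifest):
        self.skills[manifest["name"]] = manifest

    def tools(self):
        # Flatten every installed skill's tools into one list for the agent.
        return [t for m in self.skills.values() for t in m["tools"]]
```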
5. Memory Layer
Short-term context plus long-term memory across sessions.
Examples
- Azure Cosmos DB — Microsoft
- MongoDB — MongoDB
- Pinecone — Pinecone
- Weaviate — Weaviate
- pgvector — PostgreSQL extension (open)
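The two tiers can be sketched as a bounded short-term buffer that spills evicted turns into a long-term store searched by cosine similarity. This is a toy: `embed()` is a fake hash-based embedding standing in for a real embedding model, and an in-memory list stands in for pgvector or Pinecone.

```python
import math

def embed(text, dim=8):
    # Fake embedding for illustration: bucket character codes, then normalize.
    vec = [0.0] * dim
    for i, ch in enumerate(text.lower()):
        vec[i % dim] += ord(ch)
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]

class Memory:
    def __init__(self, short_term_limit=4):
        self.short_term = []   # recent turns, kept in the model's context
        self.long_term = []    # (vector, text) pairs, searched on demand
        self.limit = short_term_limit

    def add(self, text):
        self.short_term.append(text)
        if len(self.short_term) > self.limit:
            evicted = self.short_term.pop(0)  # spill oldest turn to long-term
            self.long_term.append((embed(evicted), evicted))

    def recall(self, query, k=1):
        q = embed(query)
        scored = sorted(
            self.long_term,
            key=lambda item: -sum(a * b for a, b in zip(q, item[0])),
        )
        return [text for _, text in scored[:k]]
```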
6. Evaluation Layer
How you measure agent quality — task success rate, cost per task, latency, safety.
Examples
- Braintrust — Braintrust
- LangSmith — LangChain
- Custom harnesses — Various
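A custom harness can be as simple as a loop over a task suite that records the three headline metrics named above. The `agent_fn` signature, the task format, and the flat `cost_per_call` are illustrative assumptions; real harnesses bill per token and run tasks in parallel.

```python
import time

# Sketch of a custom eval harness: run an agent over tasks and report
# task success rate, cost per task, and mean latency.

def evaluate(agent_fn, tasks, cost_per_call=0.01):
    passed, total_cost, latencies = 0, 0.0, []
    for task in tasks:
        start = time.perf_counter()
        answer = agent_fn(task["prompt"])
        latencies.append(time.perf_counter() - start)
        total_cost += cost_per_call
        if task["check"](answer):
            passed += 1
    n = len(tasks)
    return {
        "success_rate": passed / n,
        "cost_per_task": total_cost / n,
        "mean_latency_s": sum(latencies) / n,
    }
```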
7. Observability
Tracing and logging of agent executions — critical for debugging non-deterministic failures.
Examples
- LangSmith — LangChain
- Braintrust — Braintrust
- Sentry — Sentry
- OpenTelemetry — CNCF
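The core idea can be sketched as a decorator that records one span per agent step, capturing the error when a step fails so a non-deterministic run can be replayed. This is a toy in-process recorder, not the OpenTelemetry API; real stacks export spans to a collector.

```python
import functools
import time
import uuid

TRACE = []  # in-process span log; a real system exports these

def traced(step_name):
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            span = {"id": uuid.uuid4().hex, "step": step_name,
                    "start": time.time(), "error": None}
            try:
                return fn(*args, **kwargs)
            except Exception as exc:
                span["error"] = repr(exc)  # capture the failure for replay
                raise
            finally:
                span["end"] = time.time()
                TRACE.append(span)
        return wrapper
    return decorator

@traced("tool_call")
def add_tool(a, b):
    return a + b
```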
8. Deployment
How you run agents in production — serverless, dedicated infra, or vendor platforms.
Examples
- Azure App Service — Microsoft
- Vercel — Vercel
- AWS Bedrock Agents — AWS
- Modal — Modal
- E2B — E2B
Frequently Asked
What is the AI agent stack?
The AI agent stack is the set of layers required to build a production AI agent: a foundation model, an orchestration framework, a tool use layer (often MCP), installable skills, a memory layer, evaluation harnesses, observability, and deployment infrastructure. Each layer has multiple competing vendors and open-source options.
What's different about the agent stack vs a traditional ML stack?
Traditional ML stacks are dominated by training and serving. Agent stacks are dominated by orchestration: the model is a commodity; the hard problems are tool use reliability, multi-step reasoning, memory, and evaluation under non-deterministic outputs. You'll spend more time on the framework, skills, and observability layers than on the model itself.
Do I need all 8 layers?
No. A minimal production agent can run with just a foundation model + a light framework + one or two tools. The full stack becomes necessary as you scale: multi-step reasoning needs observability, persistent context needs memory, quality control needs evaluation. Start with layers 1–3 and add layers as bottlenecks appear.
Is the agent stack stable in 2026?
The foundation model, deployment, and observability layers are stable. The framework, tool use, and skills layers are still churning — expect consolidation over the next 12–18 months. MCP is emerging as a standard for tool use, which should stabilize layer 3 specifically.