This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| single_vs_multi_agent [2026/03/25 15:41] – Move from architecture: namespace to flat agent | single_vs_multi_agent [2026/03/31 01:26] (current) – SEO: add target search terms agent | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Single vs Multi-Agent Architectures ====== | ====== Single vs Multi-Agent Architectures ====== | ||
| - | Choosing between single-agent and multi-agent architectures is a critical design decision that impacts cost, latency, reliability, | + | Choosing between single-agent and multi-agent architectures is a critical design decision that impacts cost, latency, reliability, |
| ===== Overview ===== | ===== Overview ===== | ||
| * **Single Agent** — One LLM-powered agent handles the entire workflow with access to all tools and context. Simple, fast, predictable. | * **Single Agent** — One LLM-powered agent handles the entire workflow with access to all tools and context. Simple, fast, predictable. | ||
| - | * **Orchestrator + Specialists** — A central coordinator delegates subtasks to domain-specific agents. Modular, scalable, but adds coordination overhead. | + | * **Orchestrator + Specialists(([[https:// |
| * **Peer-to-Peer** — Autonomous agents collaborate via message passing or shared memory. Maximum parallelism, | * **Peer-to-Peer** — Autonomous agents collaborate via message passing or shared memory. Maximum parallelism, | ||
| Line 19: | Line 19: | ||
| C -->|No| F{Context window\nsufficient? | C -->|No| F{Context window\nsufficient? | ||
| F -->|Yes| E | F -->|Yes| E | ||
| - | F -->|No| G[Orchestrator + Specialists] | + | F -->|No| G[Orchestrator + Specialists(([[https:// |
| D -->|Yes| H{Need real-time\nparallelism? | D -->|Yes| H{Need real-time\nparallelism? | ||
| D -->|No| G | D -->|No| G | ||
| Line 36: | Line 36: | ||
| ===== Architecture Comparison ===== | ===== Architecture Comparison ===== | ||
| - | ^ Factor ^ Single Agent ^ Orchestrator + Specialists ^ Peer-to-Peer ^ | + | ^ Factor ^ Single Agent ^ Orchestrator + Specialists(([[https:// |
| | **Complexity** | Low | Medium | High | | | **Complexity** | Low | Medium | High | | ||
| | **Latency** | Lowest (1 LLM call) | Medium (2-5 LLM calls) | Variable (parallel) | | | **Latency** | Lowest (1 LLM call) | Medium (2-5 LLM calls) | Variable (parallel) | | ||
| Line 85: | Line 85: | ||
| **Weaknesses**: | **Weaknesses**: | ||
| - | === Orchestrator + Specialists === | + | === Orchestrator + Specialists(([[https:// |
| A coordinator routes tasks to domain experts. Most common multi-agent pattern. | A coordinator routes tasks to domain experts. Most common multi-agent pattern. | ||
| Line 149: | Line 149: | ||
| ^ Framework ^ Pattern ^ Best For ^ Key Feature ^ | ^ Framework ^ Pattern ^ Best For ^ Key Feature ^ | ||
| | **LangGraph** | Orchestrator | Stateful multi-step workflows | Graph-based state machines | | | **LangGraph** | Orchestrator | Stateful multi-step workflows | Graph-based state machines | | ||
| - | | **CrewAI** | Orchestrator + Specialists | Role-based team workflows | Agent roles and delegation | | + | | **CrewAI** | Orchestrator + Specialists(([[https:// |
| | **AutoGen** | Peer-to-Peer | Research and collaborative tasks | Dynamic agent conversations | | | **AutoGen** | Peer-to-Peer | Research and collaborative tasks | Dynamic agent conversations | | ||
| | **OpenAI Swarm** | Orchestrator | Lightweight agent handoffs | Minimal coordination overhead | | | **OpenAI Swarm** | Orchestrator | Lightweight agent handoffs | Minimal coordination overhead | | ||
| Line 171: | Line 171: | ||
| From published 2025-2026 research: | From published 2025-2026 research: | ||
| - | * Multi-agent coding systems score **72.2% on SWE-bench Verified** vs ~65% for single agents using the same base model | + | * Multi-agent coding systems score **72.2% on SWE-bench Verified** vs ~65% for single agents using the same base model((([[https:// |
| - | * Multi-agent systems show **23% higher accuracy** on complex reasoning tasks | + | * Multi-agent systems show **23% higher accuracy** on complex reasoning tasks(([[https:// |
| * Multi-agent throughput improvement of **up to 64%** for parallelizable workloads | * Multi-agent throughput improvement of **up to 64%** for parallelizable workloads | ||
| * Multi-agent token cost is **10-15x higher** for complex requests | * Multi-agent token cost is **10-15x higher** for complex requests | ||
| Line 181: | Line 181: | ||
| - **Default to single agent**. It covers 80% of use cases with lower cost and complexity. | - **Default to single agent**. It covers 80% of use cases with lower cost and complexity. | ||
| - **Add agents for specialization**, | - **Add agents for specialization**, | ||
| - | - **Orchestrator + Specialists** is the most practical multi-agent pattern for production. | + | - **Orchestrator + Specialists(([[https:// |
| - **Peer-to-peer** is rarely needed outside high-throughput parallel processing. | - **Peer-to-peer** is rarely needed outside high-throughput parallel processing. | ||
| - **Measure the tradeoff**: multi-agent gains 7-23% accuracy but costs 3-15x more tokens. | - **Measure the tradeoff**: multi-agent gains 7-23% accuracy but costs 3-15x more tokens. | ||
| - | |||
| - | ===== References ===== | ||
| - | |||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| ===== See Also ===== | ===== See Also ===== | ||
| Line 198: | Line 190: | ||
| * [[how_to_structure_system_prompts|How to Structure System Prompts]] — Prompt design for agents | * [[how_to_structure_system_prompts|How to Structure System Prompts]] — Prompt design for agents | ||
| * [[how_to_choose_chunk_size|How to Choose Chunk Size]] — RAG optimization | * [[how_to_choose_chunk_size|How to Choose Chunk Size]] — RAG optimization | ||
| + | |||
| + | ===== References ===== | ||