====== Multi-Agent Systems ======
**Multi-agent systems** (MAS) organize multiple specialized AI agents — typically powered by large language models — to collaborate on complex tasks through structured coordination patterns. By 2025, multi-agent architectures have become the dominant approach for building sophisticated AI applications, outperforming single-agent systems in parallel processing, adaptability, and collective reasoning.(([[https://www.anthropic.com/engineering/multi-agent-research-system|Anthropic — Multi-Agent Research System]])) Multi-agent coordination enables delegation and orchestration of work across different specialized agents, allowing sophisticated automation of enterprise processes.(([[https://tldr.tech/ai/2026-04-13|TLDR AI (2026]])) Systems enabling multiple AI agents to work together on complex tasks now feature persistent context and sandboxed execution environments, with tools like Subspace providing persistent context across multiple agents with Git workflow integration.(([[https://thecreatorsai.com/p/gpt-55-doubles-the-price-google-goes|Creators' AI (2026]])) **Agent swarms** represent a specialized form of multi-agent coordination where multiple AI agents work together in parallel or sequentially to accomplish complex tasks, with native implementations like Kimi K2.6 enabling orchestrated multi-agent workflows for sophisticated application development and problem-solving.(([[https://superhuman.ai|Superhuman AI (2026]])) A key emerging capability is **AI Agent Management**, a meta-skill where AI systems manage and coordinate other AI systems, as demonstrated in products like Claude Code and OpenCode, where a single managing AI agent supervises multiple specialized sub-agents working in parallel on large-scale projects.(([[https://importai.substack.com/p/import-ai-455-automating-ai-research|Import AI (2026]]))

===== Orchestrator Patterns =====
==== Supervisor (Central Orchestrator) ====
A primary agent acts as a hub, decomposing tasks, routing them to specialized workers, and synthesizing results. This hub-and-spoke model ensures consistency and simple debugging, but can bottleneck at 10-20 agents due to coordination overhead and context concentration.(([[https://galileo.ai/blog/architectures-for-multi-agent-systems|Architectures for Multi-Agent Systems]]))

Variants include the **Single Agent + Dynamic Routing** pattern, where the primary agent invokes specialists via dynamic [[function_calling|function calling]] based on the current task.

==== Peer-to-Peer ====
Agents collaborate directly without a central authority, enabling flexible local optimizations like negotiation. For example, driver agents might share route information in a delivery system. Peer-to-peer supports emergent behaviors but risks inconsistency without oversight.

==== Hierarchical ====
A multi-level structure with supervisor agents at the top, domain agents in the middle, and specialists at the bottom. Combines parallel execution, feedback loops, and shared tools/RAG for knowledge access. The control flow follows: decompose, execute parallel subtasks, gather feedback/refine, aggregate results.

^ Pattern ^ Strengths ^ Weaknesses ^ Best For ^
| Supervisor | Guaranteed consistency, simple debugging | Orchestrator bottleneck, sequential latency | Task routing, research pipelines |
| Peer-to-Peer | Tactical flexibility, high throughput | Inconsistency risk, emergent complexity | Decentralized coordination, negotiation |
| Hierarchical | Parallel execution, scalability, structured feedback | Implementation complexity | Complex decomposable tasks |

===== Recursive Multi-Agent Systems =====
**Recursive Multi-Agent Systems** (RMAS) represent an advanced coordination approach where multiple AI agents exchange information through shared latent recursive computation rather than relying exclusively on natural language text exchanges. This paradigm enables agents to leverage compressed, structured representations of reasoning processes, achieving significant improvements in efficiency, accuracy, and computational resource utilization.(([[https://www.latent.space/p/ainews-ai-engineer-worlds-fair-autoresearch|Latent Space - Recursive Multi-Agent Systems Research (2026]]))

==== Overview and Conceptual Framework ====
Traditional multi-agent systems typically rely on natural language as the primary communication medium between agents, requiring explicit serialization of thoughts, plans, and observations into text format. Recursive Multi-Agent Systems extend this model by allowing agents to share intermediate computational representations—latent vectors, abstract reasoning states, or recursive computation graphs—that encode information more compactly than natural language while preserving semantic content.

The recursive nature of this coordination enables hierarchical decomposition of complex tasks, where higher-level agents can delegate sub-tasks to specialized agents while maintaining access to shared latent representations that bypass the need for full natural language serialization. This approach borrows principles from //functional programming recursion//, //hierarchical reinforcement learning//, and //neural architecture design// to create communication mechanisms that are both computationally efficient and semantically rich.

==== Technical Implementation and Mechanisms ====
Recursive Multi-Agent Systems operate through several key technical components. First, agents utilize a **shared latent space** where intermediate representations can be exchanged without serialization overhead. Rather than Agent A generating text describing an observation and Agent B parsing that text, both agents can directly operate on compressed vector representations or abstract computation states that capture the same information more efficiently.

Second, the recursive computation framework enables agents to call sub-routines or spawn child agents while maintaining contextual state through the recursion stack. This allows for **hierarchical task decomposition** where complex problems are broken into recursive sub-problems, with each level of recursion potentially involving different agents or computational strategies. The latent representations passed between recursion levels encode task context, partial solutions, and constraints more efficiently than natural language summaries.

Third, coordination mechanisms implement **constraint satisfaction and consensus protocols** operating directly on latent representations rather than negotiating through natural language dialogue. Agents can evaluate compatibility of solutions and coordinate actions through direct computation on shared latent states, enabling more efficient consensus and faster consensus convergence compared to text-based negotiation.

===== See Also =====
  * [[how_to_build_a_multi_agent_system|How to Build a Multi-Agent System]]
  * [[agent_collabs|Agent Collabs]]
  * [[agent_swarms|Agent Swarms]]
  * [[agentverse|AgentVerse: Facilitating Multi-Agent Collaboration]]
  * [[single_agent_architecture|Single Agent Architecture: Design Patterns for Solo AI Agents]]

===== References =====