====== Knowledge Graph World Models: AriGraph ======

Agents operating in partially observable environments need structured memory to reason about unseen states and plan effective actions. **AriGraph** (IJCAI 2025) introduces a knowledge graph world model that LLM agents construct dynamically during exploration, integrating semantic and episodic memories into a queryable graph structure that substantially improves reasoning, planning, and decision-making.

===== The Memory Problem in Agent Systems =====

Traditional approaches to agent memory fall short in complex environments:

  * **Full history:** Stores all observations but quickly overflows context windows and buries relevant information
  * **Summarization:** Compresses history but loses structural relationships and spatial information
  * **Fixed knowledge bases:** Provide static information but cannot adapt to discovered environment structure

AriGraph addresses these shortcomings by maintaining a **dynamically growing knowledge graph** $G = (V, E)$ where nodes $V$ represent entities (objects, locations, characters) and edges $E$ represent relationships (spatial, functional, causal) discovered through exploration.

===== Dual Memory Architecture =====

AriGraph integrates two complementary memory types, inspired by cognitive science:

**Episodic Memory:** Stores specific events and observations from agent interactions. Each exploration step generates episodic entries that capture what the agent saw, did, and experienced at a particular moment.

**Semantic Memory:** Accumulates general knowledge derived from episodic experiences. Over time, repeated observations about object properties, spatial layouts, and functional relationships are abstracted into stable semantic knowledge.
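The episodic-to-semantic abstraction can be sketched as follows. This is a minimal illustration, not AriGraph's actual extraction pipeline: the record fields, the triple format, and the repetition threshold of 2 are all assumptions made here for clarity.

```python
from collections import Counter

# Episodic memory: raw, timestamped observations (hypothetical records).
episodic_memory = [
    {"timestep": 0, "fact": ("key", "located_in", "kitchen")},
    {"timestep": 1, "fact": ("door", "state", "locked")},
    {"timestep": 2, "fact": ("key", "located_in", "kitchen")},
]

def abstract_semantic(episodes, min_support=2):
    """Promote facts observed repeatedly into stable semantic memory."""
    counts = Counter(e["fact"] for e in episodes)
    return {fact for fact, n in counts.items() if n >= min_support}

semantic_memory = abstract_semantic(episodic_memory)
print(semantic_memory)  # {('key', 'located_in', 'kitchen')}
```

Only the repeated observation about the key survives abstraction; the single sighting of the locked door remains purely episodic until corroborated.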
The relationship between the two can be expressed as:

$$M_{\text{semantic}} = \text{Abstract}(\{m_1, m_2, \ldots, m_t\}_{\text{episodic}})$$

Semantic memory builds on episodic memory, creating a structured base for **associative recall** that supports long-term knowledge accumulation beyond what unstructured methods can achieve.

===== Graph Construction and Update =====

The Ariadne agent processes local observations from the environment and incrementally builds the graph:

**Growth Phase:** During initial exploration, new observations rapidly add nodes and edges as the agent discovers new locations, objects, and relationships.

**Stabilization Phase:** As the agent becomes familiar with the environment, graph growth flattens -- an indicator of effective generalization rather than redundant storage.

**Cleaning Phase:** Pruning mechanisms remove redundant or contradictory entries to maintain graph quality.

The graph update at each timestep $t$ is:

$$G_{t+1} = \text{Clean}(G_t \cup \text{Extract}(o_t))$$

where $o_t$ is the observation at time $t$ and $\text{Extract}$ identifies new entities and relations.

===== Retrieval-Planning-Decision Loop =====

AriGraph serves as the core memory component in a cognitive loop:

  - **Retrieval:** Given the current state, relevant subgraphs are extracted from AriGraph
  - **Planning:** The LLM uses retrieved knowledge to generate action plans, leveraging multi-hop graph traversal for spatial reasoning
  - **Decision:** The agent selects and executes an action based on the plan
  - **Update:** New observations update the graph, closing the loop

This enables efficient **multi-hop inference** -- tracing paths between locations and objects for planning without needing the full observation history.
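As an illustration of multi-hop inference, the sketch below runs a breadth-first search over relation triples to trace a path between two entities within a hop budget. The triples, relation names, and hop limit are hypothetical; AriGraph's actual retrieval is LLM-guided rather than a plain BFS.

```python
from collections import deque

def find_path(edges, start, goal, max_hops=3):
    """BFS over (subject, relation, object) triples; spatial relations
    are treated as bidirectional for traversal."""
    adj = {}
    for s, r, o in edges:
        adj.setdefault(s, []).append(o)
        adj.setdefault(o, []).append(s)
    queue = deque([[start]])
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        if len(path) > max_hops:  # path already uses up the hop budget
            continue
        for nxt in adj.get(path[-1], []):
            if nxt not in path:  # avoid cycles
                queue.append(path + [nxt])
    return None  # goal unreachable within max_hops

edges = [("hall", "leads_to", "kitchen"), ("kitchen", "leads_to", "pantry")]
print(find_path(edges, "hall", "pantry"))  # ['hall', 'kitchen', 'pantry']
```

Because BFS explores shortest paths first, the returned path is a minimal-hop route, which is exactly what a planner needs when turning graph knowledge into a sequence of navigation actions.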
===== Code Example: Knowledge Graph World Model =====

A condensed sketch of the memory component. Helper methods such as ''extract_entities'' and ''generalize'' are LLM-backed in AriGraph and left abstract here.

<code python>
class AriGraph:
    def __init__(self):
        self.nodes = {}             # V: entity id -> entity node
        self.edges = []             # E: relations between entities
        self.episodic_memory = []   # raw per-step records
        self.semantic_memory = {}   # generalized knowledge per entity

    def update(self, observation, action, timestep):
        """Record the episode, then grow and clean the graph."""
        self.episodic_memory.append({
            "observation": observation,
            "action": action,
            "timestep": timestep,
        })
        entities = self.extract_entities(observation)
        relations = self.extract_relations(observation)
        for entity in entities:
            if entity.id not in self.nodes:
                self.nodes[entity.id] = entity
            else:
                self.nodes[entity.id].update_properties(entity)
        for relation in relations:
            self.edges.append(relation)
        self.abstract_semantic_knowledge()
        self.clean_redundancies()

    def retrieve_relevant_subgraph(self, current_state, goal):
        relevant_nodes = self.find_related_nodes(current_state, goal)
        return self.extract_subgraph(relevant_nodes, max_hops=3)

    def plan_with_graph(self, current_state, goal, llm):
        subgraph = self.retrieve_relevant_subgraph(current_state, goal)
        path = self.find_path(current_state.location, goal.location)
        prompt = self.build_planning_prompt(
            current=current_state,
            goal=goal,
            knowledge=subgraph,
            path=path,
        )
        return llm.generate_plan(prompt)

    def abstract_semantic_knowledge(self):
        """Promote repeatedly observed entities into semantic memory."""
        entity_observations = {}
        for episode in self.episodic_memory:
            for entity in self.extract_entities(episode["observation"]):
                entity_observations.setdefault(entity.id, []).append(entity)
        for eid, observations in entity_observations.items():
            if len(observations) >= 3:
                self.semantic_memory[eid] = self.generalize(observations)
</code>

===== Evaluation Results =====

AriGraph is evaluated on interactive text games (TextWorld environments) and static multi-hop QA:

**Text Games:** AriGraph agents outperform both memory baselines (full history, summarization) and RL-based methods in complex games requiring spatial reasoning and object manipulation. Graph quality improves over time as the agent explores.
**Multi-Hop QA:** Achieves **68.6%** accuracy with GPT-4o-mini, competitive with dedicated knowledge graph methods while being fully dynamic (no pre-built knowledge base required).

Key findings:

  * Structured graph memory enables faster task completion than unstructured alternatives
  * Graph growth curves correlate with agent competence -- stabilization indicates mastery
  * Semantic memory significantly improves performance over episodic-only approaches
  * The approach scales to increasing task complexity (more objects, locations, and relationships)

===== Agent Cognitive Architecture Diagram =====

<code>
flowchart TD
    A[Environment] --> B[Observation]
    B --> C[Entity & Relation Extraction]
    C --> D[AriGraph Update]
    D --> E[Episodic Memory]
    D --> F[Semantic Memory]
    E --> F
    G[Current State + Goal] --> H[Subgraph Retrieval]
    F --> H
    E --> H
    H --> I[Multi-Hop Reasoning]
    I --> J[LLM Planning]
    J --> K[Action Selection]
    K --> L[Environment Step]
    L --> B
</code>

===== Connections to Cognitive Science =====

AriGraph's design draws explicitly from theories of human memory:

  * **Episodic-Semantic Distinction** (Tulving, 1972): Separate storage for specific events vs. general knowledge, with semantic memory emerging from episodic experiences
  * **Associative Recall:** Graph structure enables retrieval by association (following edges) rather than sequential search through history
  * **Schema Theory:** Semantic memory nodes function as schemas that organize new observations and guide expectations
  * **Spatial Cognition:** The graph naturally represents cognitive maps that agents use for navigation and spatial reasoning

===== References =====

  * [[https://arxiv.org/abs/2407.04363|AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents (arXiv:2407.04363)]]
  * [[https://www.ijcai.org/proceedings/2025/2|IJCAI 2025 Proceedings]]

===== See Also =====

  * [[agent_rl_training|Agent RL Training: Agent-R1 and RAGEN]]
  * [[story_generation_agents|Story Generation Agents: StoryWriter]]
  * [[causal_reasoning_agents|Causal Reasoning Agents: Causal-Copilot]]
  * [[data_science_agents|Data Science Agents: DatawiseAgent]]