====== Agentic Uncertainty ======

Uncertainty quantification (UQ) in LLM agents is a critical and underexplored challenge. Unlike single-turn question answering, agentic workflows involve sequential decisions where errors compound -- each step's uncertainty inherits from and amplifies prior steps. This page covers the "Spiral of Hallucination" phenomenon and the UProp framework for principled uncertainty propagation.

===== The Compounding Problem =====

In single-turn LLM interactions, uncertainty is localized to the current generation. In agentic systems, the agent operates in a partially observable environment where:

  * The true environment state $s_t$ is latent
  * The agent maintains a belief state $b_t(s_t) = P(s_t | h_t)$ based on history $h_t$
  * Internal cognitive errors (hallucinations, logic gaps, memory failures) become external constraints for future actions
  * Small early errors can cascade into catastrophic failures across the planning horizon

===== Spiral of Hallucination =====

The **Spiral of Hallucination** (arXiv:2601.15703) identifies how early epistemic errors in LLM agents propagate irreversibly through the context window, creating a self-reinforcing cycle of degraded reasoning.
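To make the cascade concrete, here is a minimal illustrative simulation (not taken from the paper): each step's error rate inherits and amplifies the accumulated error of prior steps, so trajectory-level reliability decays much faster than independent per-step errors would predict. The `amplification` parameter is a hypothetical knob standing in for how strongly a corrupted context degrades later steps.

```python
def trajectory_success(n_steps, base_error=0.05, amplification=1.0):
    """Probability that every step in a trajectory succeeds when each
    step's error rate is inflated by the errors inherited from prior steps.

    amplification == 1.0 models independent per-step errors;
    amplification  > 1.0 models the spiral: inherited error compounds.
    """
    p_success = 1.0
    error = base_error
    for _ in range(n_steps):
        p_success *= (1.0 - error)
        # Inherited uncertainty: the next step's error rate grows with
        # everything that has already gone into the context.
        error = min(1.0, error * amplification)
    return p_success

# Illustrative comparison over a 10-step horizon:
#   trajectory_success(10, 0.05, 1.0)  ≈ 0.60  (independent errors)
#   trajectory_success(10, 0.05, 1.3)  ≈ 0.07  (compounding spiral)
```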
=== Key Findings ===

  * Early errors (hallucinations, logic gaps, tool misinterpretation) propagate through the agent's context, biasing all subsequent reasoning steps
  * Unlike single-step generation, agentic workflows suffer from **state uncertainty** -- the agent's internal belief diverges from reality, and this divergence compounds
  * Self-reflection alone leads to "aimless corrections" that fail to halt the spiral
  * Existing UQ methods act passively (diagnosing risk without intervention), which is insufficient for agentic settings

=== POMDP Formulation ===

The paper models agent reliability failures as a Partially Observable Markov Decision Process (POMDP), distinguishing:

  * **Epistemic uncertainty**: Reducible via better reasoning or knowledge -- the agent's ignorance about the true state
  * **Aleatoric uncertainty**: Irreducible randomness inherent in the environment

=== Dual-Process UQ Framework ===

Inspired by dual-process theory from cognitive science:

  * **System 1 (fast, forward propagation)**: Memory-aware uncertainty propagation that prevents error spread at each step -- a proactive "immune system" for the agent's reasoning chain
  * **System 2 (slow, inverse correction)**: Reflective calibration that detects deviations and applies targeted corrections before errors solidify into the context

This bridges passive sensing (traditional UQ) with active reasoning (intervention), improving long-horizon task performance, calibration, and self-awareness.

===== UProp: Information-Theoretic Uncertainty Propagation =====

**UProp** (Duan et al., 2025) introduces a principled, information-theoretic framework for decomposing uncertainty in sequential agent decisions.
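One way to read the dual-process framework operationally is as a control loop: System 1 attaches an uncertainty estimate to every forward step, and System 2 is invoked only when that estimate signals a deviation, before the action is committed to the context. The sketch below assumes a hypothetical `agent` interface (`step` and `reflect` are illustrative names, not an API from the paper).

```python
def run_agent(agent, task, max_steps=20, deviation_threshold=0.7):
    """Dual-process control loop sketch.

    Assumes a hypothetical `agent` object exposing:
      - step(task, context) -> (action, uncertainty)   # System 1: fast step + UQ
      - reflect(context, action) -> corrected_action   # System 2: slow correction
    """
    context = []
    for _ in range(max_steps):
        # System 1: forward propagation -- every step carries an
        # uncertainty estimate alongside the proposed action.
        action, uncertainty = agent.step(task, context)
        # System 2: inverse correction, triggered only when propagated
        # uncertainty suggests the belief state has deviated.
        if uncertainty > deviation_threshold:
            action = agent.reflect(context, action)
        # Anything appended here solidifies into the context for all
        # later steps -- hence correction must happen before this point.
        context.append(action)
        if action == "DONE":
            break
    return context
```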
=== Uncertainty Decomposition ===

UProp decomposes LLM sequential decision uncertainty into two parts:

  * **Internal uncertainty**: Intrinsic to the current decision -- what existing single-turn UQ methods measure (e.g., token entropy, sampling variance)
  * **Extrinsic uncertainty**: A Mutual Information (MI) quantity $I(d_t; d_{<t})$ between the current decision $d_t$ and the preceding decisions $d_{<t}$ -- the uncertainty the current step inherits from the trajectory so far

The decomposition can be sketched in code as follows; `llm.sample_decisions` is an assumed interface that returns a probability distribution over candidate decisions:

<code python>
import numpy as np

class UncertaintyPropagator:
    """UProp-style uncertainty estimation for agentic decisions."""

    def __init__(self, llm, n_trajectories=10):
        self.llm = llm
        self.n_traj = n_trajectories

    def estimate_uncertainty(self, history, current_query):
        """Decompose uncertainty into internal + extrinsic components."""
        # Internal: uncertainty of the current decision in isolation.
        p_marginal = self.llm.sample_decisions(current_query, n=self.n_traj)
        internal = self._entropy(p_marginal)

        # Extrinsic: pointwise MI between the current decision and the
        # trajectory history. Averaging log-ratios under the conditional
        # gives KL(p(d_t | history) || p(d_t)), the per-history
        # contribution to I(d_t; d_{<t}).
        p_conditional = self.llm.sample_decisions(
            current_query, context=history, n=self.n_traj
        )
        pmi = np.log((p_conditional + 1e-10) / (p_marginal + 1e-10))
        extrinsic = float(np.sum(p_conditional * pmi))

        return {"internal": internal,
                "extrinsic": extrinsic,
                "total": internal + extrinsic}

    def should_intervene(self, uncertainty, threshold=0.7):
        """Dual-process: trigger System 2 correction if extrinsic
        uncertainty exceeds the threshold."""
        return uncertainty["extrinsic"] > threshold

    @staticmethod
    def _entropy(probs):
        return float(-np.sum(probs * np.log(probs + 1e-10)))
</code>

===== How Confidence Degrades Across Agent Steps =====

The degradation of agent confidence across sequential steps follows a characteristic pattern:

  - **Steps 1-2**: Uncertainty is primarily internal; the agent operates near its single-turn accuracy
  - **Steps 3-5**: Extrinsic uncertainty begins to accumulate as earlier decisions constrain the solution space
  - **Step 5+**: The spiral effect -- if early errors exist, belief-state divergence accelerates non-linearly
  - **Long horizon**: Without intervention, total uncertainty can exceed the threshold for reliable operation

The key insight is that //total agent uncertainty is not simply the sum of per-step uncertainties// -- the extrinsic (inherited) component introduces multiplicative compounding effects.

===== Mathematical Framework =====

For a sequence of agent decisions $d_1, d_2, \ldots, d_T$:

$U_{\text{total}}(d_t) = \underbrace{H(d_t)}_{\text{internal}} + \underbrace{I(d_t; d_{<t})}_{\text{extrinsic}}$
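As a worked numeric check of this decomposition, consider a toy joint distribution over a binary previous decision $d_{<t}$ and a binary current decision $d_t$ (values chosen purely for illustration, not from the paper). The marginal entropy $H(d_t)$ gives the internal term, and $I(d_t; d_{<t})$ computed from the joint table gives the extrinsic term:

```python
import numpy as np

def entropy(p):
    """Shannon entropy in nats, ignoring zero-probability outcomes."""
    p = np.asarray(p, dtype=float)
    p = p[p > 0]
    return float(-np.sum(p * np.log(p)))

def mutual_information(joint):
    """I(X; Y) = H(X) + H(Y) - H(X, Y) from a joint table P(x, y)."""
    joint = np.asarray(joint, dtype=float)
    px = joint.sum(axis=1)
    py = joint.sum(axis=0)
    return entropy(px) + entropy(py) - entropy(joint.ravel())

# Toy joint over (d_{<t}, d_t): current decision correlates with the
# previous one, so some of its uncertainty is inherited.
joint = np.array([[0.4, 0.1],
                  [0.1, 0.4]])

internal = entropy(joint.sum(axis=0))    # H(d_t): single-turn entropy
extrinsic = mutual_information(joint)    # I(d_t; d_{<t}): inherited part
total = internal + extrinsic             # U_total(d_t)
```

Here `internal` is $\ln 2 \approx 0.693$ nats (a uniform marginal), while the correlation with the previous decision contributes a strictly positive extrinsic term; with independent decisions the MI, and hence the extrinsic term, would be zero.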