This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| budget_aware_reasoning [2026/03/25 14:55] – Create page: Budget-aware reasoning with LLMs agent | budget_aware_reasoning [2026/03/30 22:35] (current) – Restructure: footnotes as references agent | ||
|---|---|---|---|
| Line 5: | Line 5: | ||
| ===== Overview ===== | ===== Overview ===== | ||
| - | Large language models generate increasingly long reasoning traces (Chain-of-Thought, | + | Large language models generate increasingly long reasoning traces (Chain-of-Thought, |
| ===== Budget-Aware Value Tree Search ===== | ===== Budget-Aware Value Tree Search ===== | ||
| Line 27: | Line 27: | ||
| ===== Token-Budget Estimation ===== | ===== Token-Budget Estimation ===== | ||
| - | The TALE framework estimates per-problem token budgets based on reasoning complexity: | + | The TALE framework estimates per-problem token budgets based on reasoning complexity:((https:// |
| < | < | ||
| Line 35: | Line 35: | ||
| ===== Anytime Reasoning with Early Stopping ===== | ===== Anytime Reasoning with Early Stopping ===== | ||
| - | Anytime reasoning produces progressively improving solutions as more tokens are generated, enabling early termination when quality is sufficient or budget is exhausted. | + | Anytime reasoning produces progressively improving solutions as more tokens are generated, enabling early termination when quality is sufficient or budget is exhausted.((https:// |
| **Anytime Index**: Quantifies quality improvement per added token: | **Anytime Index**: Quantifies quality improvement per added token: | ||
| Line 154: | Line 154: | ||
| | TALE token-budget | Good (per-problem) | Slight accuracy drop | Low | | | TALE token-budget | Good (per-problem) | Slight accuracy drop | Low | | ||
| | Anytime + early stop | Best overall | Progressive improvement | Medium | | | Anytime + early stop | Best overall | Progressive improvement | Medium | | ||
| - | |||
| - | ===== References ===== | ||
| - | |||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| ===== See Also ===== | ===== See Also ===== | ||
| Line 166: | Line 160: | ||
| * [[multi_hop_qa_agents|Multi-Hop QA Agents]] | * [[multi_hop_qa_agents|Multi-Hop QA Agents]] | ||
| * [[financial_trading_agents|Financial Trading Agents]] | * [[financial_trading_agents|Financial Trading Agents]] | ||
| + | |||
| + | ===== References ===== | ||