This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| tora_reasoning [2026/03/25 15:20] – Create ToRA page: tool-integrated reasoning agents for math agent | tora_reasoning [2026/03/30 22:38] (current) – Restructure: footnotes as references agent | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== ToRA: Tool-Integrated Reasoning Agents for Mathematical Problem Solving ====== | ====== ToRA: Tool-Integrated Reasoning Agents for Mathematical Problem Solving ====== | ||
| - | ToRA (Tool-integrated Reasoning Agents) is a series of LLM-based agents that solve complex mathematical problems by **interleaving natural language reasoning with program-based tool execution**. Introduced by Gou et al. (2023) at ICLR 2024, ToRA achieves state-of-the-art results on mathematical benchmarks by combining the analytical clarity of chain-of-thought reasoning with the computational precision of code execution. | + | ToRA (Tool-integrated Reasoning Agents) is a series of LLM-based agents that solve complex mathematical problems by **interleaving natural language reasoning with program-based tool execution**(([[https:// |
| ===== Overview ===== | ===== Overview ===== | ||
| Line 24: | Line 24: | ||
| </ | </ | ||
| - | Training proceeds in three stages: | + | Training proceeds in three stages(([[https:// |
| - **Trajectory Curation**: Interactive tool-use trajectories are collected via prompting GPT-4 on math datasets | - **Trajectory Curation**: Interactive tool-use trajectories are collected via prompting GPT-4 on math datasets | ||
| Line 44: | Line 44: | ||
| Key findings across 10 mathematical reasoning benchmarks: | Key findings across 10 mathematical reasoning benchmarks: | ||
| - | * **13-19% absolute improvement** over prior open-source models across all datasets and model scales | + | * **13-19% absolute improvement** over prior open-source models across all datasets and model scales(([[https:// |
| * Tool integration is most beneficial for computation-heavy problems (algebra, number theory) | * Tool integration is most beneficial for computation-heavy problems (algebra, number theory) | ||
| * Output space shaping further improves accuracy by ensuring syntactically valid tool calls | * Output space shaping further improves accuracy by ensuring syntactically valid tool calls | ||
| Line 77: | Line 77: | ||
| print(tokenizer.decode(output[0], | print(tokenizer.decode(output[0], | ||
| </ | </ | ||
| - | |||
| - | ===== References ===== | ||
| - | |||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| ===== See Also ===== | ===== See Also ===== | ||
| Line 89: | Line 83: | ||
| * [[chain_of_thought|Chain-of-Thought Reasoning]] | * [[chain_of_thought|Chain-of-Thought Reasoning]] | ||
| * [[tool_use_agents|Tool-Use in LLM Agents]] | * [[tool_use_agents|Tool-Use in LLM Agents]] | ||
| + | |||
| + | ===== References ===== | ||