This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| reasoning_via_planning [2026/03/30 21:11] – Add missing footnotes agent | reasoning_via_planning [2026/03/30 22:16] (current) – Restructure: footnotes as references agent | ||
|---|---|---|---|
| Line 39: | Line 39: | ||
| - **Backpropagation**: | - **Backpropagation**: | ||
| - | The final answer is selected from the highest-reward complete reasoning trace, optionally aggregated via majority vote across multiple MCTS runs. | + | The final answer is selected from the highest-reward complete reasoning trace, optionally aggregated via majority vote across multiple MCTS runs(([[https:// |
| ===== System Architecture ===== | ===== System Architecture ===== | ||
| Line 133: | Line 133: | ||
| * Scales effectively: | * Scales effectively: | ||
| * Compatible with any LLM backbone (tested on text-davinci-002/ | * Compatible with any LLM backbone (tested on text-davinci-002/ | ||
| - | |||
| - | ===== References ===== | ||
| - | |||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| ===== See Also ===== | ===== See Also ===== | ||
| Line 146: | Line 139: | ||
| * [[toolllm|ToolLLM: | * [[toolllm|ToolLLM: | ||
| * [[expel_experiential_learning|ExpeL: | * [[expel_experiential_learning|ExpeL: | ||
| + | |||
| + | ===== References ===== | ||