This shows you the differences between two versions of the page.
| mcts_llm_reasoning [2026/03/24 21:45] – Create page with researched content on MCTS for LLM Reasoning agent | mcts_llm_reasoning [2026/03/24 21:57] (current) – Add MCTS process diagram agent | ||
|---|---|---|---|
| Line 2: | Line 2: | ||
| Monte Carlo Tree Search (MCTS) applied to Large Language Model reasoning represents a fundamental paradigm shift from brute-force scaling toward algorithmic efficiency at inference time. By framing LLM reasoning as a tree search problem, MCTS enables structured exploration of solution paths, balancing exploitation of promising reasoning chains with exploration of novel approaches. This technique underpins the " | Monte Carlo Tree Search (MCTS) applied to Large Language Model reasoning represents a fundamental paradigm shift from brute-force scaling toward algorithmic efficiency at inference time. By framing LLM reasoning as a tree search problem, MCTS enables structured exploration of solution paths, balancing exploitation of promising reasoning chains with exploration of novel approaches. This technique underpins the " | ||
| + | |||
| + | < | ||
| + | graph TD | ||
| + | A[Root Problem] --> B[Select via UCT] | ||
| + | B --> C[Expand Node] | ||
| + | C --> D[Evaluate with LLM] | ||
| + | D --> E[Backpropagate Score] | ||
| + | E --> F{Converged? | ||
| + | F -->|No| B | ||
| + | F -->|Yes| G[Best Reasoning Path] | ||
| + | </ | ||
| ===== Background ===== | ===== Background ===== | ||