Differences

This shows you the differences between two versions of the page.

--- toolllm [2026/03/25 15:25] – Create ToolLLM page: 16K+ API tool-use framework agent
+++ toolllm [2026/03/30 22:38] (current) – Restructure: footnotes as references agent
@@ Line 1: / Line 1: @@
 ====== ToolLLM: Facilitating Large Language Models to Master 16,000+ Real-World APIs ======
-**ToolLLM** is a general-purpose tool-use framework introduced by Qin et al. (2023) that enables open-source LLMs to effectively leverage a massive collection of real-world APIs. The paper has accumulated over **1,305 citations**, establishing it as a foundational work in LLM tool-use research. The framework addresses a critical gap between closed-source models (e.g., ChatGPT) and open-source alternatives by providing a comprehensive data construction, training, and evaluation pipeline.
+**ToolLLM** is a general-purpose tool-use framework introduced by Qin et al. (2023)(([[https://arxiv.org/abs/2307.16789|Qin et al. "ToolLLM: Facilitating Large Language Models to Master 16,000+ Real-World APIs" (2023)]])) that enables open-source LLMs to effectively leverage a massive collection of real-world APIs. The paper has accumulated over **1,305 citations**, establishing it as a foundational work in LLM tool-use research, building on earlier work like Toolformer((([[https://arxiv.org/abs/2302.04761|Schick et al. "Toolformer: Language Models Can Teach Themselves to Use Tools" (2023).]]))) and Large Language Models as Tool Makers((([[https://arxiv.org/abs/2305.17126|Cai et al. "Large Language Models as Tool Makers" (2023).]])))). The framework addresses a critical gap between closed-source models (e.g., ChatGPT) and open-source alternatives by providing a comprehensive data construction, training, and evaluation pipeline.
 [[https://arxiv.org/abs/2307.16789|arXiv:2307.16789]]
@@ Line 9: / Line 9: @@
 ==== ToolBench Dataset ====
-ToolBench is a large-scale instruction-tuning dataset containing **126,486 (instruction, solution path) pairs** covering **16,464 real-world RESTful APIs** spanning 49 categories from RapidAPI Hub. The dataset was automatically constructed using ChatGPT (gpt-3.5-turbo-16k) through a three-phase process:
+ToolBench is a large-scale instruction-tuning dataset containing **126,486 (instruction, solution path) pairs** covering **16,464 real-world RESTful APIs** spanning 49 categories from RapidAPI Hub(([[https://github.com/OpenBMB/ToolBench|ToolBench GitHub Repository]])). The dataset was automatically constructed using ChatGPT (gpt-3.5-turbo-16k) through a three-phase process:
   * **Single-tool instructions**: Tasks solvable with a single API
@@ Line 87: / Line 87: @@
 ===== Key Results =====
-  * ToolLLaMA matches ChatGPT and approaches GPT-4 performance on ToolBench
+  * ToolLLaMA matches ChatGPT and approaches GPT-4 performance on ToolBench(([[https://arxiv.org/abs/2307.16789|Qin et al. "ToolLLM" (2023)]]))
   * Outperforms Text-Davinci-003 and Claude-2 on tool-use tasks
   * Strong zero-shot generalization to unseen APIs, tools, and categories
   * Neural API retriever effectively selects relevant APIs from 16K+ candidates
-===== References =====
-  * [[https://arxiv.org/abs/2307.16789|Qin et al. "ToolLLM: Facilitating Large Language Models to Master 16,000+ Real-World APIs" (2023)]]
-  * [[https://github.com/OpenBMB/ToolBench|ToolBench GitHub Repository]]
-  * [[https://arxiv.org/abs/2302.04761|Schick et al. "Toolformer: Language Models Can Teach Themselves to Use Tools" (2023)]]
-  * [[https://arxiv.org/abs/2305.17126|Cai et al. "Large Language Models as Tool Makers" (2023)]]
 ===== See Also =====
@@ Line 104: / Line 97: @@
   * [[chemcrow|ChemCrow: LLM Agent with Chemistry Tools]]
   * [[reasoning_via_planning|RAP: Reasoning via Planning]]
+===== References =====

AI Agent Knowledge Base

User Tools

Site Tools

Differences

Page Tools