This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
start [2024/11/26 18:44] – created brad | start [2024/12/13 18:26] (current) – brad | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== Large Language Model (LLM) Agents ====== | + | ===== AgentWiki |
- | Welcome to the **LLM Agents Wiki**, your comprehensive resource for understanding and leveraging Large Language | + | Welcome to the **AgentWiki**, a comprehensive resource for understanding and leveraging Large Language |
+ | Catch up on the latest | ||
- | ----- | + | ==== Introduction ==== |
- | ===== Introduction ===== | + | 🤖 Large Language Model (LLM) Agents are sophisticated AI systems that utilize large-scale neural language models to perform tasks autonomously. |
+ | By comprehending natural language, reasoning through complex problems, and interacting with external tools and environments, | ||
+ | They are capable of planning, executing, and adapting their actions based on given objectives and feedback from their environment. | ||
- | **Large Language Model (LLM) Agents** are AI systems that utilize large language models to perform tasks autonomously. By understanding natural language, reasoning through complex problems, and interacting with external tools and environments, | + | ==== Agent System Overview ==== |
- | ----- | + | 🧠 In an LLM-powered autonomous agent system, the LLM functions as the agent' |
- | ===== Key Features of LLM Agents ===== | + | * **[[planning|Planning]]** |
+ | * Task Decomposition | ||
+ | * Self-Reflection | ||
- | * **Reasoning and Planning:** LLM Agents analyze complex tasks, devise strategies, and plan sequences of actions to achieve specific goals. | + | * **[[memory|Memory]]** |
- | * **Tool Utilization: | + | * Hierarchical Memory Systems |
- | * **Memory and Context Management: | + | * Efficient Retrieval Mechanisms |
- | * **Natural Language Understanding: | + | |
- | * **Autonomy and Adaptability: | + | |
- | ----- | + | * **[[tool_use|Tool Use]]** |
+ | * External API Integration | ||
+ | * Dynamic Tool Selection | ||
- | ===== Types of LLM Agents ===== | + | * **[[structured_outputs|Structured Outputs]]** |
+ | * Grammars | ||
+ | * Constrained Outputs | ||
- | ==== Chain-of-Thought (CoT) Reasoning ==== | + | 🚀 These components enable |
- | An approach where the LLM generates a step-by-step reasoning process, enhancing its problem-solving capabilities by making intermediate reasoning steps explicit. | + | |
- | ==== ReAct (Reasoning | + | * **Plan** complex tasks through decomposition |
- | A framework that combines | + | * **Remember** past interactions using advanced memory architectures. |
+ | * **Utilize Tools** | ||
- | ==== AutoGPT | + | ==== Key Features of LLM Agents |
- | An experimental open-source application demonstrating how LLMs like GPT-4 can autonomously achieve user-defined goals by iteratively planning, executing, and learning from actions. | + | |
- | ==== BabyAGI ==== | + | 🌀 **[[advanced_reasoning_planning|Advanced Reasoning and Planning]]**: |
- | A simplified artificial general intelligence model that uses an LLM to create, prioritize, and execute tasks, aiming | + | Employ sophisticated reasoning strategies |
- | ==== AgentGPT ==== | + | 🔧 **[[tool_utilization|Tool Utilization and API Interaction]]**: |
- | A platform enabling users to deploy autonomous AI agents that can carry out tasks in a web-based environment, combining planning and execution | + | Interface with external tools, APIs, databases, and services |
- | ==== Plan-and-Execute Agents ==== | + | 📚 **[[hierarchical_memory|Hierarchical Memory |
- | Agents that first plan a sequence of actions | + | Use multi-level memory architectures |
- | ==== Conversational Agents ==== | + | 💡 **[[natural_language_understanding|Natural Language Understanding and Generation]]**: |
- | Specialized in dialogue, these agents understand | + | Interpret |
- | ==== Tool-Using Agents ==== | + | 🔄 **[[autonomy|Autonomy and Adaptive Behavior]]**: |
- | Agents capable of utilizing external tools or APIs (e.g., calculators, | + | Operate independently, making informed decisions |
- | ----- | + | ==== Workflows ==== |
- | ===== Design Patterns for LLM Agents ===== | + | Workflows in LLM Agent systems streamline the design, implementation, |
- | | + | 🌊 **Key Workflow Tools** |
- | * **Reinforcement Learning from Human Feedback (RLHF):** Training LLMs using human feedback to improve responses and align them with desired outcomes. | + | * **[[flowise|Flowise]]**: A visual programming interface for designing |
- | * **Agent Loop (Perception-Thought-Action Cycle):** An iterative process where the agent perceives inputs, thinks (reasoning/ | + | * **[[promptflow|PromptFlow]]**: A framework |
- | * **Context Window Management: | + | |
- | * **Tool Integration Patterns:** Designing interfaces that allow the LLM to interact with external tools through well-defined APIs or action schemas. | + | |
- | * **Memory Augmentation:** Implementing mechanisms | + | |
- | * **Modular Architecture: | + | |
- | ----- | + | These tools enhance the modularity and reusability of task definitions, |
- | ===== Libraries and Frameworks ===== | + | ==== Components of LLM Agents |
- | * **LangChain: | + | === Planning === |
- | * **LlamaIndex (GPT Index):** A toolkit for connecting LLMs with external data sources and knowledge bases. | + | |
- | * **Hugging Face Transformers: | + | |
- | * **OpenAI API:** Provides access to advanced language models like GPT-4, enabling the integration of LLM capabilities into custom applications and agents. | + | |
- | * **Microsoft Guidance:** A library for controlling LLM generation, allowing developers to specify desired behavior and constraints. | + | |
- | * **AutoGPT and BabyAGI Implementations: | + | |
- | * **Haystack: | + | |
- | ----- | + | 🧩 Planning involves the strategic breakdown of complex tasks into manageable sub-tasks, devising algorithms, and sequencing actions based on logical reasoning and predicted outcomes. |
- | ===== Applications of LLM Agents ===== | + | == Task Decomposition |
- | | + | 🌳 **[[chain_of_thought|Chain-of-Thought (CoT) Reasoning]]** |
- | | + | 🌲 **[[tree_of_thoughts|Tree of Thoughts]]** |
- | * **Research Assistance: | + | ⚙️ |
- | * **Education and Tutoring:** Offering personalized learning experiences, | + | |
- | * **Content Creation:** Generating articles, reports, creative writing, or marketing materials based on user input or autonomous exploration. | + | |
- | * **Software Development Assistance: | + | |
- | | + | |
- | ----- | + | == Self-Reflection == |
- | ===== Recent Developments ===== | + | 🔍 **[[react_framework|ReAct (Reasoning and Acting)]]** |
+ | 🔄 **[[reflexion_framework|Reflexion Framework]]** | ||
+ | 🪞 **[[chain_of_hindsight|Chain of Hindsight (CoH)]]** | ||
+ | 📉 **[[algorithm_distillation|Algorithm Distillation (AD)]]** | ||
- | The field of LLM Agents is rapidly evolving, with significant advancements including: | + | === Memory === |
- | * **Enhanced Reasoning Abilities: | + | 📦 Memory mechanisms |
- | * **Tool Use Integration: | + | |
- | * **Memory | + | |
- | * **Ethical and Safe AI Practices: | + | |
- | * **Open-Source Agent Frameworks: | + | |
- | ----- | + | == Hierarchical Memory Systems == |
- | ===== Getting Started ===== | + | 🕒 **[[sensory_memory|Sensory Memory]]** |
+ | ⏳ **[[short_term_memory|Short-Term Memory (Working Memory)]]** | ||
+ | 📜 **[[long_term_memory|Long-Term Memory (Persistent Memory)]]** | ||
+ | * 📂 **[[explicit_memory|Explicit/ | ||
+ | * 🤫 **[[implicit_memory|Implicit/ | ||
- | Embark on your journey with LLM Agents by exploring the following resources: | + | == Efficient Retrieval Mechanisms == |
- | * **[[Introduction to LLM Agents|Introduction to LLM Agents]]:** An overview of the architecture and components of LLM-powered agent applications. | + | 🔎 **[[maximum_inner_product_search|Maximum Inner Product Search (MIPS)]]** |
- | * **[[LangChain Documentation|LangChain Documentation]]:** Guides and tutorials on using LangChain for developing LLM applications and agents. | + | * 🧮 **[[locality_sensitive_hashing|Locality-Sensitive Hashing (LSH)]]** |
- | * **[[OpenAI API Reference|OpenAI API Reference]]:** Documentation and examples for integrating OpenAI' | + | |
- | * **[[AutoGPT GitHub Repository|AutoGPT GitHub Repository]]:** Explore the code and documentation of AutoGPT to understand autonomous agent implementations. | + | * 🗺️ |
- | * **[[ReAct Paper|ReAct: Synergizing Reasoning and Acting in Language Models]]:** A research paper introducing the ReAct framework. | + | |
- | * **[[Hugging Face Transformers|Hugging Face Transformers]]:** Learn how to use pre-trained models and fine-tune them for your specific needs. | + | |
- | ----- | + | === Tool Use === |
- | ===== Join the Community ===== | + | 🔧 Tool use extends |
- | Stay updated with the latest trends, research, and developments in the field of LLM Agents by joining our community: | + | * 🧰 **[[mrkl_systems|MRKL Systems (Modular Reasoning, Knowledge, and Language)]]** |
+ | * 🛠️ **[[tool_augmented_language_models|Tool-Augmented Language Models (TALM)]]** | ||
+ | * 🤖 **[[toolformer|Toolformer]]** | ||
- | * **Discussion Forums:** Engage with practitioners, | + | == API Integration == |
- | * **Contribute to Open-Source Projects:** Participate in the development of tools and frameworks related to LLM Agents. | + | |
- | * **Attend Workshops and Webinars:** Expand your knowledge through events focused on LLM technologies and applications. | + | |
- | ----- | + | 🔗 **[[openai_function_calling|OpenAI Function Calling and ChatGPT Plugins]]** |
+ | 🔗 **[[hugginggpt|HuggingGPT]]** | ||
+ | 🗂️ **[[api_bank_benchmark|API-Bank Benchmark]]** | ||
- | **Explore. Learn. Innovate.** Unlock the transformative potential of Large Language Model Agents and be at the forefront | + | ==== Types of LLM Agents ==== |
+ | |||
+ | 🧠 **[[chain_of_thought_agents|Chain-of-Thought Agents]]** | ||
+ | 🔄 **[[react_agents|ReAct | ||
+ | 🤖 **[[autonomous_agents|Autonomous Agents]]** | ||
+ | * 🤖 **[[autogpt|AutoGPT]]** | ||
+ | * 🤖 **[[babyagi|BabyAGI]]** | ||
+ | * 🤖 **[[agentgpt|AgentGPT]]** | ||
+ | 📋 **[[plan_and_execute_agents|Plan-and-Execute Agents]]** | ||
+ | 💬 **[[conversational_agents|Conversational Agents]]** | ||
+ | 🔧 **[[tool_using_agents|Tool-Using Agents]]** | ||
+ | |||
+ | ==== Design Patterns for LLM Agents ==== | ||
+ | |||
+ | 🔗 **[[prompt_chaining|Prompt Chaining and Orchestration]]** | ||
+ | 📈 **[[rlhf|Reinforcement Learning from Human Feedback (RLHF)]]** | ||
+ | 🔄 **[[agent_loop|Agent Loop (Perception-Thought-Action Cycle)]]** | ||
+ | 🗂️ **[[context_window_management|Context Window Management]]** | ||
+ | 🔧 **[[tool_integration_patterns|Tool Integration Patterns]]** | ||
+ | 📂 **[[memory_augmentation_strategies|Memory Augmentation Strategies]]** | ||
+ | 🏗️ **[[modular_architectures|Modular and Layered Architectures]]** | ||
+ | |||
+ | ==== Libraries and Frameworks ==== | ||
+ | |||
+ | Explore a range of tools and platforms for developing LLM agents: | ||
+ | |||
+ | **Frameworks & Platforms** | ||
+ | |||
+ | ⚙️ **[[agent_protocol|Agent Protocol]]** | ||
+ | 🤖 **[[anthropic_context_protocol|Anthropic Model Context Protocol]]** | ||
+ | 💻 **[[chatdev|ChatDev]]** | ||
+ | ⚡ **[[bolt_new|Bolt.new]]** | ||
+ | 🔗 **[[flowise|Flowise]]** | ||
+ | 📋 **[[instructor_framework|Instructor]]** | ||
+ | 🔎 **[[llamaindex|LlamaIndex]]** | ||
+ | 🧠 **[[autogpt|AutoGPT]]** | ||
+ | 🤖 **[[babyagi|BabyAGI]]** | ||
+ | 🔗 **[[langroid|Langroid]]** | ||
+ | 📊 **[[microsoft_graphrag|Microsoft GraphRAG]]** | ||
+ | 🌟 **[[lite_llm|LiteLLM]]** | ||