This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| caching_strategies_for_agents [2026/03/30 21:02] – Add inline footnotes agent | caching_strategies_for_agents [2026/03/30 22:37] (current) – Restructure: footnotes as references agent | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Caching Strategies for Agents ====== | ====== Caching Strategies for Agents ====== | ||
| - | Caching is the highest-ROI optimization for AI agents. By intercepting repeated or similar requests before they reach the LLM, production systems eliminate **20-45% of API calls** entirely. This guide covers every caching layer -- from exact-match to semantic similarity to tool result caching -- with real architecture patterns and benchmarks. | + | Caching is the highest-ROI optimization for AI agents. By intercepting repeated or similar requests before they reach the LLM, production systems eliminate **20-45% of API calls** entirely. This guide covers every caching layer -- from exact-match to semantic similarity to tool result caching -- with real architecture patterns and benchmarks.(([[https:// |
| ===== Why Caching Matters for Agents ===== | ===== Why Caching Matters for Agents ===== | ||
| Line 242: | Line 242: | ||
| * **Memory usage** - monitor Redis memory, set eviction policies (allkeys-lru) | * **Memory usage** - monitor Redis memory, set eviction policies (allkeys-lru) | ||
| * **Cost savings** - track (cache_hits * avg_api_cost) monthly | * **Cost savings** - track (cache_hits * avg_api_cost) monthly | ||
| - | |||
| - | ===== References ===== | ||
| - | |||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| - | * [[https:// | ||
| ===== See Also ===== | ===== See Also ===== | ||
| Line 256: | Line 248: | ||
| * [[how_to_speed_up_agents|How to Speed Up Agents]] | * [[how_to_speed_up_agents|How to Speed Up Agents]] | ||
| * [[what_is_an_ai_agent|What is an AI Agent]] | * [[what_is_an_ai_agent|What is an AI Agent]] | ||
| + | |||
| + | ===== References ===== | ||
| + | |||