AI Agent

OpenClaw 实战：一行路径省掉 84% 的工具调用——Cron Job 排障实录

2026-06-20·553 字· 3 分钟

AI Agent 实战 OpenClaw AI Agent Cron Job SKILL.md Prompt Engineering 性能优化

OpenClaw 的 daily-ai-news 定时任务连续超时。根因不是模型不够强——是 SKILL.md 里少写了一行绝对路径，导致 Agent 每次花 15 次 exec 搜索工具位置。消息数 165→54，exec 调用 44→7，一行路径比任何算法调优都管用。

OpenClaw 记忆实战：从「向量搜索挂了也能用」到用 NVIDIA 免费 API 补全最后一块拼图

2026-06-20·585 字· 3 分钟

AI Agent 实战 OpenClaw AI Agent 记忆系统 Embedding NVIDIA 向量搜索 BM25

OpenClaw 记忆系统的向量检索默认不可用——但 BM25 文本搜索兜底让系统照常运转了两周。当你发现「不配 embedding 也能跑」，到底要不要修？怎么用 NVIDIA 免费 API 零成本补上？

OpenClaw Memory in Practice: From 'Vector Search Is Down But Everything Still Works' to Zero-Cost NVIDIA Embeddings

2026-06-20·1697 words· 8 min

AI Agent in Practice OpenClaw AI Agent Memory System Embedding NVIDIA Vector Search BM25

OpenClaw’s vector retrieval silently failed — but BM25 text search kept the memory system running for two weeks unnoticed. Should you even bother fixing it? Here’s how I used NVIDIA’s free embedding API to complete the picture at zero cost.

OpenClaw in Practice: One File Path Eliminated 84% of Tool Calls — A Cron Job Debugging Story

2026-06-20·1537 words· 8 min

AI Agent in Practice OpenClaw AI Agent Cron Job SKILL.md Prompt Engineering Performance

OpenClaw’s daily-ai-news cron job kept timing out. The root cause: a missing absolute path in the SKILL.md caused the Agent to spend 15 exec calls searching for a tool every run. Messages 165→54, exec calls 44→7 — one file path beat any algorithm optimization.

Claude's Tool Calling Paradigm Shift: A Deep Dive into Programmatic Tool Calling and Dynamic Filtering

2026-06-13·2548 words· 12 min

AI Agent in Practice Claude AI Agent Agent Architecture Tool Calling Context Engineering Programmatic Tool Calling Dynamic Filtering Code Execution

Background: The Cost Problem in Agent Tool Calling # In traditional agent tool-calling, every tool invocation requires a full cycle of “model inference → tool execution → result return → model re-inference.” This seemingly natural loop breaks down at scale in three ways: Context Pollution: Every tool result is injected verbatim into the context window. Fetch expense reports for 20 employees, and 2,000+ line items enter context — even though you only need to know “which 3 people exceeded their budget.” Inference Overhead: Each tool call demands a full model inference pass. Five tools = five inference passes, each costing hundreds of milliseconds to seconds. Noise Degrades Accuracy: When the context window is packed with intermediate results, the model must find signal in noise. Context Rot research shows LLM performance on complex tasks drops 50-70% as context grows. As Florian Bruniaux puts it in the Claude Code Architecture Guide: “The Outer Loop — everything outside the model: context management, tool invocation, verification, memory consolidation — increasingly determines system quality more than model inference itself.”

Claude 工具调用范式转移：Programmatic Tool Calling 与 Dynamic Filter 深度解读

2026-06-13·1246 字· 6 分钟

AI Agent 实战 Claude AI Agent Agent 架构工具调用上下文工程 Programmatic Tool Calling Dynamic Filtering 代码执行

背景：Agent 工具调用的成本困境 # 在传统 Agent 工具调用模型中，每调用一个工具都需要完成一次"模型推理 → 工具执行 → 结果返回 → 模型再推理"的完整回合。这个看似自然的循环，在工具调用变多时会暴露出三个致命问题：上下文污染：每个工具的结果都被原封不动地注入上下文窗口。查 20 个员工的报销记录，2000+ 条费用明细全部进入 context，即使你只需要知道"哪 3 个人超预算了"。推理开销：每个工具调用都需要一次完整的模型推理。5 个工具调用 = 5 次推理 pass，每次几百毫秒到几秒不等。噪声导致准确率下降：当上下文窗口塞满了中间结果，模型不得不在大量噪声中寻找信号。Context Rot 研究表明，LLM 在复杂任务上的性能会随上下文增长而下降 50-70%。正如 Bruno 在 Claude Code Architecture Guide 中所指出的：“Outer Loop（模型外的一切：上下文管理、工具调用、验证、记忆巩固）开始比模型推理本身更决定系统质量。” Anthropic 在 2025 年 11 月到 2026 年 2 月间陆续推出的一系列工具使用增强功能，本质上都是为了解决 Outer Loop 的效率问题。其中 Programmatic Tool Calling (PTC) 和 Dynamic Filtering 是最具范式转移意义的两项。

OpenClaw 生产踩坑：当最先进的记忆系统遇到最静默的失败

2026-05-27·1643 字· 8 分钟

AI Agent 实战 OpenClaw AI Agent 飞书记忆系统 Compaction 排障

从部署到排障，记录 OpenClaw 从启动失败、飞书消息静默吞回复到 production 稳定的全链路实战经验——compaction safeguard、五层排查法、model-harness-fit 与记忆系统对比。

OpenClaw in Production: When the Most Advanced Memory System Meets the Quietest Failure

2026-05-27·4035 words· 19 min

AI Agent in Practice OpenClaw AI Agent Feishu Memory System Compaction Debugging

A full-chain production battle log: from startup failures and Feishu message silent drops to production stability — compaction safeguard, five-layer debugging, model-harness fit, and memory system comparison.

什么时候用 RAG，什么时候用 LLM Wiki，什么时候用纯文本记忆——一个 Agent 记忆选型框架

2026-05-11·437 字· 3 分钟

Agent 架构 AI Agent 记忆系统 RAG 上下文工程 Agent 架构

做 Agent 系统的人迟早会撞上这个选择题：用户的数据往哪放，下次对话怎么记住？目前工业界有三条主流路线——RAG（向量检索）、LLM Wiki（结构化知识注入）、纯文本上下文记忆（CLAUDE.md / Cursor Rules 模式）。三条路各有拥趸，但选错的代价很大：RAG 做轻了是噪音生成器，纯文本做重了是 token 焚化炉。这篇给出一个可以直接用的决策框架。三种方案一句话定义 # 方案核心机制代表产品/模式 RAG 向量检索 → top-k 片段 → 拼入 prompt Mem0, Zep, LangChain RAG, Cursor Codebase Index LLM Wiki 结构化文档 → 全量或按需注入 system prompt Claude Projects, GPTs Knowledge, Notion AI 纯文本上下文 Markdown/文本文件 → 直接拼入 system prompt CLAUDE.md, Cursor Rules, AGENTS.md, Devin Knowledge 关键区别不在于"存哪里"，而在于检索方式和注入时机。

RAG vs LLM Wiki vs Plain Text — A Decision Framework for Agent Long-Term Memory

2026-05-11·1234 words· 6 min

Agent Architecture AI Agent Memory RAG Context Engineering

Every Agent builder hits this question eventually: where do I store user data so the agent remembers it next session? Three approaches dominate the landscape: RAG (vector retrieval), LLM Wiki (structured knowledge injection), and plain-text context memory (the CLAUDE.md / Cursor Rules pattern). Each has vocal advocates. But picking wrong is expensive — do RAG too light and it’s a noise generator; do plain text too heavy and it’s a token incinerator.

↑