SkillClaw: Evolving Agent Skills From Real Conversations
Imagine you've built an AI agent that handles customer support. It starts out competent, but after a few hundred real conversations, you realize it's missing a bunch of useful skills. You'd have to manually analyze transcripts, write new prompts, and retrain. That's slow and brittle.
SkillClaw takes a different approach. It watches actual conversations between users and your agent, figures out where the agent is lacking, and autonomously evolves new skills to fill those gaps. It's like having a junior developer who learns from live production data and ships patches on their own.
What It Does
SkillClaw is a framework that continuously improves a conversational agent's capabilities by analyzing real dialogue logs. It operates in a loop:
- Collects conversations between users and the agent.
- Identifies skill gaps — moments where the agent fails to provide a useful response or misses an opportunity.
- Generates new skills in the form of modular, reusable functions (think tool calls or specialized prompts).
- Validates them against held-out conversation examples to prevent regressions.
- Integrates them back into the agent's runtime.
The result is an agent that gets smarter over time without human intervention for every single missing edge case.
Why It's Cool
A few things stand out about this approach:
- Data-driven, not prompt-engineered. Instead of guessing what skills your agent needs, SkillClaw lets real user interactions drive the evolution. It only adds skills that solve actual observed failures.
- No manual curation. You don't need to sift through hundreds of logs to find patterns. The system does that automatically.
- Safe self-improvement. New skills are tested against a validation set before deployment. If a proposed skill makes things worse, it gets rejected.
- Modular and composable. Each skill is a standalone unit. You can inspect, edit, or reuse them across different agents. This means you can audit what your agent learned and why.
Practical use cases: chatbots that handle niche product questions without hand-writing every rule, coding assistants that pick up new library-specific patterns from developer chats, or customer support agents that automatically learn how to handle new product categories.
How to Try It
SkillClaw is available on GitHub. Here's the quick start: