Benchmark
Skill · Verified · Active
Compares Claude Code output under the full configuration against a minimal configuration, using standardized tasks for each stack.
The goal is to help users understand the impact of their Claude Code configuration by benchmarking it against a minimal baseline on specific tasks.
Features
- Compares full vs. minimal Claude Code configurations.
- Executes standardized benchmark tasks in isolated worktrees.
- Reports side-by-side results for each run, including files created, tests passing, and errors.
- Includes cleanup steps for created worktrees.
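The isolation and cleanup steps listed above could look roughly like the following. This is a minimal sketch, not the skill's actual implementation: the worktree names `bench-full` and `bench-minimal` are hypothetical, and it assumes that the "minimal" variant is produced by deleting the `.claude/` directory from the baseline worktree's working copy. Only standard `git worktree` subcommands are used.

```python
import shutil
import subprocess
from pathlib import Path


def worktree_add_cmd(path: Path, ref: str = "HEAD") -> list[str]:
    # --detach checks out `ref` without creating or moving any branch,
    # so the project's own branches are left untouched.
    return ["git", "worktree", "add", "--detach", str(path), ref]


def worktree_remove_cmd(path: Path) -> list[str]:
    # --force is needed because a benchmark run leaves uncommitted files behind.
    return ["git", "worktree", "remove", "--force", str(path)]


def prepare_worktrees(repo: Path, base: Path) -> dict[str, Path]:
    """Create one worktree per variant (hypothetical directory names)."""
    trees = {"full": base / "bench-full", "minimal": base / "bench-minimal"}
    for path in trees.values():
        subprocess.run(worktree_add_cmd(path), cwd=repo, check=True)
    # Assumption: the minimal baseline is the same checkout without .claude/.
    shutil.rmtree(trees["minimal"] / ".claude", ignore_errors=True)
    return trees


def cleanup_worktrees(repo: Path, trees: dict[str, Path]) -> None:
    """Remove every benchmark worktree created by prepare_worktrees."""
    for path in trees.values():
        subprocess.run(worktree_remove_cmd(path), cwd=repo, check=True)
```

Calling `prepare_worktrees(repo, base)` before the benchmark task and `cleanup_worktrees(repo, trees)` afterwards (ideally in a `try`/`finally`) keeps all generated files outside the main checkout.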
Use cases
- When evaluating the effectiveness of custom `.claude/` configurations.
- To identify overly complex or inert configuration rules.
- To justify configuration changes by quantifying their impact on task execution.
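Quantifying impact comes down to comparing the same metrics across the two runs. The sketch below is one way to do that, under assumptions: the `RunResult` type and its three fields are hypothetical names modeled on the metrics this page mentions (files created, tests passing, errors), and the numbers in the usage note are placeholders, not measured results.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class RunResult:
    """Metrics collected from one benchmark run (hypothetical schema)."""
    files_created: int
    tests_passing: int
    errors: int


def compare(full: RunResult, minimal: RunResult) -> dict[str, int]:
    """Return full minus minimal for every metric."""
    return {
        f.name: getattr(full, f.name) - getattr(minimal, f.name)
        for f in fields(RunResult)
    }


def report_lines(full: RunResult, minimal: RunResult) -> list[str]:
    """Render a plain-text comparison table, one row per metric."""
    lines = [f"{'metric':<15}{'full':>6}{'minimal':>9}{'delta':>7}"]
    for name, delta in compare(full, minimal).items():
        lines.append(
            f"{name:<15}{getattr(full, name):>6}{getattr(minimal, name):>9}{delta:>+7}"
        )
    return lines
```

For example, with placeholder values `compare(RunResult(5, 12, 1), RunResult(4, 9, 3))` returns `{"files_created": 1, "tests_passing": 3, "errors": -2}`. A delta near zero on every metric would suggest that the extra configuration has little measurable effect on that task.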
Non-goals
- Automatically applying configuration changes based on benchmark results.
- Benchmarking arbitrary code execution outside of Claude Code tasks.
- Modifying the project's main branch or committed files.
Installation
Add the marketplace first, then install the plugin:
/plugin marketplace add luiseiman/claude-kit
/plugin install claude-kit@dotforge
Quality score: Verified
Similar extensions
- Running Claude Code Via Litellm Copilot (score 100): Use when routing Claude Code to GitHub Copilot through a local LiteLLM proxy to reduce direct Anthropic spend, when configuring ANTHROPIC_BASE_URL or ANTHROPIC_MODEL overrides, or when troubleshooting Copilot proxy setup failures such as model-not-found, no localhost traffic, or GitHub 401/403 authentication errors.
- Rule Effectiveness Analysis (score 100): Analyzes which rules are actively used and which are inert, detects coverage gaps, and recommends pruning to reduce token consumption.
- Bootstrap Project (score 100): Initializes the Claude Code configuration in a new or existing project using the dotforge template.
- Definition Of Done (score 100): Mandatory checks to run before completing any task that touches md files or dart code in this repository.
- Validate Plugin (score 100): Validates a Claude Code plugin's structure, frontmatter, and MCP tool references.
- Guard (score 100): Protects Claude Code sessions from context overflow by running a background daemon that monitors session size and prunes automatically before compaction hits. Use when the user says "guard", "protect session", "context getting long", "prevent compaction", or "session management", or when running agent teams that need continuous context protection.