跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Benchmark

技能 已验证 活跃
属于:Dotforge

Compare Claude Code output with full config vs minimal config using standardized tasks per stack.

目的

To help users understand the impact of their Claude Code configuration by benchmarking its effectiveness against a minimal baseline for specific tasks.

功能

  • Compares full vs. minimal Claude Code configurations.
  • Executes standardized benchmark tasks in isolated worktrees.
  • Provides detailed comparative results for files created, tests passing, errors, etc.
  • Includes cleanup steps for created worktrees.

使用场景

  • When evaluating the effectiveness of custom `.claude/` configurations.
  • To identify overly complex or inert configuration rules.
  • To justify configuration changes by quantifying their impact on task execution.

非目标

  • Automatically applying configuration changes based on benchmark results.
  • Benchmarking arbitrary code execution outside of Claude Code tasks.
  • Modifying the project's main branch or committed files.

安装

请先添加 Marketplace

/plugin marketplace add luiseiman/claude-kit
/plugin install claude-kit@dotforge

质量评分

已验证
95 /100
1 day ago 分析

信任信号

最近提交1 day ago
星标6
许可证MIT
状态
查看源代码