Benchmark
Skill Verified ActiveCompare Claude Code output with full config vs minimal config using standardized tasks per stack.
To help users understand the impact of their Claude Code configuration by benchmarking its effectiveness against a minimal baseline for specific tasks.
Features
- Compares full vs. minimal Claude Code configurations.
- Executes standardized benchmark tasks in isolated worktrees.
- Provides detailed comparative results for files created, tests passing, errors, etc.
- Includes cleanup steps for created worktrees.
Use Cases
- When evaluating the effectiveness of custom `.claude/` configurations.
- To identify overly complex or inert configuration rules.
- To justify configuration changes by quantifying their impact on task execution.
Non-Goals
- Automatically applying configuration changes based on benchmark results.
- Benchmarking arbitrary code execution outside of Claude Code tasks.
- Modifying the project's main branch or committed files.
Installation
First, add the marketplace
/plugin marketplace add luiseiman/claude-kit/plugin install claude-kit@dotforgeQuality Score
VerifiedTrust Signals
Similar Extensions
Running Claude Code Via Litellm Copilot
100Use when routing Claude Code through a local LiteLLM proxy to GitHub Copilot, reducing direct Anthropic spend, configuring ANTHROPIC_BASE_URL or ANTHROPIC_MODEL overrides, or troubleshooting Copilot proxy setup failures such as model-not-found, no localhost traffic, or GitHub 401/403 auth errors.
Rule Effectiveness Analysis
100Analyze which rules are actively used vs inert. Detect coverage gaps. Recommend pruning to reduce token consumption.
Bootstrap Project
100Initializes the Claude Code configuration in a new or existing project using the dotforge template.
Definition Of Done
100Mandatory checks to run before completing any task that touches md files or dart code in this repository.
Validate Plugin
100Validate a Claude Code plugin structure, frontmatter, and MCP tool references
Guard
100Protect Claude Code sessions from context overflow by running a background daemon that monitors session size and auto-prunes before compaction hits. Use when the user says "guard", "protect session", "context getting long", "prevent compaction", "session management", or is running agent teams that need continuous context protection.