Benchmark

Skill Verified Active

Compare Claude Code output with full config vs minimal config using standardized tasks per stack.

Purpose

To help users understand the impact of their Claude Code configuration by benchmarking its effectiveness against a minimal baseline for specific tasks.

Features

Compares full vs. minimal Claude Code configurations.
Executes standardized benchmark tasks in isolated worktrees.
Provides detailed comparative results for files created, tests passing, errors, etc.
Includes cleanup steps for created worktrees.

Use Cases

When evaluating the effectiveness of custom `.claude/` configurations.
To identify overly complex or inert configuration rules.
To justify configuration changes by quantifying their impact on task execution.

Non-Goals

Automatically applying configuration changes based on benchmark results.
Benchmarking arbitrary code execution outside of Claude Code tasks.
Modifying the project's main branch or committed files.

Installation

First, add the marketplace

/plugin marketplace add luiseiman/claude-kit

/plugin install claude-kit@dotforge

Quality Score

Verified

95 /100

Analyzed about 19 hours ago

Trust Signals

Last commit1 day ago

GitHub owner luiseiman

Stars6

LicenseMIT

Websitegithub.com

Status

View Source

Similar Extensions

Running Claude Code Via Litellm Copilot

100

Use when routing Claude Code through a local LiteLLM proxy to GitHub Copilot, reducing direct Anthropic spend, configuring ANTHROPIC_BASE_URL or ANTHROPIC_MODEL overrides, or troubleshooting Copilot proxy setup failures such as model-not-found, no localhost traffic, or GitHub 401/403 auth errors.

Skill

xixu-me

Rule Effectiveness Analysis

100

Analyze which rules are actively used vs inert. Detect coverage gaps. Recommend pruning to reduce token consumption.

Skill

luiseiman

Bootstrap Project

100

Initializes the Claude Code configuration in a new or existing project using the dotforge template.

Skill

luiseiman

Definition Of Done

100

Mandatory checks to run before completing any task that touches md files or dart code in this repository.

Skill

flutter

Validate Plugin

100

Validate a Claude Code plugin structure, frontmatter, and MCP tool references

Skill

ruvnet

Guard

100

Protect Claude Code sessions from context overflow by running a background daemon that monitors session size and auto-prunes before compaction hits. Use when the user says "guard", "protect session", "context getting long", "prevent compaction", "session management", or is running agent teams that need continuous context protection.

Skill

Ruya-AI