此内容尚未提供您的语言版本,正在以英文显示。

Plugin Eval

插件已验证活跃

Three-layer quality evaluation framework for Claude Code plugins with Elo ranking

1 个 Skill 0 个 MCP

目的

To provide developers and platform curators with a robust, automated system for evaluating and improving the quality of Claude Code extensions.

功能

Three-layer evaluation framework (static, LLM judge, Monte Carlo)
Elo ranking for comparative quality assessment
CLI commands for scoring, certifying, and comparing extensions
Detailed documentation and rubrics for evaluation dimensions

使用场景

Evaluating the quality of a new or existing Claude Code skill.
Certifying a plugin for marketplace inclusion or advanced use.
Comparing two different implementations of a similar capability.
Understanding the methodology behind Claude Code extension quality scoring.

非目标

Executing or running Claude Code extensions directly.
Providing a marketplace for distributing extensions.
Automated fixing of detected quality issues.

安装

请先添加 Marketplace

/plugin marketplace add wshobson/agents

/plugin install plugin-eval@claude-code-workflows

包含 1 个扩展

Skill (1)

Evaluation Methodology 技能

PluginEval quality methodology — dimensions, rubrics, statistical methods, and scoring formulas. Use this skill when understanding how plugin quality is measured, when interpreting a low score on a specific dimension, when deciding how to improve a skill's triggering accuracy or orchestration fitness, when calibrating scoring thresholds for your marketplace, or when explaining quality badges to external partners like Neon.

质量评分

已验证

98 /100

12 days ago 分析

信任信号

最近提交14 days ago

GitHub 所有者 wshobson

星标35.3k

许可证MIT

网站sethhobson.com

状态

查看源代码

类似扩展

Cypress

100

创建、更新和修复 Cypress 测试。连接到 Cypress Cloud 以查看测试结果并利用数据来管理您的测试套件。

插件

cypress-io

Huggingface Community Evals

Add and manage evaluation results in Hugging Face model cards. Supports extracting eval tables from README content, importing scores from Artificial Analysis API, and running custom evaluations with vLLM/lighteval.

插件

huggingface

Voltagent Qa Sec

测试、安全和代码质量专家 - 代码审查、渗透测试、QA 自动化和 UI 流程验证

插件

VoltAgent