跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Plugin Eval

插件 已验证 活跃

Three-layer quality evaluation framework for Claude Code plugins with Elo ranking

1 个 Skill 0 个 MCP
目的

To provide developers and platform curators with a robust, automated system for evaluating and improving the quality of Claude Code extensions.

功能

  • Three-layer evaluation framework (static, LLM judge, Monte Carlo)
  • Elo ranking for comparative quality assessment
  • CLI commands for scoring, certifying, and comparing extensions
  • Detailed documentation and rubrics for evaluation dimensions

使用场景

  • Evaluating the quality of a new or existing Claude Code skill.
  • Certifying a plugin for marketplace inclusion or advanced use.
  • Comparing two different implementations of a similar capability.
  • Understanding the methodology behind Claude Code extension quality scoring.

非目标

  • Executing or running Claude Code extensions directly.
  • Providing a marketplace for distributing extensions.
  • Automated fixing of detected quality issues.

安装

请先添加 Marketplace

/plugin marketplace add wshobson/agents
/plugin install plugin-eval@claude-code-workflows

质量评分

已验证
98 /100
12 days ago 分析

信任信号

最近提交14 days ago
星标35.3k
许可证MIT
状态
查看源代码