Plugin Eval
Plugin Verifiziert AktivThree-layer quality evaluation framework for Claude Code plugins with Elo ranking
To provide developers and platform curators with a robust, automated system for evaluating and improving the quality of Claude Code extensions.
Funktionen
- Three-layer evaluation framework (static, LLM judge, Monte Carlo)
- Elo ranking for comparative quality assessment
- CLI commands for scoring, certifying, and comparing extensions
- Detailed documentation and rubrics for evaluation dimensions
Anwendungsfälle
- Evaluating the quality of a new or existing Claude Code skill.
- Certifying a plugin for marketplace inclusion or advanced use.
- Comparing two different implementations of a similar capability.
- Understanding the methodology behind Claude Code extension quality scoring.
Nicht-Ziele
- Executing or running Claude Code extensions directly.
- Providing a marketplace for distributing extensions.
- Automated fixing of detected quality issues.
Installation
Zuerst Marketplace hinzufügen
/plugin marketplace add wshobson/agents/plugin install plugin-eval@claude-code-workflowsQualitätspunktzahl
VerifiziertVertrauenssignale
Ähnliche Erweiterungen
Cypress
100Erstellen, aktualisieren und beheben Sie Cypress-Tests. Verbinden Sie sich mit Cypress Cloud, um Testergebnisse anzuzeigen und Daten zur Verwaltung Ihrer Testsuite zu verwenden.
Huggingface Community Evals
98Add and manage evaluation results in Hugging Face model cards. Supports extracting eval tables from README content, importing scores from Artificial Analysis API, and running custom evaluations with vLLM/lighteval.
Voltagent Qa Sec
75Experten für Tests, Sicherheit und Codequalität – Code-Reviews, Penetrationstests, QA-Automatisierung und Validierung von UI-Flows