Agent Evaluation
Evaluate and improve Claude Code commands, skills, and agents. Use when testing prompt effectiveness, validating context engineering choices, or measuring improvement quality.
Provides systematic methods and best practices for evaluating and improving the performance, reliability, and quality of AI agents and their components.
Features
- Structured evaluation methodologies (LLM-as-Judge, Human Eval)
- Comprehensive rubric design with scoring guidelines
- Techniques for mitigating LLM evaluation biases
- Practical prompt patterns and workflow examples
- Guidance on test case design and iteration
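The LLM-as-Judge methodology listed above typically pairs a grading prompt with a weighted rubric. The sketch below shows only the rubric-aggregation half; the criterion names, weights, and 1-5 scale are illustrative assumptions, and the judge call that would produce the per-criterion scores is elided.

```python
from dataclasses import dataclass

@dataclass
class Criterion:
    name: str
    weight: float  # relative importance; weights should sum to 1.0
    score: int     # judge-assigned score on a 1-5 scale

def rubric_score(criteria: list[Criterion]) -> float:
    """Collapse per-criterion judge scores into one weighted 0-100 score."""
    total_weight = sum(c.weight for c in criteria)
    if total_weight == 0:
        raise ValueError("rubric has no weight")
    # Normalize each 1-5 score to 0-1, then take the weighted mean.
    weighted = sum(c.weight * (c.score - 1) / 4 for c in criteria)
    return 100 * weighted / total_weight

# Hypothetical rubric for grading a Claude Code command's output:
rubric = [
    Criterion("correctness", 0.5, 5),
    Criterion("clarity", 0.3, 4),
    Criterion("conciseness", 0.2, 3),
]
print(rubric_score(rubric))  # → 82.5
```

Keeping the aggregation deterministic like this means only the per-criterion grading is delegated to the judge model, which makes scores reproducible and easier to audit.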
Use cases
- Testing prompt effectiveness for AI agents
- Validating context engineering choices
- Measuring improvement quality of AI outputs
- Developing robust evaluation pipelines for AI systems
Non-goals
- Developing AI agents themselves
- Automating all aspects of AI evaluation without human oversight
- Providing domain-specific evaluation rubrics outside of general AI agent assessment
Practices
- Evaluation methodology
- Prompt engineering
- Test design
- Bias mitigation
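A common bias-mitigation technique for pairwise LLM judging is position swapping: run each comparison twice with the candidate order reversed and accept only verdicts that agree. A minimal sketch, assuming the judge's verdict has already been parsed into the strings "A", "B", or "tie":

```python
def debiased_verdict(verdict_ab: str, verdict_ba: str) -> str:
    """Combine two pairwise judge verdicts, the second with A and B swapped.

    verdict_ab: judge's pick when shown (A, B)
    verdict_ba: judge's pick when shown (B, A)
    Disagreement counts as a tie, which neutralizes position bias: a
    judge that always prefers the first-shown answer will contradict
    itself across the two orderings and never produce a winner.
    """
    # Map the swapped run's labels back into the original ordering.
    unswapped = {"A": "B", "B": "A", "tie": "tie"}[verdict_ba]
    return verdict_ab if verdict_ab == unswapped else "tie"

print(debiased_verdict("A", "B"))  # consistent across orderings → "A"
print(debiased_verdict("A", "A"))  # judge favored position 1 both times → "tie"
```

The same swap-and-agree pattern extends to other judge biases (e.g. length bias) by perturbing the presentation rather than the order.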
Versioning
- Release Management: While the trust signals indicate a recent commit date, no explicit version is declared in the manifest or CHANGELOG, and the installation instructions reference 'main'.
Installation
Add the marketplace first:
/plugin marketplace add NeoLabHQ/context-engineering-kit
/plugin install customaize-agent@context-engineering-kit
Similar extensions
Create Command (score: 100)
Interactive assistant for creating new Claude commands with proper structure, patterns, and MCP tool integration
Project Development (score: 100)
This skill should be used when the user asks to "start an LLM project", "design batch pipeline", "evaluate task-model fit", "structure agent project", or mentions pipeline architecture, agent-assisted development, cost estimation, or choosing between LLM and traditional approaches.
Write A Skill (score: 100)
Create new agent skills with proper structure, progressive disclosure, and bundled resources. Use when the user wants to create, write, or build a new skill.
Context Compression (score: 100)
This skill should be used when the user asks to "compress context", "summarize conversation history", "implement compaction", "reduce token usage", or mentions context compression, structured summarization, tokens-per-task optimization, or long-running agent sessions exceeding context limits.
Arize Prompt Optimization (score: 100)
Optimizes, improves, and debugs LLM prompts using production trace data, evaluations, and annotations. Extracts prompts from spans, gathers performance signals, and runs a data-driven optimization loop using the ax CLI. Use when the user mentions optimizing a prompt, improving a prompt, making AI respond better, improving output quality, prompt engineering, prompt tuning, or system prompt improvement.
Prompt Optimization (score: 100)
Applies prompt repetition to improve accuracy for LLMs without reasoning capability.