跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Judge

技能 活跃

Launch a meta-judge then a judge sub-agent to evaluate results produced in the current conversation

目的

To provide a structured and objective evaluation of work produced within a conversation, ensuring quality through tailored criteria and evidence-based feedback.

功能

  • Launches meta-judge to generate evaluation criteria
  • Dispatches judge sub-agent for isolated context evaluation
  • Supports structured scoring with evidence-based feedback
  • Generates multi-dimensional rubrics tailored to artifact type
  • Provides report-only evaluation without automatic changes

使用场景

  • Evaluating code changes for quality and adherence to requirements
  • Assessing documentation for clarity and completeness
  • Reviewing analysis or reports for accuracy and evidence
  • Getting objective feedback on any conversational output

非目标

  • Automatically modifying or fixing the evaluated work
  • Performing destructive operations
  • Handling secrets or sensitive data
  • Replacing human review entirely, but augmenting it

工作流

  1. Identify work to evaluate based on conversation history and arguments.
  2. Extract evaluation context (original task, output, files, criteria).
  3. Present evaluation scope to the user.
  4. Dispatch meta-judge agent to generate evaluation specification YAML.
  5. Dispatch judge agent with work context and specification YAML.
  6. Validate the judge's evaluation report.
  7. Present the final evaluation report and verdict to the user.

Versioning

  • warning:Release ManagementNo explicit versioning is declared in the SKILL.md frontmatter or a changelog, and install instructions reference 'main', making version pinning difficult.

Practical Utility

  • info:Usage examplesWhile the SKILL.md details the workflow, explicit end-to-end usage examples with claimed output are not provided for the 'judge' command itself, though related plugins have examples.

安装

请先添加 Marketplace

/plugin marketplace add NeoLabHQ/context-engineering-kit
/plugin install sadd@context-engineering-kit

质量评分

95 /100
1 day ago 分析

信任信号

最近提交9 days ago
星标993
许可证GPL-3.0
状态
查看源代码