Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Arize Evaluator

Skill Verifiziert Aktiv

Handles LLM-as-judge evaluation workflows on Arize including creating/updating evaluators, running evaluations on spans or experiments, managing tasks, trigger-run operations, column mapping, and continuous monitoring. Use when the user mentions create evaluator, LLM judge, hallucination, faithfulness, correctness, relevance, run eval, score spans, score experiment, trigger-run, column mapping, continuous monitoring, or improve evaluator prompt.

Zweck

To enable users to efficiently set up and manage LLM-as-judge evaluation workflows on Arize, ensuring consistent and reliable assessment of model performance.

Funktionen

Create/update LLM-as-judge evaluators
Run evaluations on project spans or experiment runs
Manage AI integrations and model configurations
Configure column mappings and data granularity
Automate continuous monitoring and backfilling

Anwendungsfälle

When needing to evaluate LLM responses for hallucination, correctness, or relevance.
To set up automated quality checks for LLM outputs on new data.
When analyzing the performance of different LLM experiments against a dataset.
To configure continuous monitoring of LLM-as-judge evaluations for production systems.

Nicht-Ziele

Directly interacting with LLM APIs without using the 'ax' CLI and Arize platform.
Performing data analysis or visualization outside of the Arize platform.
Managing Arize project or dataset creation (delegated to other skills).

Workflow

Confirm project/experiment details and AI integration.
Create or select an LLM evaluator with specific templates and choices.
Determine column mappings from actual data.
Create a task (continuous, backfill, or both).
Trigger a backfill run if requested, then monitor.

Praktiken

LLM Evaluation
MLOps
Data Science Workflows
CLI Tooling

Voraussetzungen

ax CLI installed (v0.14.0+)
Configured Arize profile with API key
Arize space name or ID (via ARIZE_SPACE env var)
AI integration configured in Arize (for LLM provider credentials)

Installation

npx skills add github/awesome-copilot

Führt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.

Qualitätspunktzahl

Verifiziert

100 /100

Analysiert 1 day ago

Vertrauenssignale

Letzter Commit1 day ago

GitHub-Inhaber github

Sterne32.9k

LizenzMIT

Websiteawesome-copilot.github.com

Status

Quellcode ansehen

Arize Evaluator

Funktionen

Anwendungsfälle

Nicht-Ziele

Workflow

Praktiken

Voraussetzungen

Qualitätspunktzahl

Vertrauenssignale

Ähnliche Erweiterungen

Arize Experiment

Arize Dataset

ML Pipeline Workflow

Arize Prompt Optimization

Arize Ai Provider Integration

Trader Regime