此内容尚未提供您的语言版本,正在以英文显示。

CE Optimize

技能已验证活跃

Run metric-driven iterative optimization loops -- define a measurable goal, run parallel experiments, measure each against hard gates or LLM-as-judge scores, keep improvements, and converge on the best solution. Use when optimizing clustering quality, search relevance, build performance, prompt quality, or any measurable outcome that benefits from systematic experimentation.

目的

To systematically improve measurable outcomes through automated, iterative experimentation and convergence.

功能

Metric-driven iterative optimization loops
Support for hard metrics and LLM-as-judge
Automated experiment execution and measurement
Robust persistence and crash recovery
Scoped modification of code and configuration

使用场景

Optimizing build performance or test coverage
Tuning LLM prompts for quality and cost
Improving search relevance or clustering quality
Systematically experimenting with code or configuration variants

非目标

Implementing the core logic being optimized
Replacing manual code development entirely
Running experiments without a defined measurement harness
Performing optimizations that cannot be measured or evaluated systematically

实践

Experiment design
Iterative development
Metric definition
Code quality
MLOps

先决条件

Git repository
Bash shell
Python 3
The `ce-optimize` skill installed

安装

请先添加 Marketplace

/plugin marketplace add EveryInc/compound-engineering-plugin

/plugin install compound-engineering@compound-engineering-plugin

质量评分

已验证

100 /100

1 day ago 分析

信任信号

最近提交1 day ago

GitHub 所有者 EveryInc

星标16.7k

下载量 12.8k

许可证MIT

网站every.to

状态

查看源代码

类似扩展

Moyu (摸鱼)

100

감지된 과잉 엔지니어링 패턴: (1) 사용자가 명시적으로 요청하지 않은 코드나 파일을 수정할 때 (2) 요청되지 않은 새로운 추상화 계층(클래스, 인터페이스, 팩토리, 래퍼)을 생성할 때 (3) 요청되지 않은 주석, 문서, JSDoc, 타입 주석을 추가할 때 (4) 요청되지 않은 새로운 종속성을 도입할 때 (5) 최소 편집 대신 파일 전체를 다시 작성할 때 (6) diff 범위가 사용자의 요청을 명백히 초과할 때 (7) 사용자가 "너무 많아", "거기는 건드리지 마", "X만 변경해", "간단하게", "그만"과 같은 신호를 보낼 때 (8) 발생할 수 없는 시나리오에 대한 오류 처리, 유효성 검사, 방어적 코드를 추가할 때 (9) 요청되지 않은 테스트, 설정 스캐폴딩, 문서를 생성할 때

技能

uucz

Arize Experiment

100

Creates, runs, and analyzes Arize experiments for evaluating and comparing model performance. Covers experiment CRUD, exporting runs, comparing results, and evaluation workflows using the ax CLI. Use when the user mentions create experiment, run experiment, compare models, model performance, evaluate AI, experiment results, benchmark, A/B test models, or measure accuracy.

技能

github

Arize Prompt Optimization

100

Optimizes, improves, and debugs LLM prompts using production trace data, evaluations, and annotations. Extracts prompts from spans, gathers performance signal, and runs a data-driven optimization loop using the ax CLI. Use when the user mentions optimize prompt, improve prompt, make AI respond better, improve output quality, prompt engineering, prompt tuning, or system prompt improvement.

技能

github

Prompt Optimization

100

应用提示重复以提高非推理 LLM 的准确性

技能

asklokesh

Vector Index Tuning

Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.

技能

wshobson

Migrate Validate

100

Validate pending migrations for foreign key consistency, rollback safety, and best practices

技能

ruvnet