
Autoresearch Agent

Skill · Verified · Active

Autonomous experiment loop that optimizes any file by a measurable metric. Inspired by Karpathy's autoresearch. The agent edits a target file, runs a fixed evaluation, keeps improvements (git commit), discards failures (git reset), and loops indefinitely. Use when: user wants to optimize code speed, reduce bundle/image size, improve test pass rate, optimize prompts, improve content quality (headlines, copy, CTR), or run any measurable improvement loop. Requires: a target file, an evaluation command that outputs a metric, and a git repo.

Purpose

To automate and measure the optimization of any file based on a defined metric, enabling users to systematically improve code performance, content quality, or other measurable outcomes.

Features

  • Autonomous experiment loop for file optimization
  • Automated editing, evaluation, committing, and reverting
  • Support for various domains (engineering, marketing, content, prompts)
  • Configurable evaluation commands, metrics, and directions
  • Git integration for versioning and rollback
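
An evaluation command here is any program that prints a measurable number to stdout for the loop to parse. A minimal sketch of such a script, assuming a hypothetical `evaluate.py` that benchmarks a stand-in function (the function body and the printed format are illustrative assumptions, not part of the skill):

```python
# Hypothetical evaluation script (evaluate.py): times a hot function in the
# target file and prints one number, which the experiment loop parses as the
# metric (direction: minimize). The function `work` is a stand-in.
import timeit

def work() -> int:
    # stand-in for the code under optimization
    return sum(i * i for i in range(10_000))

elapsed = timeit.timeit(work, number=100)
print(f"{elapsed:.4f}")  # the loop reads this number as the metric
```

Any command works the same way, e.g. a test runner printing a pass rate (maximize) or `wc -c` on a bundle (minimize), as long as a single number can be parsed from its output.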

Use Cases

  • Use when you want to optimize code speed, reduce bundle/image size, or improve test pass rates.
  • Use when you need to improve content quality such as headlines, copy, or click-through rates.
  • Use when optimizing prompts for LLM interactions or agent instructions.
  • Use when running any measurable improvement loop that can be automated.

Non-Goals

  • Modifying the evaluation script or external dependencies.
  • Performing optimization without a clear, measurable metric.
  • Handling the initial setup of the target project's build or testing environment.

Workflow

  1. User runs `/ar:setup` to configure experiment parameters (target file, evaluation command, metric, direction).
  2. Script creates experiment directory, config files, and a git branch.
  3. User (or AI agent) calls `python scripts/run_experiment.py --single`.
  4. Script edits the target file based on AI's instruction (or a generated change).
  5. Script runs the evaluation command and parses the metric.
  6. Script compares the new metric to the best metric found so far.
  7. If improved, the change is committed; otherwise, the repo is reset.
  8. Result (keep/discard/crash) is logged to `results.tsv`.
  9. This loop continues until interrupted or a goal is met.
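
Steps 4–9 above can be sketched as a single iteration. This is a minimal sketch, not the skill's actual code: the helper names, the metric-parsing heuristic, and the commit message format are assumptions.

```python
import re
import subprocess

def parse_metric(eval_output: str) -> float:
    """Extract the last number printed by the evaluation command (assumed format)."""
    matches = re.findall(r"-?\d+(?:\.\d+)?", eval_output)
    if not matches:
        raise ValueError("evaluation output contained no metric")
    return float(matches[-1])

def is_improvement(new: float, best: float, direction: str) -> bool:
    """direction is 'minimize' (e.g. runtime, size) or 'maximize' (e.g. pass rate)."""
    return new < best if direction == "minimize" else new > best

def run_single_experiment(eval_cmd: str, best: float, direction: str) -> tuple[bool, float]:
    """Run the fixed evaluation; commit the edit on improvement, hard-reset otherwise."""
    out = subprocess.run(eval_cmd, shell=True, capture_output=True, text=True)
    metric = parse_metric(out.stdout)
    if is_improvement(metric, best, direction):
        subprocess.run(["git", "commit", "-am", f"keep: metric={metric}"], check=True)
        return True, metric
    subprocess.run(["git", "reset", "--hard"], check=True)
    return False, best
```

The driving loop would call `run_single_experiment` repeatedly, carrying the best metric forward and appending each keep/discard outcome to `results.tsv`.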

Installation

First, add the marketplace:

/plugin marketplace add alirezarezvani/claude-skills
/plugin install autoresearch-agent@claude-code-skills

Quality Score

Verified
99/100
Analyzed about 21 hours ago

Trust Signals

Last commit: 1 day ago
Stars: 14.6k
License: MIT
Status
View source code

Similar Extensions

Project Session Manager

100

Worktree-first dev environment manager for issues, PRs, and features with optional tmux sessions

Skill
Yeachan-Heo

Oh My Claudecode

100

Process-first advisor routing for Claude, Codex, or Gemini via `omc ask`, with artifact capture and no raw CLI assembly

Skill
Yeachan-Heo

Using Git Worktrees

100

Use this when starting feature work that requires isolation from the current workspace, or before executing implementation plans. Ensures an isolated workspace exists via native tools or a git-worktree fallback.

Skill
obra

CE Optimize

100

Run metric-driven iterative optimization loops -- define a measurable goal, run parallel experiments, measure each against hard gates or LLM-as-judge scores, keep improvements, and converge on the best solution. Use when optimizing clustering quality, search relevance, build performance, prompt quality, or any measurable outcome that benefits from systematic experimentation.

Skill
EveryInc

Rule Effectiveness Analysis

100

Analyze which rules are actively used vs inert. Detect coverage gaps. Recommend pruning to reduce token consumption.

Skill
luiseiman

Arize Prompt Optimization

100

Optimizes, improves, and debugs LLM prompts using production trace data, evaluations, and annotations. Extracts prompts from spans, gathers performance signal, and runs a data-driven optimization loop using the ax CLI. Use when the user mentions optimize prompt, improve prompt, make AI respond better, improve output quality, prompt engineering, prompt tuning, or system prompt improvement.

Skill
github