Cost Benchmark
Skill Verifiziert AktivRun the corpus benchmark — booster locally, optional Gemini/Sonnet/Opus baselines — and persist a verifiable measured-vs-claimed table
To provide a verifiable, local benchmark for LLM cost performance, ensuring accuracy in claimed costs and facilitating performance audits.
Funktionen
- Run local corpus benchmark
- Support optional Gemini/Sonnet/Opus baselines
- Persist verifiable measured-vs-claimed table
- Documented environment overrides for customization
- Output historical and latest benchmark runs
Anwendungsfälle
- Verify release cost claims before publishing
- Confirm new benchmark cases route correctly
- Audit 'claimed upstream' tags by verifying benchmark support
- Compare costs of different LLM models for specific tasks
Nicht-Ziele
- Modifying benchmark corpus data
- Running benchmarks against live production systems
- Automated remediation of cost regressions
Installation
Zuerst Marketplace hinzufügen
/plugin marketplace add ruvnet/ruflo/plugin install ruflo-cost-tracker@rufloQualitätspunktzahl
VerifiziertVertrauenssignale
Ähnliche Erweiterungen
Janitor Tokens
100Zeigt an, wie viele Token im Kontextfenster jede Fähigkeit verbraucht. Verwenden Sie dies, wenn der Benutzer nach Token-Kosten, Budget, Kapazität oder nach Fähigkeiten fragt, die am meisten Kontextspeicherplatz verschwenden.
Cloud Architect
100Designs cloud architectures, creates migration plans, generates cost optimization recommendations, and produces disaster recovery strategies across AWS, Azure, and GCP. Use when designing cloud architectures, planning migrations, or optimizing multi-cloud deployments. Invoke for Well-Architected Framework, cost optimization, disaster recovery, landing zones, security architecture, serverless design.
Chat Format
100Format prompts for different LLM providers with chat templates and HNSW-powered context retrieval
Oh My Claudecode
100Process-first advisor routing for Claude, Codex, or Gemini via `omc ask`, with artifact capture and no raw CLI assembly
Wrap Up Ritual
100End-of-session ritual that audits changes, runs quality checks, captures learnings, and produces a session summary. Use when saying "wrap up", "done for the day", "finish coding", or ending a coding session.
Project Development
100This skill should be used when the user asks to "start an LLM project", "design batch pipeline", "evaluate task-model fit", "structure agent project", or mentions pipeline architecture, agent-assisted development, cost estimation, or choosing between LLM and traditional approaches.