Cost Benchmark
Run the corpus benchmark (booster locally, with optional Gemini/Sonnet/Opus baselines) and persist a verifiable measured-vs-claimed table.
To provide a verifiable, local benchmark for LLM cost performance, ensuring accuracy in claimed costs and facilitating performance audits.
Features
- Run local corpus benchmark
- Support optional Gemini/Sonnet/Opus baselines
- Persist verifiable measured-vs-claimed table
- Documented environment overrides for customization
- Output historical and latest benchmark runs
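The core output described above is a measured-vs-claimed table. As a minimal sketch of what such a table could look like, the snippet below computes per-run deltas and persists them as CSV. All names here (record fields, model labels, the corpus cases) are illustrative assumptions, not the plugin's actual schema.

```python
import csv
import io

# Hypothetical benchmark records: the real plugin's corpus, field names,
# and storage format are not documented here, so these are assumptions.
runs = [
    {"case": "summarize-10k", "model": "booster-local",
     "claimed_usd": 0.0042, "measured_usd": 0.0039},
    {"case": "summarize-10k", "model": "gemini-baseline",
     "claimed_usd": 0.0110, "measured_usd": 0.0125},
]

def measured_vs_claimed(rows):
    """Attach an absolute and relative delta to each benchmark run."""
    out = []
    for r in rows:
        delta = r["measured_usd"] - r["claimed_usd"]
        out.append({**r,
                    "delta_usd": round(delta, 4),
                    "delta_pct": round(100 * delta / r["claimed_usd"], 1)})
    return out

def persist(rows, fh):
    """Write the table as CSV so later runs can be appended and audited."""
    writer = csv.DictWriter(fh, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)

table = measured_vs_claimed(runs)
buf = io.StringIO()
persist(table, buf)
print(buf.getvalue())
```

Persisting plain rows with explicit deltas keeps every historical run independently verifiable, which is what a cost audit needs.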
Use Cases
- Verify release cost claims before publishing
- Confirm new benchmark cases route correctly
- Audit 'claimed upstream' tags by verifying benchmark support
- Compare costs of different LLM models for specific tasks
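The last use case, comparing model costs for a specific task, reduces to pricing each model's token usage. A minimal sketch follows; the per-million-token prices are made up for illustration only (real prices vary and the plugin's actual pricing table is not documented here).

```python
# Assumed per-million-token prices, for illustration only.
PRICES_PER_MTOK = {
    "gemini": {"in": 1.25, "out": 5.00},
    "sonnet": {"in": 3.00, "out": 15.00},
    "opus":   {"in": 15.00, "out": 75.00},
}

def run_cost(model, in_tokens, out_tokens):
    """Cost in USD for one benchmark case under the assumed price table."""
    p = PRICES_PER_MTOK[model]
    return (in_tokens * p["in"] + out_tokens * p["out"]) / 1_000_000

def cheapest(models, in_tokens, out_tokens):
    """Pick the lowest-cost model for a given token profile."""
    return min(models, key=lambda m: run_cost(m, in_tokens, out_tokens))

# e.g. a benchmark case with 20k input tokens and 2k output tokens
for m in PRICES_PER_MTOK:
    print(m, round(run_cost(m, 20_000, 2_000), 4))
print("cheapest:", cheapest(list(PRICES_PER_MTOK), 20_000, 2_000))
```

Because input and output tokens are priced differently, the cheapest model can change with the task's input/output ratio, which is why per-case benchmarking matters.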
Non-Goals
- Modifying benchmark corpus data
- Running benchmarks against live production systems
- Automated remediation of cost regressions
Installation
First, add the marketplace:
/plugin marketplace add ruvnet/ruflo

Then install the plugin:
/plugin install ruflo-cost-tracker@ruflo
Similar Extensions
- Janitor Tokens: Show how many context window tokens each skill consumes. Use when the user asks about token cost, context budget, skill size, or wants to know which skills waste the most context space.
- Cloud Architect: Designs cloud architectures, creates migration plans, generates cost optimization recommendations, and produces disaster recovery strategies across AWS, Azure, and GCP. Use when designing cloud architectures, planning migrations, or optimizing multi-cloud deployments. Invoke for Well-Architected Framework, cost optimization, disaster recovery, landing zones, security architecture, serverless design.
- Chat Format: Format prompts for different LLM providers with chat templates and HNSW-powered context retrieval.
- Oh My Claudecode: Process-first advisor routing for Claude, Codex, or Gemini via `omc ask`, with artifact capture and no raw CLI assembly.
- Wrap Up Ritual: End-of-session ritual that audits changes, runs quality checks, captures learnings, and produces a session summary. Use when saying "wrap up", "done for the day", "finish coding", or ending a coding session.
- Project Development: Use when the user asks to "start an LLM project", "design batch pipeline", "evaluate task-model fit", "structure agent project", or mentions pipeline architecture, agent-assisted development, cost estimation, or choosing between LLM and traditional approaches.