Ruflo Cost Tracker

Plugin Verified Active
Part of: Ruflo

Token usage tracking, per-agent model cost attribution, budget alerts, and optimization recommendations. Uses memory_* (namespace-routed) for the cost-tracking and cost-patterns namespaces; pairs with the federation budget circuit breaker (ADR-097).

13 Skills, 0 MCPs

Purpose

Give AI agents and their operators granular visibility into, and control over, token usage costs so they can budget and optimize spending.

Features

  • Token usage tracking per agent, task, and model
  • USD cost attribution using current model pricing
  • Configurable budget monitoring with tiered alerts
  • Cost optimization recommendations with estimated savings
  • Telemetry export to Prometheus and webhooks
  • Integration with Agent Booster for zero-cost transforms
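
The per-agent USD attribution listed above can be pictured with a minimal sketch. It is not the plugin's implementation: the model names, the per-million-token rates in PRICING, and the Usage record are all hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass

# Hypothetical USD rates per one million tokens. These are placeholders,
# not real pricing for any model.
PRICING = {
    "model-small": {"input": 1.00, "output": 5.00},
    "model-large": {"input": 10.00, "output": 50.00},
}


@dataclass
class Usage:
    """One usage record (hypothetical shape)."""
    agent: str
    model: str
    input_tokens: int
    output_tokens: int


def usage_cost(u: Usage) -> float:
    """USD cost of a single record at the rates in PRICING."""
    rate = PRICING[u.model]
    total = u.input_tokens * rate["input"] + u.output_tokens * rate["output"]
    return total / 1_000_000


def cost_by_agent(records):
    """Sum USD cost per agent across all records."""
    totals = {}
    for u in records:
        totals[u.agent] = totals.get(u.agent, 0.0) + usage_cost(u)
    return totals
```

Grouping by task or model instead of agent would only change the dictionary key used in cost_by_agent.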

Use Cases

  • Monitor daily/weekly/monthly AI agent spending.
  • Identify agents or conversations consuming excessive budget.
  • Receive alerts when approaching or exceeding budget thresholds.
  • Optimize AI agent workflows to reduce LLM token costs.

Non-Goals

  • Replacing core LLM functionality.
  • Directly managing LLM model pricing (assumes fixed, documented rates).
  • Real-time, in-flight LLM cost modification (focuses on post-usage analysis and optimization).

Installation

First, add the marketplace, then install the plugin:

/plugin marketplace add ruvnet/ruflo
/plugin install ruflo-cost-tracker@ruflo

Contains 13 extensions

Skill (13)

Cost Benchmark Skill

Run the corpus benchmark — booster locally, optional Gemini/Sonnet/Opus baselines — and persist a verifiable measured-vs-claimed table

99
Cost Booster Edit Skill

Apply a simple code transform via agent-booster's WASM engine — sub-millisecond, deterministic, $0 (no LLM call). Companion to cost-booster-route.

95
Cost Booster Route Skill

Route tasks through hooks_route, partition by Agent Booster availability, and report Tier 1 bypass utilization with $0 cost

99
Cost Budget Check Skill

Read accumulated cost-tracking spend + budget config, compute utilization, emit 50/75/90/100% alert ladder

99
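
As a rough illustration of the 50/75/90/100% alert ladder described for Cost Budget Check, the sketch below computes utilization from accumulated spend and a budget and reports which rungs have been crossed. The function name and the returned dictionary shape are assumptions for illustration, not the skill's actual output format.

```python
# Alert thresholds in percent, taken from the skill description.
ALERT_LADDER = (50, 75, 90, 100)


def budget_status(spent_usd, budget_usd):
    """Return utilization and the alert rungs crossed so far.

    The result shape is hypothetical; only the ladder values come
    from the skill description.
    """
    if budget_usd <= 0:
        raise ValueError("budget must be positive")
    utilization = spent_usd * 100 / budget_usd
    crossed = [t for t in ALERT_LADDER if utilization >= t]
    return {
        "utilization_pct": round(utilization, 1),
        "crossed": crossed,
        "highest": crossed[-1] if crossed else None,
    }
```

For example, budget_status(38, 50) reports 76.0% utilization with the 50 and 75 rungs crossed.
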
Cost Compact Context Skill

Wrap getTokenOptimizer().getCompactContext() to retrieve compacted ReasoningBank context for cost-analysis queries; report bridge-reported tokensSaved

97
Cost Conversation View Skill

Per-conversation cost view — list every session in cost-tracking with started-at, message count, top model, and total cost

98
Cost Export Skill

Export cost-tracking telemetry in Prometheus textfile or webhook JSON formats — for external observability (Grafana, Datadog, custom dashboards)

98
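
For readers unfamiliar with the Prometheus textfile format mentioned in Cost Export, here is a minimal sketch of rendering per-model spend as one gauge. The metric name cost_tracker_spend_usd and the model label are hypothetical; the skill's real metric names are not documented on this page.

```python
def escape_label(value):
    """Escape backslash, double quote, and newline for a label value."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )


def to_prometheus(cost_by_model):
    """Render {model: usd} as Prometheus text exposition lines.

    The metric name is a hypothetical placeholder.
    """
    lines = [
        "# HELP cost_tracker_spend_usd Accumulated spend in USD per model.",
        "# TYPE cost_tracker_spend_usd gauge",
    ]
    for model, usd in sorted(cost_by_model.items()):
        label = escape_label(model)
        lines.append(f'cost_tracker_spend_usd{{model="{label}"}} {usd:.6f}')
    return "\n".join(lines) + "\n"
```

Writing the returned string to a .prom file in a directory watched by the node_exporter textfile collector is the usual way such output reaches Grafana.
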
Cost Federation Skill

Consumer-side wiring for ADR-097 Phase 3 federation_spend events — per-peer rolling windows + suspension-threshold check

98
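
The per-peer rolling window and suspension-threshold check can be sketched as follows. ADR-097 is not reproduced on this page, so the class name, the event shape (peer, USD amount, timestamp), and the "suspend when windowed spend reaches the threshold" rule are all assumptions made for illustration.

```python
from collections import defaultdict, deque


class PeerSpendWindow:
    """Track spend per peer over a sliding time window (hypothetical design)."""

    def __init__(self, window_seconds, suspend_at_usd):
        self.window_seconds = window_seconds
        self.suspend_at_usd = suspend_at_usd
        # peer -> deque of (timestamp, usd), oldest first
        self._events = defaultdict(deque)

    def record(self, peer, usd, now):
        """Store one spend event for a peer at time `now` (seconds)."""
        self._events[peer].append((now, usd))

    def spend(self, peer, now):
        """Total spend for a peer inside the window ending at `now`."""
        events = self._events[peer]
        cutoff = now - self.window_seconds
        # Drop events that have aged out of the window.
        while events and events[0][0] <= cutoff:
            events.popleft()
        return sum(usd for _, usd in events)

    def should_suspend(self, peer, now):
        """True once windowed spend reaches the suspension threshold."""
        return self.spend(peer, now) >= self.suspend_at_usd
```

Passing `now` explicitly, rather than reading the clock inside the class, keeps the window logic deterministic and easy to test.
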
Cost Optimize Skill

Analyze token usage patterns and recommend cost optimizations with estimated savings

95
Cost Report Skill

Generate a cost report showing token usage and USD costs by agent and model

78
Cost Summary Skill

Single-shot programmatic dump of all cost data — total spend, per-tier, top session, budget status, federation aggregate. JSON or markdown.

98
Cost Track Skill

Auto-capture per-session token usage from the Claude Code session jsonl and persist to the cost-tracking namespace

97
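
A minimal sketch of the capture step in Cost Track, assuming each line of the session jsonl is a JSON object whose token counts sit under message.usage as input_tokens and output_tokens. That field layout is an assumption for illustration, not a documented schema, and persisting to the cost-tracking namespace is left out.

```python
import json


def sum_usage(jsonl_lines):
    """Total token usage across jsonl lines.

    Assumes a hypothetical record shape:
    {"message": {"usage": {"input_tokens": int, "output_tokens": int}}}
    Lines that are blank, invalid JSON, or lack usage data are skipped.
    """
    totals = {"input_tokens": 0, "output_tokens": 0}
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue
        message = record.get("message") if isinstance(record, dict) else None
        usage = message.get("usage") if isinstance(message, dict) else None
        if not isinstance(usage, dict):
            continue
        for key in totals:
            totals[key] += int(usage.get(key, 0))
    return totals
```

Because the function accepts any iterable of strings, an open file handle can be passed directly.
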
Cost Trend Skill

Read every docs/benchmarks/runs/*.json and surface drift in win rate, latency, escalation rate, and LLM-baseline cost over time

97

Quality Score

Verified
98/100
Analyzed about 14 hours ago

Trust Signals

Last commit: about 15 hours ago
Stars: 50.2k
License: MIT

© 2025 SkillRepo · Find the right skill, skip the noise.