Cost Optimize
Skill Verified ActiveAnalyze token usage patterns and recommend cost optimizations with estimated savings
To help users reduce LLM operational costs by analyzing token usage patterns and providing actionable recommendations for optimization.
Features
- Analyze token usage patterns
- Recommend cost optimizations
- Estimate dollar savings
- Assess model fit and cache utilization
- Detect agent redundancy
Use Cases
- When LLM costs exceed expectations
- To proactively reduce spending on LLM usage
- To optimize model selection for different task complexities
- To improve cache utilization for cost reduction
Non-Goals
- Performing actual cost-saving actions
- Managing external cloud billing accounts
- Analyzing costs unrelated to token usage
Workflow
- Load usage data from the 'cost-tracking' namespace
- Analyze model tier fit for task complexity
- Check cache hit rates per agent
- Detect redundant agents or overlapping tasks
- Estimate savings for each recommendation
- Search for prior optimization patterns
- Store new optimization patterns
- Emit model outcome signals for learning
- Report ranked recommendations and total savings
Practical Utility
- info:Usage examplesWhile the SKILL.md describes the workflow and tool usage, explicit end-to-end, ready-to-use examples demonstrating input, invocation, and output are missing.
- info:Edge casesThe SKILL.md mentions analyzing usage data and potential issues like model fit and cache rates, but doesn't explicitly detail failure modes or recovery steps for malformed input or credential issues.
Installation
First, add the marketplace
/plugin marketplace add ruvnet/ruflo/plugin install ruflo-cost-tracker@rufloQuality Score
VerifiedTrust Signals
Similar Extensions
Janitor Tokens
100Show how many context window tokens each skill consumes. Use when the user asks about token cost, context budget, skill size, or wants to know which skills waste the most context space.
Cloud Architect
100Designs cloud architectures, creates migration plans, generates cost optimization recommendations, and produces disaster recovery strategies across AWS, Azure, and GCP. Use when designing cloud architectures, planning migrations, or optimizing multi-cloud deployments. Invoke for Well-Architected Framework, cost optimization, disaster recovery, landing zones, security architecture, serverless design.
Cost Booster Route
99Route tasks through hooks_route, partition by Agent Booster availability, and report Tier 1 bypass utilization with $0 cost
Cost Benchmark
99Run the corpus benchmark — booster locally, optional Gemini/Sonnet/Opus baselines — and persist a verifiable measured-vs-claimed table
Cost Mode
99Cost-conscious Claude Code mode. Reduces output tokens 40-70% and overall costs 30-60% by enforcing concise responses, smart model routing, and efficient workflow patterns. Keeps full technical accuracy. Activate with /cost-mode or "enable cost mode". Auto-triggers on mentions of budget, cost, tokens, or spending.
OpenRouter AI Service Skill
99OpenRouter unified AI API - Access 200+ LLMs through single interface with intelligent routing, streaming, cost optimization, and model fallbacks