Cost Mode
Skill Verified ActiveCost-conscious Claude Code mode. Reduces output tokens 40-70% and overall costs 30-60% by enforcing concise responses, smart model routing, and efficient workflow patterns. Keeps full technical accuracy. Activate with /cost-mode or "enable cost mode". Auto-triggers on mentions of budget, cost, tokens, or spending.
To significantly reduce Claude Code operational costs by minimizing token usage and intelligently routing tasks to the most cost-effective models, while maintaining technical accuracy.
Features
- Enforces concise responses and cuts filler content
- Suggests cheaper, viable LLM models for tasks
- Recommends CLI tools for deterministic tasks
- Manages session awareness to prevent token bloat
- Offers adjustable intensity levels (lite, standard, strict)
Use Cases
- Reducing budget for frequent Claude Code usage
- Maintaining technical accuracy while minimizing output verbosity
- Guiding LLM interactions for cost-effectiveness in daily development
- Ensuring efficiency for repetitive or deterministic coding tasks
Non-Goals
- Sacrificing technical accuracy for brevity
- Reducing clarity on security warnings or destructive operation confirmations
- Writing code in commits, PRs, or generated files differently
- Implementing cost-saving measures that confuse the user or hinder complex concept explanation
Installation
/plugin install claude-cost-optimizer@Sagargupta16-claude-cost-optimizerQuality Score
VerifiedTrust Signals
Similar Extensions
Create Command
100Interactive assistant for creating new Claude commands with proper structure, patterns, and MCP tool integration
Validate Plugin
100Validate a Claude Code plugin structure, frontmatter, and MCP tool references
Running Claude Code Via Litellm Copilot
100Use when routing Claude Code through a local LiteLLM proxy to GitHub Copilot, reducing direct Anthropic spend, configuring ANTHROPIC_BASE_URL or ANTHROPIC_MODEL overrides, or troubleshooting Copilot proxy setup failures such as model-not-found, no localhost traffic, or GitHub 401/403 auth errors.
Guard
100Protect Claude Code sessions from context overflow by running a background daemon that monitors session size and auto-prunes before compaction hits. Use when the user says "guard", "protect session", "context getting long", "prevent compaction", "session management", or is running agent teams that need continuous context protection.
Rule Effectiveness Analysis
100Analyze which rules are actively used vs inert. Detect coverage gaps. Recommend pruning to reduce token consumption.
Bootstrap Project
100Initializes the Claude Code configuration in a new or existing project using the dotforge template.