Skip to main content

Manage Token Budget

Skill Verified Active

Monitor, cap, and recover from context accumulation in agentic systems. Covers per-cycle cost tracking, context window auditing, budget caps with enforcement policies, emergency pruning when approaching limits, and progressive disclosure integration to minimize token spend on routing. Use when running long-lived agent loops (heartbeats, polling, autonomous workflows), when context windows are growing unpredictably between cycles, when API costs spike beyond expected baselines, when designing new agentic workflows that need cost guardrails from the start, or when post-mortem analysis reveals a cost incident caused by context accumulation.

Purpose

To control the cost and context footprint of agentic systems by implementing robust token budget management and ensuring efficient context window utilization.

Features

  • Per-cycle cost tracking
  • Context window auditing
  • Budget caps with enforcement policies
  • Emergency context pruning
  • Progressive disclosure for routing

Use Cases

  • Running long-lived agent loops
  • Managing unpredictable context growth
  • Controlling API cost spikes
  • Designing agentic workflows with cost guardrails

Non-Goals

  • Replacing core agentic system logic
  • Managing external application costs outside the agent's context
  • Providing a generic log analysis tool

Installation

/plugin install agent-almanac@pjt222-agent-almanac

Quality Score

Verified
98 /100
Analyzed about 21 hours ago

Trust Signals

Last commit1 day ago
Stars14
LicenseMIT
Status
View Source

Similar Extensions

Context Mode Ops

100

Manage context-mode GitHub issues, PRs, releases, and marketing with parallel subagent army. Orchestrates 10-20 dynamic agents per task. Use when triaging issues, reviewing PRs, releasing versions, writing LinkedIn posts, announcing releases, fixing bugs, merging contributions, validating ENV vars, testing adapters, or syncing branches.

Skill
mksglu

Janitor Tokens

100

Show how many context window tokens each skill consumes. Use when the user asks about token cost, context budget, skill size, or wants to know which skills waste the most context space.

Skill
khendzel

Cloud Architect

100

Designs cloud architectures, creates migration plans, generates cost optimization recommendations, and produces disaster recovery strategies across AWS, Azure, and GCP. Use when designing cloud architectures, planning migrations, or optimizing multi-cloud deployments. Invoke for Well-Architected Framework, cost optimization, disaster recovery, landing zones, security architecture, serverless design.

Skill
jeffallan

Cost Booster Route

99

Route tasks through hooks_route, partition by Agent Booster availability, and report Tier 1 bypass utilization with $0 cost

Skill
ruvnet

Cost Benchmark

99

Run the corpus benchmark — booster locally, optional Gemini/Sonnet/Opus baselines — and persist a verifiable measured-vs-claimed table

Skill
ruvnet

Cost Mode

99

Cost-conscious Claude Code mode. Reduces output tokens 40-70% and overall costs 30-60% by enforcing concise responses, smart model routing, and efficient workflow patterns. Keeps full technical accuracy. Activate with /cost-mode or "enable cost mode". Auto-triggers on mentions of budget, cost, tokens, or spending.

Skill
Sagargupta16

© 2025 SkillRepo · Find the right skill, skip the noise.