Observability Monitoring
插件 已验证 活跃Metrics collection, logging infrastructure, distributed tracing, SLO implementation, and monitoring dashboards
To enable users to build and manage production-ready monitoring, logging, tracing, and reliability systems for their applications and infrastructure.
功能
- Metrics collection and monitoring with Prometheus
- Distributed tracing implementation with OpenTelemetry
- Log aggregation and analysis setup
- SLI/SLO definition and error budget management
- Grafana dashboard creation for visualization
使用场景
- Implementing a comprehensive observability stack
- Setting up monitoring for microservices architecture
- Defining and tracking Service Level Objectives (SLOs)
- Troubleshooting performance bottlenecks with distributed tracing
非目标
- Providing a fully managed SaaS observability solution
- Replacing existing monitoring tools without integration
- Implementing application code directly
安装
请先添加 Marketplace
/plugin marketplace add wshobson/agents/plugin install observability-monitoring@claude-code-workflows包含 4 个扩展
Skill (4)
Implement distributed tracing with Jaeger and Tempo to track requests across microservices and identify performance bottlenecks. Use when debugging microservices, analyzing request flows, or implementing observability for distributed systems.
Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.
Set up Prometheus for comprehensive metric collection, storage, and monitoring of infrastructure and applications. Use when implementing metrics collection, setting up monitoring infrastructure, or configuring alerting systems.
Define and implement Service Level Indicators (SLIs) and Service Level Objectives (SLOs) with error budgets and alerting. Use when establishing reliability targets, implementing SRE practices, or measuring service performance.
质量评分
已验证类似扩展
Claude Hud
100Real-time statusline HUD for Claude Code - displays context usage, tool activity, agent tracking, and todo progress
Data Validation Suite
99Schema validation, data quality monitoring, streaming validation pipelines, and input validation for backend APIs
Claude Code Hooks
99为 Claude Code 的自动化运行提供生产环境安全钩子。包括上下文监控、语法检查、分支保护、活动日志记录等。
X Twitter Scraper
99X (Twitter) 实时数据平台技能,提供 REST API(100 多个端点)、MCP 服务器(2 个工具)和 Webhook。涵盖推文搜索、用户查找、时间线、提取、监控、赠品抽奖、积分、支持以及经过确认的私有读取、写入操作、Webhook、监控和按使用付费流程。每次调用读取价格为 $0.00015。
Llm Cost Optimizer
99Use when you need to reduce LLM API spend, control token usage, route between models by cost/quality, implement prompt caching, or build cost observability for AI features. Triggers: 'my AI costs are
Slo Architect
99End-to-end SLO/SLI/error-budget discipline per Google SRE Workbook. Ships SLO designer (refuses to render without required fields), error-budget calculator with multi-window burn-rate alert thresholds (PromQL-shaped), and SLO reviewer that catches the 7 common bugs (target too high, window too short, no SLI definition, CPU-as-SLI, etc.). 4 references on principles + SLI design + error budget math + composition with feature-flags-architect/chaos-engineering/kubernetes-operator. Asset templates for SLO YAML and error budget policy. /slo-design slash command. NOT a generic observability skill.