Grafana Dashboards
技能 已验证 活跃Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.
Enable users to design and implement effective Grafana dashboards for monitoring applications, infrastructure, and business metrics with clear examples and best practices.
功能
- Generate Grafana dashboard JSON configurations
- Illustrate various panel types (Stat, Graph, Table, Heatmap)
- Provide examples for dashboard variables and alerts
- Detail dashboard provisioning methods (file, Terraform, Ansible)
- Offer best practices for dashboard design and implementation
使用场景
- Visualize Prometheus metrics in Grafana
- Create custom operational observability interfaces
- Implement SLO dashboards for service level objectives
- Monitor infrastructure resource utilization and performance
非目标
- Directly provision Grafana dashboards into a running instance
- Provide a runtime agent for live metric collection
- Configure Prometheus or other data sources
Practical Utility
- info:Edge casesWhile best practices are listed, specific failure modes and recovery steps for edge cases are not explicitly documented in the skill.
安装
请先添加 Marketplace
/plugin marketplace add wshobson/agents/plugin install observability-monitoring@claude-code-workflows质量评分
已验证类似扩展
Service Mesh Observability
98Implement comprehensive observability for service meshes including distributed tracing, metrics, and visualization. Use when setting up mesh monitoring, debugging latency issues, or implementing SLOs for service communication.
Observability Designer
100Observability Designer (POWERFUL)
Monitor Stream
99Stream live swarm events using the Monitor tool for real-time observability
LangSmith Observability
99LLM observability platform for tracing, evaluation, and monitoring. Use when debugging LLM applications, evaluating model outputs against datasets, monitoring production systems, or building systematic testing pipelines for AI applications.
Plan Capacity
99Perform capacity planning using historical metrics and growth models. Use predict_linear for forecasting, identify resource constraints, calculate headroom, and recommend scaling actions before saturation. Use before seasonal traffic spikes or product launches, during quarterly capacity reviews, when resource utilization trends upward, or before budget planning cycles.
Define SLO/SLI/SLA
99Establish Service Level Objectives (SLO), Service Level Indicators (SLI), and Service Level Agreements (SLA) with error budget tracking, burn rate alerts, and automated reporting using Prometheus and tools like Sloth or Pyrra. Use when defining reliability targets for customer-facing services, balancing feature velocity against system reliability through error budgets, migrating from arbitrary uptime goals to data-driven metrics, or implementing Site Reliability Engineering practices.