Setup Prometheus Monitoring
技能 已验证 活跃Configure Prometheus for time-series metrics collection, including scrape configurations, service discovery, recording rules, and federation patterns for multi-cluster deployments. Use when setting up centralized metrics collection for microservices, implementing time-series monitoring for application and infrastructure, establishing a foundation for SLO/SLI tracking and alerting, or migrating from legacy monitoring solutions to a modern observability stack.
To guide users through setting up a production-ready Prometheus monitoring instance, enabling centralized metrics collection, time-series monitoring, and a foundation for SLO/SLI tracking.
功能
- Prometheus installation and basic configuration
- Dynamic service discovery setup (Kubernetes, file-based, Consul)
- Configuration of recording rules for pre-aggregation
- Optimization of storage and retention policies
- Federation setup for multi-cluster environments
- High availability deployment guidance
使用场景
- Setting up centralized metrics collection for microservices
- Implementing time-series monitoring for applications and infrastructure
- Establishing a foundation for SLO/SLI tracking and alerting
- Consolidating metrics from multiple Prometheus instances via federation
- Migrating legacy monitoring solutions to Prometheus
非目标
- Setting up Alertmanager or Grafana dashboards
- Writing custom Prometheus exporters
- Advanced Prometheus query optimization beyond rule aggregation
- Detailed analysis of specific metrics beyond configuration
Code Execution
- info:ValidationThe SKILL.md includes validation steps like 'promtool check config' and 'promtool check rules', but there's no explicit mention of a schema library for input validation within the bash scripts themselves.
安装
/plugin install agent-almanac@pjt222-agent-almanac质量评分
已验证类似扩展
Grafana Dashboards
99Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.
Service Mesh Observability
98Implement comprehensive observability for service meshes including distributed tracing, metrics, and visualization. Use when setting up mesh monitoring, debugging latency issues, or implementing SLOs for service communication.
Azure Monitor Query Py
100Azure Monitor Query SDK for Python. Use for querying Log Analytics workspaces and Azure Monitor metrics. Triggers: "azure-monitor-query", "LogsQueryClient", "MetricsQueryClient", "Log Analytics", "Kusto queries", "Azure metrics".
Query Netdata Cloud
100Query Netdata Cloud via its REST API -- metrics, logs (systemd-journal / windows-events / otel-logs), topology graphs (topology:snmp), network flows (flows:netflow), alerts, dynamic configuration (DynCfg), and generic Functions on a node. Use when the user asks about querying Netdata Cloud, fetching metrics from the cloud, querying logs / topology / netflow / sflow / ipfix through Cloud, listing or modifying configurations via DynCfg, calling agent Functions through Cloud, listing spaces/rooms/nodes, or building a curl command against `app.netdata.cloud`. Pairs with the `query-netdata-agents` skill when direct-agent access is needed.
Observability Designer
100Observability Designer (POWERFUL)
Monitor Stream
99Stream live swarm events using the Monitor tool for real-time observability