此内容尚未提供您的语言版本,正在以英文显示。

Setup Prometheus Monitoring

技能已验证活跃

Configure Prometheus for time-series metrics collection, including scrape configurations, service discovery, recording rules, and federation patterns for multi-cluster deployments. Use when setting up centralized metrics collection for microservices, implementing time-series monitoring for application and infrastructure, establishing a foundation for SLO/SLI tracking and alerting, or migrating from legacy monitoring solutions to a modern observability stack.

目的

To guide users through setting up a production-ready Prometheus monitoring instance, enabling centralized metrics collection, time-series monitoring, and a foundation for SLO/SLI tracking.

功能

Prometheus installation and basic configuration
Dynamic service discovery setup (Kubernetes, file-based, Consul)
Configuration of recording rules for pre-aggregation
Optimization of storage and retention policies
Federation setup for multi-cluster environments
High availability deployment guidance

使用场景

Setting up centralized metrics collection for microservices
Implementing time-series monitoring for applications and infrastructure
Establishing a foundation for SLO/SLI tracking and alerting
Consolidating metrics from multiple Prometheus instances via federation
Migrating legacy monitoring solutions to Prometheus

非目标

Setting up Alertmanager or Grafana dashboards
Writing custom Prometheus exporters
Advanced Prometheus query optimization beyond rule aggregation
Detailed analysis of specific metrics beyond configuration

Code Execution

info:ValidationThe SKILL.md includes validation steps like 'promtool check config' and 'promtool check rules', but there's no explicit mention of a schema library for input validation within the bash scripts themselves.

安装

/plugin install agent-almanac@pjt222-agent-almanac

质量评分

已验证

95 /100

about 22 hours ago 分析

信任信号

最近提交2 days ago

GitHub 所有者 pjt222

星标14

下载量 308

许可证MIT

网站pjt222.github.io

状态

查看源代码

类似扩展

Grafana Dashboards

Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.

技能

wshobson

Service Mesh Observability

Implement comprehensive observability for service meshes including distributed tracing, metrics, and visualization. Use when setting up mesh monitoring, debugging latency issues, or implementing SLOs for service communication.

技能

wshobson

Azure Monitor Query Py

100

Azure Monitor Query SDK for Python. Use for querying Log Analytics workspaces and Azure Monitor metrics. Triggers: "azure-monitor-query", "LogsQueryClient", "MetricsQueryClient", "Log Analytics", "Kusto queries", "Azure metrics".

技能

microsoft

Query Netdata Cloud

100

Query Netdata Cloud via its REST API -- metrics, logs (systemd-journal / windows-events / otel-logs), topology graphs (topology:snmp), network flows (flows:netflow), alerts, dynamic configuration (DynCfg), and generic Functions on a node. Use when the user asks about querying Netdata Cloud, fetching metrics from the cloud, querying logs / topology / netflow / sflow / ipfix through Cloud, listing or modifying configurations via DynCfg, calling agent Functions through Cloud, listing spaces/rooms/nodes, or building a curl command against `app.netdata.cloud`. Pairs with the `query-netdata-agents` skill when direct-agent access is needed.

技能

netdata

Observability Designer

100

Observability Designer (POWERFUL)

技能

alirezarezvani

Monitor Stream

Stream live swarm events using the Monitor tool for real-time observability

技能

ruvnet