跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Setup Uptime Checks

技能 已验证 活跃

Configure external uptime monitoring using Blackbox Exporter and Prometheus. Implement SSL certificate monitoring, HTTP endpoint health checks, and status pages for customer-facing visibility. Use when monitoring customer-facing endpoints such as APIs and websites, tracking SSL certificate expiration, validating service availability from multiple regions, creating public status pages, or meeting SLA requirements for uptime reporting.

目的

To enable users to configure robust external uptime monitoring, track SSL certificate expirations, and provide public status visibility for their customer-facing services.

功能

  • Configure Blackbox Exporter for various probe types
  • Integrate Blackbox Exporter with Prometheus for metric collection
  • Set up Prometheus alerting rules for endpoint downtime and SSL expiry
  • Guide on building uptime monitoring dashboards
  • Provide options for status page implementation

使用场景

  • Monitoring customer-facing APIs and websites
  • Tracking SSL certificate expiration dates
  • Validating service availability from multiple regions
  • Creating public-facing service status pages
  • Meeting SLA requirements for uptime reporting

非目标

  • Internal network monitoring
  • Application performance monitoring (APM) beyond basic health checks
  • Automated certificate renewal

工作流

  1. Deploy Blackbox Exporter
  2. Configure Blackbox Modules
  3. Configure Prometheus Scrape Jobs
  4. Create Uptime Alerts
  5. Build Uptime Dashboard
  6. Set Up Status Page

实践

  • Observability
  • Site Reliability Engineering
  • Infrastructure as Code

先决条件

  • Docker or Kubernetes environment
  • Prometheus instance
  • Bash-compatible shell

Documentation

  • info:Configuration & parameter referenceWhile the SKILL.md outlines steps, specific default values for all optional parameters (e.g., Prometheus instance details, status page tool configurations) are not explicitly documented.

安装

/plugin install agent-almanac@pjt222-agent-almanac

质量评分

已验证
98 /100
2 days ago 分析

信任信号

最近提交3 days ago
星标14
许可证MIT
状态
查看源代码

类似扩展

Grafana Dashboards

99

Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.

技能
wshobson

Monitor Data Integrity

100

Design and operate a data integrity monitoring programme based on ALCOA+ principles. Covers detective controls, audit trail review schedules, anomaly detection patterns (off-hours activity, sequential modifications, bulk changes), metrics dashboards, investigation triggers, and escalation matrix definition. Use when establishing a data integrity monitoring programme for GxP systems, preparing for inspections where data integrity is a focus area, after a data integrity incident requiring enhanced monitoring, or when implementing MHRA, WHO, or PIC/S guidance.

技能
pjt222

Azure Monitor Query Py

100

Azure Monitor Query SDK for Python. Use for querying Log Analytics workspaces and Azure Monitor metrics. Triggers: "azure-monitor-query", "LogsQueryClient", "MetricsQueryClient", "Log Analytics", "Kusto queries", "Azure metrics".

技能
microsoft

Query Netdata Cloud

100

Query Netdata Cloud via its REST API -- metrics, logs (systemd-journal / windows-events / otel-logs), topology graphs (topology:snmp), network flows (flows:netflow), alerts, dynamic configuration (DynCfg), and generic Functions on a node. Use when the user asks about querying Netdata Cloud, fetching metrics from the cloud, querying logs / topology / netflow / sflow / ipfix through Cloud, listing or modifying configurations via DynCfg, calling agent Functions through Cloud, listing spaces/rooms/nodes, or building a curl command against `app.netdata.cloud`. Pairs with the `query-netdata-agents` skill when direct-agent access is needed.

技能
netdata

Meta Observer

100

Track skill performance and emerging patterns

技能
mshadmanrahman

Ops Monitor

100

Unified APM and monitoring surface. Polls Datadog, New Relic, and OpenTelemetry backends for active alerts, error traces, and entity health. Use --watch for live polling every 60 seconds. Use --setup to configure monitoring credentials.

技能
Lifecycle-Innovations-Limited