Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Setup Uptime Checks

Skill Verifiziert Aktiv
Teil von:Agent Almanac

Configure external uptime monitoring using Blackbox Exporter and Prometheus. Implement SSL certificate monitoring, HTTP endpoint health checks, and status pages for customer-facing visibility. Use when monitoring customer-facing endpoints such as APIs and websites, tracking SSL certificate expiration, validating service availability from multiple regions, creating public status pages, or meeting SLA requirements for uptime reporting.

Zweck

To enable users to configure robust external uptime monitoring, track SSL certificate expirations, and provide public status visibility for their customer-facing services.

Funktionen

  • Configure Blackbox Exporter for various probe types
  • Integrate Blackbox Exporter with Prometheus for metric collection
  • Set up Prometheus alerting rules for endpoint downtime and SSL expiry
  • Guide on building uptime monitoring dashboards
  • Provide options for status page implementation

Anwendungsfälle

  • Monitoring customer-facing APIs and websites
  • Tracking SSL certificate expiration dates
  • Validating service availability from multiple regions
  • Creating public-facing service status pages
  • Meeting SLA requirements for uptime reporting

Nicht-Ziele

  • Internal network monitoring
  • Application performance monitoring (APM) beyond basic health checks
  • Automated certificate renewal

Workflow

  1. Deploy Blackbox Exporter
  2. Configure Blackbox Modules
  3. Configure Prometheus Scrape Jobs
  4. Create Uptime Alerts
  5. Build Uptime Dashboard
  6. Set Up Status Page

Praktiken

  • Observability
  • Site Reliability Engineering
  • Infrastructure as Code

Voraussetzungen

  • Docker or Kubernetes environment
  • Prometheus instance
  • Bash-compatible shell

Documentation

  • info:Configuration & parameter referenceWhile the SKILL.md outlines steps, specific default values for all optional parameters (e.g., Prometheus instance details, status page tool configurations) are not explicitly documented.

Installation

/plugin install agent-almanac@pjt222-agent-almanac

Qualitätspunktzahl

Verifiziert
98 /100
Analysiert about 20 hours ago

Vertrauenssignale

Letzter Commit1 day ago
Sterne14
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

Grafana Dashboards

99

Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.

Skill
wshobson

Monitor Data Integrity

100

Design and operate a data integrity monitoring programme based on ALCOA+ principles. Covers detective controls, audit trail review schedules, anomaly detection patterns (off-hours activity, sequential modifications, bulk changes), metrics dashboards, investigation triggers, and escalation matrix definition. Use when establishing a data integrity monitoring programme for GxP systems, preparing for inspections where data integrity is a focus area, after a data integrity incident requiring enhanced monitoring, or when implementing MHRA, WHO, or PIC/S guidance.

Skill
pjt222

Azure Monitor Query Py

100

Azure Monitor Query SDK for Python. Use for querying Log Analytics workspaces and Azure Monitor metrics. Triggers: "azure-monitor-query", "LogsQueryClient", "MetricsQueryClient", "Log Analytics", "Kusto queries", "Azure metrics".

Skill
microsoft

Query Netdata Cloud

100

Query Netdata Cloud via its REST API -- metrics, logs (systemd-journal / windows-events / otel-logs), topology graphs (topology:snmp), network flows (flows:netflow), alerts, dynamic configuration (DynCfg), and generic Functions on a node. Use when the user asks about querying Netdata Cloud, fetching metrics from the cloud, querying logs / topology / netflow / sflow / ipfix through Cloud, listing or modifying configurations via DynCfg, calling agent Functions through Cloud, listing spaces/rooms/nodes, or building a curl command against `app.netdata.cloud`. Pairs with the `query-netdata-agents` skill when direct-agent access is needed.

Skill
netdata

Meta Observer

100

Track skill performance and emerging patterns

Skill
mshadmanrahman

Ops Monitor

100

Unified APM and monitoring surface. Polls Datadog, New Relic, and OpenTelemetry backends for active alerts, error traces, and entity health. Use --watch for live polling every 60 seconds. Use --setup to configure monitoring credentials.

Skill
Lifecycle-Innovations-Limited