Datadog Observability Skill

Skill Active

Full-stack observability with Datadog APM, logs, metrics, synthetics, and RUM. Use when implementing monitoring, tracing, alerting, or cost optimization for production systems.

Purpose

To enable users to effectively implement and manage full-stack observability for their production systems using Datadog, covering monitoring, tracing, logging, and cost optimization.

Features

Full-stack observability with Datadog APM, logs, metrics, synthetics, RUM
Detailed installation instructions for Docker, Kubernetes, Linux, Windows
Guidance on application instrumentation (Python, Node.js, Go, Java)
Configuration best practices for custom metrics, log processing, and alerting
Strategies for cost optimization and cardinality management

Use Cases

Implementing production monitoring and observability
Setting up distributed tracing across microservices
Configuring log aggregation and analysis pipelines
Creating custom metrics and dashboards
Optimizing Datadog costs

Non-Goals

Building with open-source stacks (Prometheus/Grafana)
Replacing managed observability solutions when cost is the primary concern
Providing a direct API to Datadog services (documentation and guidance only)

Workflow

Understand Datadog capabilities
Install Datadog Agent on target platform
Instrument applications for APM and custom metrics
Configure log collection and processing
Set up dashboards and alerts
Monitor and optimize costs

Practices

Observability best practices
APM instrumentation
Log management
Metric monitoring
Alerting strategies
Cost optimization

Prerequisites

Datadog account and API key
Installation of Datadog Agent
Application instrumentation (for APM/metrics)

Trust

warning:Issues AttentionThere are 4 open issues and 0 closed issues in the last 90 days, indicating a closure rate of 0% and potentially slow maintainer engagement.

Installation

npx skills add bobmatnyc/claude-mpm-skills

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

89 /100

Analyzed about 19 hours ago

Trust Signals

Last commit29 days ago

GitHub owner bobmatnyc

Stars44

LicenseMIT

Status

View Source

Similar Extensions

Observability Designer

100

Observability Designer (POWERFUL)

Skill

alirezarezvani

Ops Monitor

100

Unified APM and monitoring surface. Polls Datadog, New Relic, and OpenTelemetry backends for active alerts, error traces, and entity health. Use --watch for live polling every 60 seconds. Use --setup to configure monitoring credentials.

Skill

Lifecycle-Innovations-Limited

Azure Monitor Query Py

100

Azure Monitor Query SDK for Python. Use for querying Log Analytics workspaces and Azure Monitor metrics. Triggers: "azure-monitor-query", "LogsQueryClient", "MetricsQueryClient", "Log Analytics", "Kusto queries", "Azure metrics".

Skill

microsoft

Query Netdata Cloud

100

Query Netdata Cloud via its REST API -- metrics, logs (systemd-journal / windows-events / otel-logs), topology graphs (topology:snmp), network flows (flows:netflow), alerts, dynamic configuration (DynCfg), and generic Functions on a node. Use when the user asks about querying Netdata Cloud, fetching metrics from the cloud, querying logs / topology / netflow / sflow / ipfix through Cloud, listing or modifying configurations via DynCfg, calling agent Functions through Cloud, listing spaces/rooms/nodes, or building a curl command against `app.netdata.cloud`. Pairs with the `query-netdata-agents` skill when direct-agent access is needed.

Skill

netdata

LangSmith Observability

LLM observability platform for tracing, evaluation, and monitoring. Use when debugging LLM applications, evaluating model outputs against datasets, monitoring production systems, or building systematic testing pipelines for AI applications.

Skill

Orchestra-Research

Service Mesh Observability

Implement comprehensive observability for service meshes including distributed tracing, metrics, and visualization. Use when setting up mesh monitoring, debugging latency issues, or implementing SLOs for service communication.

Skill

wshobson