Skip to main content

Datadog Observability Skill

Skill Active

Full-stack observability with Datadog APM, logs, metrics, synthetics, and RUM. Use when implementing monitoring, tracing, alerting, or cost optimization for production systems.

Purpose

To enable users to effectively implement and manage full-stack observability for their production systems using Datadog, covering monitoring, tracing, logging, and cost optimization.

Features

  • Full-stack observability with Datadog APM, logs, metrics, synthetics, RUM
  • Detailed installation instructions for Docker, Kubernetes, Linux, Windows
  • Guidance on application instrumentation (Python, Node.js, Go, Java)
  • Configuration best practices for custom metrics, log processing, and alerting
  • Strategies for cost optimization and cardinality management

Use Cases

  • Implementing production monitoring and observability
  • Setting up distributed tracing across microservices
  • Configuring log aggregation and analysis pipelines
  • Creating custom metrics and dashboards
  • Optimizing Datadog costs

Non-Goals

  • Building with open-source stacks (Prometheus/Grafana)
  • Replacing managed observability solutions when cost is the primary concern
  • Providing a direct API to Datadog services (documentation and guidance only)

Workflow

  1. Understand Datadog capabilities
  2. Install Datadog Agent on target platform
  3. Instrument applications for APM and custom metrics
  4. Configure log collection and processing
  5. Set up dashboards and alerts
  6. Monitor and optimize costs

Practices

  • Observability best practices
  • APM instrumentation
  • Log management
  • Metric monitoring
  • Alerting strategies
  • Cost optimization

Prerequisites

  • Datadog account and API key
  • Installation of Datadog Agent
  • Application instrumentation (for APM/metrics)

Trust

  • warning:Issues AttentionThere are 4 open issues and 0 closed issues in the last 90 days, indicating a closure rate of 0% and potentially slow maintainer engagement.

Installation

npx skills add bobmatnyc/claude-mpm-skills

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

89 /100
Analyzed about 19 hours ago

Trust Signals

Last commit29 days ago
Stars44
LicenseMIT
Status
View Source

Similar Extensions

Observability Designer

100

Observability Designer (POWERFUL)

Skill
alirezarezvani

Ops Monitor

100

Unified APM and monitoring surface. Polls Datadog, New Relic, and OpenTelemetry backends for active alerts, error traces, and entity health. Use --watch for live polling every 60 seconds. Use --setup to configure monitoring credentials.

Skill
Lifecycle-Innovations-Limited

Azure Monitor Query Py

100

Azure Monitor Query SDK for Python. Use for querying Log Analytics workspaces and Azure Monitor metrics. Triggers: "azure-monitor-query", "LogsQueryClient", "MetricsQueryClient", "Log Analytics", "Kusto queries", "Azure metrics".

Skill
microsoft

Query Netdata Cloud

100

Query Netdata Cloud via its REST API -- metrics, logs (systemd-journal / windows-events / otel-logs), topology graphs (topology:snmp), network flows (flows:netflow), alerts, dynamic configuration (DynCfg), and generic Functions on a node. Use when the user asks about querying Netdata Cloud, fetching metrics from the cloud, querying logs / topology / netflow / sflow / ipfix through Cloud, listing or modifying configurations via DynCfg, calling agent Functions through Cloud, listing spaces/rooms/nodes, or building a curl command against `app.netdata.cloud`. Pairs with the `query-netdata-agents` skill when direct-agent access is needed.

Skill
netdata

LangSmith Observability

99

LLM observability platform for tracing, evaluation, and monitoring. Use when debugging LLM applications, evaluating model outputs against datasets, monitoring production systems, or building systematic testing pipelines for AI applications.

Skill
Orchestra-Research

Service Mesh Observability

98

Implement comprehensive observability for service meshes including distributed tracing, metrics, and visualization. Use when setting up mesh monitoring, debugging latency issues, or implementing SLOs for service communication.

Skill
wshobson

© 2025 SkillRepo · Find the right skill, skip the noise.