Skip to main content

Incident Response

Skill Verified Active

Manage active production incidents through detection, triage, mitigation, communication, and resolution with structured roles and decision-making. Use this skill whenever the user has an active incident, a production issue, a service outage, a security incident, or needs to plan incident response procedures. Triggers on incident response, production incident, outage, service down, site down, P0, P1, severity, downtime, on-call, incident commander, status page, postmortem prep. Also triggers when something is actively broken in production and the user is figuring out what to do.

Purpose

To enable users to effectively manage production incidents by providing a clear, actionable framework for detection, triage, mitigation, communication, and resolution.

Features

  • Structured incident management framework
  • Defined roles and responsibilities
  • Severity assessment rubric
  • Mitigation patterns
  • Communication strategy guidelines
  • Decision-making rubrics

Use Cases

  • Managing active production incidents
  • Building incident response procedures
  • Defining severity levels and response protocols
  • Planning on-call rotations and incident command structures

Non-Goals

  • Post-incident retrospective analysis (use `after-action-report`)
  • Planning for planned launches (use `launch-runbook`)
  • Pre-launch issue triage (use `qa-testing`)

Installation

npx skills add rampstackco/claude-skills

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

Verified
100 /100
Analyzed about 13 hours ago

Trust Signals

Last commit3 days ago
Stars168
LicenseMIT
Status
View Source

Similar Extensions

Ops Fires

100

Production incidents dashboard. Reads ECS health, Sentry errors, CI failures. Offers to dispatch fix agents for active fires.

Skill
Lifecycle-Innovations-Limited

After Action Report

100

Run a structured after-action review (postmortem, retrospective) on a launch, incident, or completed project to capture timeline, root cause analysis, contributing factors, and actionable lessons. Use this skill whenever the user wants to run a postmortem, retrospective, AAR, or after-action review on any past event. Triggers on after-action report, AAR, postmortem, retrospective, retro, post-incident review, what went well what didn't, lessons learned, blameless postmortem, root cause analysis, RCA, five whys. Also triggers when the user has just shipped something or just resolved an incident and wants to capture learnings.

Skill
rampstackco

K8s Manifest Generator

100

Create production-ready Kubernetes manifests for Deployments, Services, ConfigMaps, and Secrets following best practices and security standards. Use when generating Kubernetes YAML manifests, creating K8s resources, or implementing production-grade Kubernetes configurations.

Skill
wshobson

OpenClaw Release Maintainer

100

Prepare or verify OpenClaw stable/beta releases, changelogs, release notes, publish commands, and artifacts.

Skill
steipete

Azure Container Registry SDK for Python

100

Azure Container Registry SDK for Python. Use for managing container images, artifacts, and repositories. Triggers: "azure-containerregistry", "ContainerRegistryClient", "container images", "docker registry", "ACR".

Skill
microsoft

Azure App Configuration SDK for Python

100

Azure App Configuration SDK for Python. Use for centralized configuration management, feature flags, and dynamic settings. Triggers: "azure-appconfiguration", "AzureAppConfigurationClient", "feature flags", "configuration", "key-value settings".

Skill
microsoft

© 2025 SkillRepo · Find the right skill, skip the noise.