Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Incident Followup Audit

Skill Verifiziert Aktiv
Teil von:Swe Skills

Audits post-incident engineering follow-through after a sev or incident to verify whether the durable follow-up happened: regression tests, monitors, docs, runbooks, ownership updates, tickets, rollback learnings, and remaining backlog. Use when a user asks whether incident follow-up is complete, what still needs to be done after a postmortem, or how to close the engineering loop. Do NOT use for live incident response, root-cause analysis, or a generic bug hunt unrelated to an incident.

Zweck

To provide a structured audit of post-incident engineering follow-through, ensuring that all necessary follow-up actions are completed and closing the engineering loop after an incident.

Funktionen

  • Audits post-incident engineering follow-through
  • Verifies durable follow-up actions
  • Identifies missing or incomplete work
  • Ranks remaining backlog by risk and impact
  • Anchors audit to specific incident identifiers

Anwendungsfälle

  • Audit postmortem follow-through after a sev or production incident
  • Check whether regression tests, monitors, runbooks, or docs were added
  • See whether ownership, tickets, or rollback learnings were captured
  • Review what is still left to do before an incident is considered fully closed

Nicht-Ziele

  • Live incident response or war-room triage
  • Root-cause analysis of the incident itself
  • Generic code review or bug hunting unrelated to an incident
  • Broad cleanup work not tied to a concrete incident or postmortem

Practical Utility

  • info:Usage examplesThe SKILL.md outlines the desired output structure and evidence types but does not provide concrete, copy-pasteable end-to-end examples of invocations and their claimed outputs.

Installation

/plugin install swe-skills@ckorhonen-swe-skills

Qualitätspunktzahl

Verifiziert
97 /100
Analysiert about 21 hours ago

Vertrauenssignale

Letzter Commit5 days ago
Sterne1
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

Ship Gate

100

Pre-production audit that scans a codebase for security, database, deployment, code quality, AI/LLM, dependency, frontend, and observability issues. Intercepts deploy commands and blocks until critical items pass. Stack-agnostic. Use for "run ship gate", "am I ready to ship", "pre-launch audit", "can I deploy", "push to production", "go live checklist", "preflight check". Not for CI/CD setup or infra provisioning.

Skill
alirezarezvani

Incident Response

100

Manage active production incidents through detection, triage, mitigation, communication, and resolution with structured roles and decision-making. Use this skill whenever the user has an active incident, a production issue, a service outage, a security incident, or needs to plan incident response procedures. Triggers on incident response, production incident, outage, service down, site down, P0, P1, severity, downtime, on-call, incident commander, status page, postmortem prep. Also triggers when something is actively broken in production and the user is figuring out what to do.

Skill
rampstackco

Ops Fires

100

Production incidents dashboard. Reads ECS health, Sentry errors, CI failures. Offers to dispatch fix agents for active fires.

Skill
Lifecycle-Innovations-Limited

After Action Report

100

Run a structured after-action review (postmortem, retrospective) on a launch, incident, or completed project to capture timeline, root cause analysis, contributing factors, and actionable lessons. Use this skill whenever the user wants to run a postmortem, retrospective, AAR, or after-action review on any past event. Triggers on after-action report, AAR, postmortem, retrospective, retro, post-incident review, what went well what didn't, lessons learned, blameless postmortem, root cause analysis, RCA, five whys. Also triggers when the user has just shipped something or just resolved an incident and wants to capture learnings.

Skill
rampstackco

SRE Engineer

98

Defines service level objectives, creates error budget policies, designs incident response procedures, develops capacity models, and produces monitoring configurations and automation scripts for production systems. Use when defining SLIs/SLOs, managing error budgets, building reliable systems at scale, incident management, chaos engineering, toil reduction, or capacity planning.

Skill
jeffallan

Incident Commander

98

Incident Commander Skill

Skill
alirezarezvani