Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Chaos Engineer

Skill Verifiziert Aktiv

Designs chaos experiments, creates failure injection frameworks, and facilitates game day exercises for distributed systems — producing runbooks, experiment manifests, rollback procedures, and post-mortem templates. Use when designing chaos experiments, implementing failure injection frameworks, or conducting game day exercises. Invoke for chaos experiments, resilience testing, blast radius control, game days, antifragile systems, fault injection, Chaos Monkey, Litmus Chaos.

Zweck

To empower teams to proactively build more resilient distributed systems by providing structured methodologies and practical guidance for chaos engineering.

Funktionen

  • Designs chaos experiments with clear hypotheses
  • Creates failure injection frameworks and procedures
  • Facilitates game day exercises for incident response
  • Produces runbooks, manifests, and post-mortem templates
  • Integrates with common chaos engineering tools

Anwendungsfälle

  • Designing and running controlled chaos experiments
  • Implementing failure injection for resilience testing
  • Planning and executing game day exercises
  • Improving system reliability by identifying weaknesses

Nicht-Ziele

  • Performing actual chaos injections automatically
  • Replacing dedicated chaos engineering platforms
  • Providing live monitoring dashboards (links to concepts are provided)
  • Automating post-mortem analysis

Workflow

  1. System Analysis
  2. Experiment Design
  3. Execute Chaos
  4. Learn & Improve
  5. Automate

Praktiken

  • Experiment Design
  • Safety Controls
  • Game Day Facilitation
  • Incident Response Preparation

Voraussetzungen

  • Understanding of distributed systems
  • Familiarity with target system's architecture
  • Access to relevant environments (e.g., Kubernetes cluster, AWS)

Installation

Zuerst Marketplace hinzufügen

/plugin marketplace add jeffallan/claude-skills
/plugin install claude-skills@fullstack-dev-skills

Qualitätspunktzahl

Verifiziert
99 /100
Analysiert 1 day ago

Vertrauenssignale

Letzter Commit13 days ago
Sterne9k
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

Chaos Engineering

99

Use when planning, running, or learning from chaos engineering experiments. Triggers on "chaos experiment", "fault injection", "gameday", "resilience test", "blast radius", "steady state", "abort criteria", "Chaos Toolkit", "Chaos Mesh", "Litmus", "Gremlin", "AWS FIS", or any deliberate failure-injection question. Ships experiment designer, blast-radius calculator, and postmortem generator (all stdlib Python), 4 references on chaos principles + experiment design + attack taxonomy + tooling landscape, and a /chaos-experiment slash command. Composes with feature-flags-architect (kill switches as abort triggers) and kubernetes-operator (common chaos targets).

Skill
alirezarezvani

Workflow Orchestration Patterns

99

Design durable workflows with Temporal for distributed systems. Covers workflow vs activity separation, saga patterns, state management, and determinism constraints. Use when building long-running processes, distributed transactions, or microservice orchestration.

Skill
wshobson

Microservices Architect

98

Designs distributed system architectures, decomposes monoliths into bounded-context services, recommends communication patterns, and produces service boundary diagrams and resilience strategies. Use when designing distributed systems, decomposing monoliths, or implementing microservices patterns — including service boundaries, DDD, saga patterns, event sourcing, CQRS, service mesh, or distributed tracing.

Skill
jeffallan

Run Chaos Experiment

95

Design and execute chaos engineering experiments using Litmus or Chaos Mesh. Test system resilience through controlled fault injection, validate hypothesis-driven tests, and improve failure recovery. Use before major product launches, after architecture changes to validate resilience, during GameDays or disaster recovery drills, to validate assumptions about failure modes, or as part of an SRE maturity program.

Skill
pjt222

Release It!

95

Build production-ready systems with stability patterns: circuit breakers, bulkheads, timeouts, and retry logic. Use when the user mentions "production outage", "circuit breaker", "timeout strategy", "deployment pipeline", "chaos engineering", "bulkhead pattern", "retry with backoff", or "health checks". Also trigger when designing resilient microservices, planning zero-downtime deployments, or investigating cascading failure scenarios. Covers capacity planning, health checks, and anti-fragility patterns. For data systems, see ddia-systems. For system architecture, see system-design.

Skill
wondelai

K8s Manifest Generator

100

Create production-ready Kubernetes manifests for Deployments, Services, ConfigMaps, and Secrets following best practices and security standards. Use when generating Kubernetes YAML manifests, creating K8s resources, or implementing production-grade Kubernetes configurations.

Skill
wshobson