Back to list
dralgorhythm

chaos-engineering

by dralgorhythm

A More Effective Agent Harness for Claude

4🍴 0📅 Jan 22, 2026

SKILL.md


name: chaos-engineering description: Test system resilience through controlled failures. Use when validating fault tolerance, disaster recovery, or system reliability. Covers chaos experiments. allowed-tools: Read, Write, Bash, Glob, Grep

Chaos Engineering

Principles

  1. Build a Hypothesis: Define expected behavior
  2. Minimize Blast Radius: Start small
  3. Run in Production: Real conditions matter
  4. Automate: Make experiments repeatable
  5. Minimize Impact: Have abort conditions

Experiment Process

  1. Steady State: Define normal metrics
  2. Hypothesis: "System will maintain X under condition Y"
  3. Introduce Variables: Inject failure
  4. Observe: Compare to steady state
  5. Analyze: Confirm or disprove hypothesis

Common Experiments

Network Failures

# Add latency
tc qdisc add dev eth0 root netem delay 100ms

# Packet loss
tc qdisc add dev eth0 root netem loss 10%

# Remove
tc qdisc del dev eth0 root

Resource Exhaustion

# CPU stress
stress --cpu 4 --timeout 60s

# Memory stress
stress --vm 2 --vm-bytes 1G --timeout 60s

# Disk fill
dd if=/dev/zero of=/tmp/fill bs=1M count=1024

Service Failures

  • Kill processes
  • Restart containers
  • Terminate instances
  • Block dependencies

Chaos Tools

  • Chaos Monkey: Random instance termination
  • Gremlin: Comprehensive chaos platform
  • Litmus: Kubernetes chaos engineering
  • Chaos Mesh: Cloud-native chaos

Experiment Template

## Experiment: [Name]

### Hypothesis
If [condition], then [expected behavior].

### Steady State
- Metric A: [baseline value]
- Metric B: [baseline value]

### Method
1. [Step 1]
2. [Step 2]
3. [Step 3]

### Abort Conditions
- If [condition], stop immediately

### Results
[What happened]

### Findings
[What we learned]

Safety Rules

  1. Start in non-production
  2. Have rollback ready
  3. Monitor continuously
  4. Communicate with team
  5. Document everything

Score

Total Score

55/100

Based on repository quality metrics

SKILL.md

SKILL.mdファイルが含まれている

+20
LICENSE

ライセンスが設定されている

0/10
説明文

100文字以上の説明がある

0/10
人気

GitHub Stars 100以上

0/15
最近の活動

1ヶ月以内に更新

+10
フォーク

10回以上フォークされている

0/5
Issue管理

オープンIssueが50未満

+5
言語

プログラミング言語が設定されている

+5
タグ

1つ以上のタグが設定されている

+5

Reviews

💬

Reviews coming soon