multi-ai

Name: multi-ai
Rating: 65
Author: Z-M-Huang

by Z-M-Huang

Claude (planner + coder) and codex (reviewer)

⭐ 13🍴 4📅 Jan 24, 2026

ai-tools claude-code code-review developer-tools multi-agent multi-ai plugin security

View on GitHub Run in Manus

SKILL.md

name: multi-ai description: Start the multi-AI pipeline with TDD-driven ralph loop. Plan → Review → Implement (loop until tests pass + reviews approve). plugin-scoped: true allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Skill, AskUserQuestion

Multi-AI Pipeline with Ralph Loop

This pipeline combines human-guided planning with autonomous TDD-driven implementation using the Ralph Wiggum technique.

Scripts location: ${CLAUDE_PLUGIN_ROOT}/scripts/ Task directory: ${CLAUDE_PROJECT_DIR}/.task/

Pipeline Overview

Phase 1: Requirements (INTERACTIVE)
├── /user-story gathers requirements + TDD criteria
└── User approves

Phase 2: Planning (SEMI-INTERACTIVE)
├── Create plan with test commands, mode, risk assessment
├── Review loop for plan (autonomous)
└── Prompt user ONLY if clarification needed or conflicts detected

Phase 3: Implementation
├── IF simple mode → implement + single review cycle
└── IF ralph-loop mode → iterate until tests pass + reviews approve

Phase 4: Complete

Phase 1: Requirements Gathering (Interactive)

Step 1: Clean Up Previous Task

"${CLAUDE_PLUGIN_ROOT}/scripts/orchestrator.sh" reset

Step 2: Set State and Gather Requirements

"${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" set requirements_gathering ""

Invoke /user-story to interactively gather:

Functional requirements
Technical requirements
Acceptance criteria
TDD criteria (test commands, success patterns)
Implementation mode (simple or ralph-loop)
Max iterations (default 10)

WAIT for user approval before continuing.

Phase 2: Planning (Semi-Interactive)

Step 3: Create Initial Plan

"${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" set plan_drafting ""

Create .task/plan.json based on the approved user story.

Step 4: Refine Plan with Risk Assessment

"${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" set plan_refining "$(bun ${CLAUDE_PLUGIN_ROOT}/scripts/json-tool.ts get .task/plan.json .id)"

Research the codebase and create .task/plan-refined.json with:

{
  "id": "plan-YYYYMMDD-HHMMSS",
  "title": "Feature title",
  "description": "What the user wants",
  "requirements": ["req 1", "req 2"],
  "technical_approach": "How to implement",
  "files_to_modify": ["path/to/file.ts"],
  "files_to_create": ["path/to/new.ts"],
  "implementation": {
    "mode": "ralph-loop",
    "max_iterations": 10,
    "skill": "implement-sonnet"
  },
  "test_plan": {
    "commands": ["npm test", "npm run lint"],
    "success_pattern": "passed|✓",
    "run_after_review": true
  },
  "risk_assessment": {
    "infinite_loop_risks": [
      "Risk: Linter auto-fix may conflict with reviewer style preferences"
    ],
    "conflicts_detected": [],
    "requires_user_decision": false
  },
  "completion_promise": "<promise>IMPLEMENTATION_COMPLETE</promise>",
  "refined_by": "claude",
  "refined_at": "ISO8601"
}

Step 5: Risk Assessment Check

Before proceeding, analyze for potential infinite loop risks:

Check for conflicts:

Test vs Review conflicts: Does the test require something reviews might reject?
Linter vs Style conflicts: Do auto-fixes conflict with coding standards?
Missing infrastructure: Are test dependencies available?
Circular dependencies: Could fixes create new review issues?

IF risk_assessment.requires_user_decision is true:

Use AskUserQuestion to present the risks
Get user decision on how to proceed
Update plan based on user input

OTHERWISE: Proceed autonomously.

Step 6: Plan Review Loop (Autonomous)

Initialize plan review tracking:

"${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" init-plan-review

Run the review loop with iteration tracking:

PLAN_ITERATION = 0
MAX_PLAN_ITERATIONS = planReviewLoopLimit from config (default: 10)

WHILE PLAN_ITERATION < MAX_PLAN_ITERATIONS:

    # Increment at start of each cycle
    "${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" increment-plan-review

    1. INVOKE /review-sonnet (plan mode)
       - If needs_changes → FIX plan, continue to step 2

    2. INVOKE /review-opus (plan mode)
       - If needs_changes → FIX plan, continue to step 3

    3. INVOKE /review-codex (plan mode)
       - If approved → EXIT loop, proceed to implementation
       - If needs_changes → FIX plan, go back to step 1
       - If needs_clarification → ASK user, then continue

    PLAN_ITERATION = $("${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" get-plan-review-iteration)

Check limit before each iteration:

if [[ $("${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" exceeded-plan-review-limit) == "1" ]]; then
  echo "Plan review loop exceeded limit. Asking user for guidance."
  # ASK user if they want to continue or abort
fi

IMPORTANT: You CANNOT proceed to implementation until ALL plan reviews are approved. The state-manager.sh will block transition to implementing or implementing_loop states if any plan review is missing or not approved.

Verify before implementation:

"${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" validate-plan-reviews

This command will fail with a detailed error if any reviews are incomplete.

Phase 3: Implementation

Step 7: Check Implementation Mode

Read the implementation mode from .task/plan-refined.json:

MODE=$(bun ${CLAUDE_PLUGIN_ROOT}/scripts/json-tool.ts get .task/plan-refined.json .implementation.mode)

IF mode == "simple": Single Implementation Cycle

"${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" set implementing "$(bun ${CLAUDE_PLUGIN_ROOT}/scripts/json-tool.ts get .task/plan-refined.json .id)"

Invoke /implement-sonnet
Run single review cycle (sonnet → opus → codex)
Run tests
If all pass → complete
If issues → fix once, then complete (no loop)

IF mode == "ralph-loop": TDD Ralph Loop

"${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" set implementing_loop "$(bun ${CLAUDE_PLUGIN_ROOT}/scripts/json-tool.ts get .task/plan-refined.json .id)"

Initialize loop state:

Write .task/loop-state.json:

{
  "active": true,
  "iteration": 0,
  "max_iterations": 10,
  "completion_promise": "<promise>IMPLEMENTATION_COMPLETE</promise>",
  "plan_path": ".task/plan-refined.json",
  "started_at": "ISO8601"
}

Execute the Ralph Loop:

The stop hook (hooks/implementation-stop-hook.js) will intercept exit attempts and verify:

Check review files: Read existing .task/review-*.json files for status
Run test commands from plan (in project directory)
Verify completion criteria:
- All review files have status == "approved"
- All test commands pass (exit code from config, default 0)
- Success/failure patterns match (if defined in plan)

IMPORTANT: The hook READS review files - it does NOT run the review skills. You must invoke /review-sonnet, /review-opus, /review-codex yourself before attempting to exit.

IF criteria met:

Output: <promise>IMPLEMENTATION_COMPLETE</promise>
Hook allows exit

IF criteria NOT met:

Hook blocks exit

Re-feeds the implementation prompt:

Continue implementing based on the plan at .task/plan-refined.json

Previous iteration: [N] of [MAX]
Review status: [summary of issues]
Test status: [pass/fail summary]

Fix the issues and try again.
Output <promise>IMPLEMENTATION_COMPLETE</promise> when:
- All reviews pass (sonnet, opus, codex approve)
- All tests pass (exit code 0)

Loop continues until:

Completion promise detected AND tests pass AND reviews pass
OR max iterations reached (pause and ask user)

Phase 4: Completion

Step 8: Clean Up Loop State

rm -f .task/loop-state.json
"${CLAUDE_PLUGIN_ROOT}/scripts/state-manager.sh" set complete "$(bun ${CLAUDE_PLUGIN_ROOT}/scripts/json-tool.ts get .task/plan-refined.json .id)"

Step 9: Report Results

Report to user:

What was implemented
Files changed
Tests added/modified
Review iterations taken
Final test results

Ralph Loop Details

How the Stop Hook Works

When Claude tries to exit during implementing_loop state:

Hook reads .task/loop-state.json
If active: false or missing → allow exit
If iteration >= max_iterations → allow exit, warn user
Otherwise:
- Read existing review files (.task/review-*.json)
- Run test commands from plan (changes to project directory first)
- Check success/failure patterns from plan config
- Check if completion criteria met
- If met → allow exit
- If not → increment iteration, block exit, return prompt

Note: The hook does NOT invoke review skills - it only reads the review result files. You must run the reviews yourself before attempting to exit.

Completion Criteria

All must be true:

.task/review-sonnet.json status == "approved"
.task/review-opus.json status == "approved"
.task/review-codex.json status == "approved"
All test commands from plan exit with code 0

Safety Mechanisms

Max iterations: Hard limit (default 10, user configurable)
Conflict detection: Planning phase flags potential infinite loops
Cancel command: /cancel-loop to abort at any time
State file: Remove .task/loop-state.json to stop loop

Important Rules

Semi-interactive planning: Only ask user when genuinely needed
Autonomous implementation: Ralph loop handles iteration automatically
Review before test: Always run reviews first, then tests
Accept all feedback: No debate with reviewers, just fix
Clear completion criteria: Tests pass + reviews approve

Progress Reporting

Requirements approved. Starting planning...
✓ Plan created
✓ Risk assessment: No conflicts detected
✓ Plan reviews: approved (2 iterations)

Starting implementation (ralph-loop mode, max 10 iterations)...
Iteration 1:
  ✓ Implementation complete
  ✗ Sonnet review: 2 issues
  - Fixing issues...
Iteration 2:
  ✓ Fixes applied
  ✓ Sonnet review: approved
  ✓ Opus review: approved
  ✓ Codex review: approved
  ✓ Tests: 5 passed, 0 failed

<promise>IMPLEMENTATION_COMPLETE</promise>

✓ Complete! Feature implemented in 2 iterations.

Score

Total Score

65/100

Based on repository quality metrics

✓SKILL.md

SKILL.mdファイルが含まれている

+20

✓LICENSE

ライセンスが設定されている

+10

○説明文

100文字以上の説明がある

0/10

○人気

GitHub Stars 100以上

0/15

✓最近の活動

1ヶ月以内に更新

+10

○フォーク

10回以上フォークされている

0/5

✓Issue管理

オープンIssueが50未満

✓言語

プログラミング言語が設定されている

✓タグ

1つ以上のタグが設定されている

Reviews

💬

Reviews coming soon

multi-ai

SKILL.md

name: multi-ai description: Start the multi-AI pipeline with TDD-driven ralph loop. Plan → Review → Implement (loop until tests pass + reviews approve). plugin-scoped: true allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Skill, AskUserQuestion

Multi-AI Pipeline with Ralph Loop

Pipeline Overview

Phase 1: Requirements Gathering (Interactive)

Step 1: Clean Up Previous Task

Step 2: Set State and Gather Requirements

Phase 2: Planning (Semi-Interactive)

Step 3: Create Initial Plan

Step 4: Refine Plan with Risk Assessment

Step 5: Risk Assessment Check

Step 6: Plan Review Loop (Autonomous)

Phase 3: Implementation

Step 7: Check Implementation Mode

IF mode == "simple": Single Implementation Cycle

IF mode == "ralph-loop": TDD Ralph Loop

Phase 4: Completion

Step 8: Clean Up Loop State

Step 9: Report Results

Ralph Loop Details

How the Stop Hook Works

Completion Criteria

Safety Mechanisms

Important Rules

Progress Reporting

Score

Reviews

browser-use

git-workflow

code-review

system-info

changelog-automation

web-component-design

multi-ai

SKILL.md

name: multi-ai description: Start the multi-AI pipeline with TDD-driven ralph loop. Plan → Review → Implement (loop until tests pass + reviews approve). plugin-scoped: true allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Skill, AskUserQuestion

Multi-AI Pipeline with Ralph Loop

Pipeline Overview

Phase 1: Requirements Gathering (Interactive)

Step 1: Clean Up Previous Task

Step 2: Set State and Gather Requirements

Phase 2: Planning (Semi-Interactive)

Step 3: Create Initial Plan

Step 4: Refine Plan with Risk Assessment

Step 5: Risk Assessment Check

Step 6: Plan Review Loop (Autonomous)

Phase 3: Implementation

Step 7: Check Implementation Mode

IF mode == "simple": Single Implementation Cycle

IF mode == "ralph-loop": TDD Ralph Loop

Phase 4: Completion

Step 8: Clean Up Loop State

Step 9: Report Results

Ralph Loop Details

How the Stop Hook Works

Completion Criteria

Safety Mechanisms

Important Rules

Progress Reporting

Score

Reviews

Related

Related Skills

browser-use

git-workflow

code-review

system-info

changelog-automation

web-component-design