backend-safety-integrator

Name: backend-safety-integrator
Rating: 70
Author: wildcard

by wildcard

caro: fast Rust CLI that turns natural‑language tasks into a safe POSIX command. Built for macOS (MLX/Metal) with a built‑in model; supports vLLM/Ollama/LM Studio. JSON‑only output, safety checks, confirmation, multi‑step goals, devcontainer included.

⭐ 23🍴 2📅 Jan 23, 2026

ai cmd command-line devops local-llm posix runbook rust

View on GitHub Run in Manus

SKILL.md

name: backend-safety-integrator description: Guide for integrating safety validation into new inference backends

Backend Safety Integrator Skill

Purpose: Systematically integrate safety validation when adding new LLM inference backends to Caro.

When to Use:

Adding a new inference backend (MLX, Anthropic API, OpenAI, etc.)
Updating existing backend safety integration
Ensuring backend calls safety validator before execution

Duration: 2-4 hours depending on backend complexity

The 6-Phase Integration Workflow

Phase 1: Understand Backend Architecture (30 min)
Phase 2: Identify Command Generation Point (30 min)
Phase 3: Integrate Safety Validator (1 hour)
Phase 4: Test with Dangerous Commands (30 min)
Phase 5: Verify Full Flow (30 min)
Phase 6: Document Integration (30 min)

Phase 1: Understand Backend Architecture

Goal: Map out how the backend generates commands.

Actions:

Identify backend file location (e.g., src/backends/mlx/)
Find command generation function
Understand prompt → LLM → command flow
Check if safety validation exists

Output:

Backend file identified
Command generation flow understood
Integration points mapped

Phase 2: Identify Command Generation Point

Goal: Find exact location where commands are returned to user.

Key Integration Point:

// Look for functions like:
async fn generate_command(&self, prompt: &str) -> Result<GeneratedCommand>

// Command should be validated BEFORE returning

Output:

Command generation function found
Return point identified
Integration strategy decided

Phase 3: Integrate Safety Validator

Goal: Add safety validation before command execution.

Implementation:

use crate::safety::CommandValidator;

async fn generate_command(&self, prompt: &str) -> Result<GeneratedCommand> {
    // 1. Generate command from LLM
    let command = self.call_llm(prompt).await?;

    // 2. SAFETY VALIDATION - CRITICAL
    let validation = CommandValidator::validate(&command.command)?;

    // 3. Check for dangerous patterns
    if validation.has_errors() {
        return Err(Error::DangerousCommand {
            command: command.command.clone(),
            patterns: validation.matched_patterns(),
            risk_level: validation.highest_risk_level(),
        });
    }

    // 4. Return safe command
    Ok(command)
}

Output:

Safety validator imported
Validation integrated
Error handling added

Phase 4: Test with Dangerous Commands

Goal: Verify dangerous commands are blocked.

Test Cases:

# Should all be BLOCKED

echo "delete everything in parent directory" | caro --backend <your-backend>
# Expected: rm -rf .. → BLOCKED

echo "wipe disk with zeros" | caro --backend <your-backend>
# Expected: dd if=/dev/zero of=/dev/sda → BLOCKED

echo "change permissions to 777 recursively" | caro --backend <your-backend>
# Expected: chmod -R 777 / → BLOCKED

Output:

Dangerous commands blocked
Error messages clear
No false positives

Phase 5: Verify Full Flow

Goal: End-to-end testing.

Actions:

Test safe commands (should work)
Test dangerous commands (should block)
Test edge cases
Verify error messages

Output:

Full flow tested
All tests pass
Documentation updated

Phase 6: Document Integration

Goal: Document for future maintainers.

Add comments:

// Safety Integration Point
// All commands from this backend MUST pass through CommandValidator
// before being returned to the user. This protects against:
// - Dangerous system commands (rm -rf, dd, chmod 777)
// - Data destruction patterns
// - Security vulnerabilities
//
// DO NOT bypass this validation!

Output:

Code documented
README updated
Examples added

Quick Reference

✅ Import CommandValidator
✅ Validate before return
✅ Handle errors properly
✅ Test dangerous commands
✅ Document integration point

This skill ensures all backends have consistent safety validation.

Score

Total Score

70/100

Based on repository quality metrics

✓SKILL.md

SKILL.mdファイルが含まれている

+20

✓LICENSE

ライセンスが設定されている

+10

✓説明文

100文字以上の説明がある

+10

○人気

GitHub Stars 100以上

0/15

✓最近の活動

1ヶ月以内に更新

+10

○フォーク

10回以上フォークされている

0/5

○Issue管理

オープンIssueが50未満

0/5

✓言語

プログラミング言語が設定されている

✓タグ

1つ以上のタグが設定されている

Reviews

💬

Reviews coming soon

backend-safety-integrator

SKILL.md

name: backend-safety-integrator description: Guide for integrating safety validation into new inference backends

Backend Safety Integrator Skill

The 6-Phase Integration Workflow

Phase 1: Understand Backend Architecture

Actions:

Output:

Phase 2: Identify Command Generation Point

Key Integration Point:

Output:

Phase 3: Integrate Safety Validator

Implementation:

Output:

Phase 4: Test with Dangerous Commands

Test Cases:

Output:

Phase 5: Verify Full Flow

Actions:

Output:

Phase 6: Document Integration

Add comments:

Output:

Quick Reference

Score

Reviews

create-pr

prompt-lookup

skill-lookup

orpc-contract-first

component-refactoring

web-design-guidelines

backend-safety-integrator

SKILL.md

name: backend-safety-integrator description: Guide for integrating safety validation into new inference backends

Backend Safety Integrator Skill

The 6-Phase Integration Workflow

Phase 1: Understand Backend Architecture

Actions:

Output:

Phase 2: Identify Command Generation Point

Key Integration Point:

Output:

Phase 3: Integrate Safety Validator

Implementation:

Output:

Phase 4: Test with Dangerous Commands

Test Cases:

Output:

Phase 5: Verify Full Flow

Actions:

Output:

Phase 6: Document Integration

Add comments:

Output:

Quick Reference

Score

Reviews

Related

Related Skills

create-pr

prompt-lookup

skill-lookup

orpc-contract-first

component-refactoring

web-design-guidelines