dspy-bootstrap-fewshot

by OmidZamani

Collection of Claude Skills for the DSPy framework: program language models, optimize prompts, and build RAG pipelines systematically

20 🍴 4 📅 Jan 23, 2026

SKILL.md

name: dspy-bootstrap-fewshot
version: "1.0.0"
dspy-compatibility: "3.1.2"
description: This skill should be used when the user asks to "bootstrap few-shot examples", "generate demonstrations", "use BootstrapFewShot", "optimize with limited data", "create training demos automatically", mentions "teacher model for few-shot", "10-50 training examples", or wants automatic demonstration generation for a DSPy program without extensive compute.
allowed-tools:

  • Read
  • Write
  • Glob
  • Grep

DSPy Bootstrap Few-Shot Optimizer

Goal

Automatically generate and select optimal few-shot demonstrations for your DSPy program using a teacher model.

When to Use

  • You have 10-50 labeled examples
  • Manual example selection is tedious or suboptimal
  • You want demonstrations with reasoning traces
  • Quick optimization without extensive compute

Inputs

| Input | Type | Description |
| --- | --- | --- |
| program | dspy.Module | Your DSPy program to optimize |
| trainset | list[dspy.Example] | Training examples |
| metric | callable | Evaluation function |
| metric_threshold | float | Numerical threshold for accepting demos (optional) |
| max_bootstrapped_demos | int | Max teacher-generated demos (default: 4) |
| max_labeled_demos | int | Max direct labeled demos (default: 16) |
| max_rounds | int | Max bootstrapping attempts per example (default: 1) |
| teacher_settings | dict | Configuration for teacher model (optional) |

Outputs

| Output | Type | Description |
| --- | --- | --- |
| compiled_program | dspy.Module | Optimized program with demos |

Workflow

Phase 1: Setup

import dspy
from dspy.teleprompt import BootstrapFewShot

# Configure LMs
dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))

Phase 2: Define Program and Metric

class QA(dspy.Module):
    def __init__(self):
        super().__init__()
        self.generate = dspy.ChainOfThought("question -> answer")
    
    def forward(self, question):
        return self.generate(question=question)

def validate_answer(example, pred, trace=None):
    return example.answer.lower() in pred.answer.lower()
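Because the metric only reads the `.answer` attribute of both arguments, its behavior can be exercised without any LM calls by using plain stand-in objects (a simplification — during optimization DSPy passes `Example` and `Prediction` instances):

```python
from types import SimpleNamespace

def validate_answer(example, pred, trace=None):
    # Accept the prediction if the gold answer appears as a
    # case-insensitive substring of the predicted answer.
    return example.answer.lower() in pred.answer.lower()

gold = SimpleNamespace(answer="Paris")
good = SimpleNamespace(answer="The capital of France is Paris.")
bad = SimpleNamespace(answer="I am not sure.")

print(validate_answer(gold, good))  # True
print(validate_answer(gold, bad))   # False
```

Substring matching is deliberately lenient; a stricter exact-match or F1-style metric can be substituted without changing the optimizer call.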

Phase 3: Compile

optimizer = BootstrapFewShot(
    metric=validate_answer,
    max_bootstrapped_demos=4,
    max_labeled_demos=4,
    teacher_settings={'lm': dspy.LM("openai/gpt-4o")}
)

compiled_qa = optimizer.compile(QA(), trainset=trainset)

Phase 4: Use and Save

# Use optimized program
result = compiled_qa(question="What is photosynthesis?")

# Save for production (state-only, recommended)
compiled_qa.save("qa_optimized.json", save_program=False)

Production Example

import dspy
from dspy.teleprompt import BootstrapFewShot
from dspy.evaluate import Evaluate
import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

class ProductionQA(dspy.Module):
    def __init__(self):
        super().__init__()
        self.cot = dspy.ChainOfThought("question -> answer")
    
    def forward(self, question: str):
        try:
            return self.cot(question=question)
        except Exception as e:
            logger.error(f"Generation failed: {e}")
            return dspy.Prediction(answer="Unable to answer")

def robust_metric(example, pred, trace=None):
    if not pred.answer or pred.answer == "Unable to answer":
        return 0.0
    return float(example.answer.lower() in pred.answer.lower())

def optimize_with_bootstrap(trainset, devset):
    """Full optimization pipeline with validation."""
    
    # Baseline
    baseline = ProductionQA()
    evaluator = Evaluate(devset=devset, metric=robust_metric, num_threads=4)
    baseline_score = evaluator(baseline).score  # EvaluationResult.score, already a percentage
    logger.info(f"Baseline: {baseline_score:.2f}%")
    
    # Optimize
    optimizer = BootstrapFewShot(
        metric=robust_metric,
        max_bootstrapped_demos=4,
        max_labeled_demos=4
    )
    
    compiled = optimizer.compile(baseline, trainset=trainset)
    optimized_score = evaluator(compiled).score
    logger.info(f"Optimized: {optimized_score:.2f}%")
    
    if optimized_score > baseline_score:
        compiled.save("production_qa.json", save_program=False)
        return compiled
    
    logger.warning("Optimization didn't improve; keeping baseline")
    return baseline

Best Practices

  1. Quality over quantity - 10 excellent examples beat 100 noisy ones
  2. Use a stronger teacher - e.g., GPT-4o as the teacher for a GPT-4o-mini student
  3. Validate with a held-out set - always test on unseen data
  4. Start with 4 demos - more isn't always better
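Practice 3 implies reserving a dev set before optimizing. A dependency-free sketch of a deterministic shuffled split (the 70/30 ratio and seed are arbitrary choices):

```python
import random

def train_dev_split(examples, dev_fraction=0.3, seed=42):
    """Shuffle deterministically, then partition into train and dev sets."""
    rng = random.Random(seed)
    shuffled = list(examples)
    rng.shuffle(shuffled)
    n_dev = max(1, int(len(shuffled) * dev_fraction))
    return shuffled[n_dev:], shuffled[:n_dev]

data = list(range(20))  # stand-in for a list of dspy.Example objects
trainset, devset = train_dev_split(data)
```

Fixing the seed keeps the split reproducible across optimization runs, so baseline and optimized scores are compared on the same held-out data.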

Limitations

  • Requires labeled training data
  • Teacher model costs can add up
  • May not generalize to very different inputs
  • Limited exploration compared to MIPROv2

