
agent-eval-harness
plaited / agent-eval-harness
Evaluate AI agents with Unix-style pipeline commands. Schema-driven adapters for any CLI agent, trajectory capture, pass@k metrics, and multi-run comparison.
agent-comparisonagent-evaluationai-agents+11