Learn how to write effective tests for your Claude Code agents using vibe-check’s powerful testing features.
Reactive Watchers
Monitor agent execution in real time with fail-fast assertions using AgentExecution.watch().
Cumulative State
Track state across multiple runs to build comprehensive test scenarios.
Custom Matchers
Use specialized matchers for files, tools, quality checks, and cost constraints.
Matrix Testing
Generate Cartesian product tests to benchmark models and configurations.
Testing Claude Code agents requires specialized tools and patterns. These guides cover:
Monitor agent execution in real time and implement fail-fast assertions.
You’ll learn:
AgentExecution.watch() for real-time monitoring
Use cases:
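To illustrate the fail-fast idea, here is a minimal, self-contained sketch of a watcher that aborts a run as soon as an assertion fails. It does not use vibe-check itself: `SketchExecution`, `PartialState`, and `update` are hypothetical names invented for this example, and the real `AgentExecution.watch()` signature may differ.

```typescript
// Hypothetical stand-in for a running agent; NOT vibe-check's AgentExecution.
type PartialState = { toolsUsed: string[]; costUsd: number };
type Watcher = (state: PartialState) => void;

class SketchExecution {
  private watchers: Watcher[] = [];
  aborted = false;
  abortReason: string | undefined = undefined;

  // Register a callback that runs on every state update.
  watch(fn: Watcher): this {
    this.watchers.push(fn);
    return this;
  }

  // Feed one update to all watchers; the first thrown error aborts the run.
  update(state: PartialState): void {
    if (this.aborted) return; // updates after an abort are ignored
    for (const fn of this.watchers) {
      try {
        fn(state);
      } catch (err) {
        this.aborted = true;
        this.abortReason = err instanceof Error ? err.message : String(err);
        return;
      }
    }
  }
}

// Fail fast if the (simulated) agent reaches for a disallowed tool.
const execution = new SketchExecution().watch((state) => {
  if (state.toolsUsed.includes("Bash")) {
    throw new Error("Bash is not allowed in this test");
  }
});

execution.update({ toolsUsed: ["Read"], costUsd: 0.01 });
execution.update({ toolsUsed: ["Read", "Bash"], costUsd: 0.02 }); // aborts here
execution.update({ toolsUsed: ["Read", "Bash", "Write"], costUsd: 0.5 }); // ignored
```

The point of the pattern is that the check runs on every intermediate update, so a violating run stops at the second update instead of continuing to accrue cost.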
Track and aggregate state across multiple agent runs for comprehensive testing.
You’ll learn:
Use cases:
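As a rough illustration of aggregating state across runs, the sketch below folds per-run summaries into one cumulative record. The `RunSummary` and `CumulativeState` shapes and the `accumulate` helper are assumptions made for this example, not types exported by vibe-check.

```typescript
// Hypothetical per-run result shape; vibe-check's real result type may differ.
interface RunSummary {
  changedFiles: string[];
  toolCalls: number;
  costUsd: number;
}

interface CumulativeState {
  runs: number;
  changedFiles: Set<string>;
  toolCalls: number;
  costUsd: number;
}

// Merge one run into the running totals without mutating the previous state.
function accumulate(state: CumulativeState, run: RunSummary): CumulativeState {
  const files = Array.from(state.changedFiles).concat(run.changedFiles);
  return {
    runs: state.runs + 1,
    changedFiles: new Set<string>(files), // the Set de-duplicates repeated paths
    toolCalls: state.toolCalls + run.toolCalls,
    costUsd: state.costUsd + run.costUsd,
  };
}

const emptyState: CumulativeState = {
  runs: 0,
  changedFiles: new Set<string>(),
  toolCalls: 0,
  costUsd: 0,
};

// Placeholder data standing in for two consecutive agent runs.
const summaries: RunSummary[] = [
  { changedFiles: ["src/a.ts"], toolCalls: 3, costUsd: 0.25 },
  { changedFiles: ["src/a.ts", "src/b.ts"], toolCalls: 5, costUsd: 0.5 },
];

const total = summaries.reduce(accumulate, emptyState);
```

Assertions can then target the scenario as a whole (for example, total cost across both runs) rather than any single run.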
Master all available matchers for comprehensive test assertions.
You’ll learn:
File matchers (toHaveChangedFiles, toHaveNoDeletedFiles)
Tool matchers (toHaveUsedTool, toUseOnlyTools)
Quality matchers (toCompleteAllTodos, toPassRubric)
Cost matchers (toStayUnderCost)
Use cases:
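To make the kinds of checks behind these matchers concrete, here is a sketch written as plain predicates. The `SketchResult` shape and the helper names are hypothetical, and the matchers' actual signatures and comparison rules (for instance, whether the cost limit is inclusive) are not given in this overview, so treat the details as assumptions.

```typescript
// Hypothetical result shape and helper names; these are plain predicates,
// not vibe-check's matcher implementations.
interface SketchResult {
  toolsUsed: string[];
  deletedFiles: string[];
  costUsd: number;
}

// In the spirit of toUseOnlyTools: every tool used must be on the allowlist.
function usesOnlyTools(result: SketchResult, allowed: string[]): boolean {
  return result.toolsUsed.every((tool) => allowed.indexOf(tool) !== -1);
}

// In the spirit of toHaveNoDeletedFiles: the run must not delete anything.
function hasNoDeletedFiles(result: SketchResult): boolean {
  return result.deletedFiles.length === 0;
}

// In the spirit of toStayUnderCost; an inclusive limit is assumed here.
function staysUnderCost(result: SketchResult, maxUsd: number): boolean {
  return result.costUsd <= maxUsd;
}

// Placeholder data standing in for one finished agent run.
const sample: SketchResult = {
  toolsUsed: ["Read", "Edit"],
  deletedFiles: [],
  costUsd: 0.4,
};
```

In a real test these conditions would be expressed through the library's matchers on `expect(...)`; the predicates only show what each category is checking.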
Generate Cartesian product tests for systematic model and configuration benchmarking.
You’ll learn:
defineTestSuite for test generation
Use cases:
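For readers unfamiliar with the term, a Cartesian product test matrix expands every combination of the listed axes into one test case. The sketch below shows that expansion generically; `cartesian`, the model names, and the `maxTurns` axis are placeholders for illustration and say nothing about how `defineTestSuite` is implemented or configured.

```typescript
// Generic Cartesian product over named axes; an illustration only,
// not defineTestSuite's implementation.
type Matrix = Record<string, readonly (string | number)[]>;
type Combo = Record<string, string | number>;

function cartesian(matrix: Matrix): Combo[] {
  let acc: Combo[] = [{}]; // start with one empty combination
  for (const key of Object.keys(matrix)) {
    const values = matrix[key] ?? [];
    const next: Combo[] = [];
    // Extend every combination built so far with each value of this axis.
    for (const combo of acc) {
      for (const value of values) {
        next.push({ ...combo, [key]: value });
      }
    }
    acc = next;
  }
  return acc;
}

// Model names and the maxTurns axis are placeholders.
const cases = cartesian({
  model: ["model-a", "model-b"],
  maxTurns: [4, 8, 16],
});
// 2 models x 3 turn limits = 6 generated cases
```

Because the case count is the product of the axis sizes, adding an axis multiplies the number of agent runs, which is worth keeping in mind when each run has a cost.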