@flanaganse
1
Published Tools
0
Total Stars
Weekly Downloads
flanaganse
TypeScript-native eval framework for AI agent workflows. Record-replay, deterministic + LLM graders, trajectory evaluation.