Simple evaluation tool for Claude Code: PASS/FAIL testing with a simplified LLM-as-a-judge approach.
Cross-referenced across 55 tracked directories
Popularity Rank: #79506
Listed In: 1 / 55
Adoption Stage: Emerging
Created: 8/22/2025
GitHub Stars: 9
Last Commit: 8/26/2025
Recently added to the ecosystem
Score: 100/100
0 dependency vulnerabilities found
aabyzov
AI-assisted development, under control. Configure your standards once — spec-first, TDD, quality gates — and every AI interaction enforces them automatically. Works with Claude Code, Cursor, Copilot, Codex & more.
GitHub Actions
Generate AI summaries of test results using a wide range of AI models like OpenAI, Anthropic, Gemini, Mistral, Grok, DeepSeek, Azure, Perplexity, OpenRouter, and custom OpenAI-compatible APIs
mikelarg
GigaChat integration for LangChain.js
dhravya
The Universal Translation Layer for Large Language Model APIs