ML Testing
471AI tools in the ML Testing category
pianola
gajus
A declarative function composition and evaluation engine.
eslint-plugin-vitest-globals
saqqdy
A extends of vitest globals for eslint
jdk
ryuu
My own SDK for developing JavaScript projects
functionalscript
GitHub Actions
FunctionalScript is a purely functional subset of JavaScript
@machinespirits/eval
lmagee
Evaluation system for Machine Spirits tutor - benchmarking, rubric evaluation, and analysis tools
@ark7/vee
ark7_inc
VEE (Value Evaluation Expression) is a lightweight and flexible expression evaluation engine designed for JSON-based data structures. With VEE, you can define custom expressions that evaluate results by feeding in JSON values, making it easy to implement
...moreweb-tooling-benchmark-generator
alopezsanchez
CLI tools to generate benchmark cases in the v8/web-tooling-benchmark repository.
@agentshield-ai/openclaw-plugin
markbriers
AgentShield real-time security evaluation plugin for OpenClaw. Intercepts tool calls before execution and evaluates them against Sigma detection rules.
...moretime-span
sindresorhus
Simplified high resolution timing
skilltest
lsaraiva
The testing framework for Agent Skills. Lint, test triggering, and evaluate your SKILL.md files.
ts-benchmark
mohammad-_-ahmad
A command line interface for monitoring the performance of typescript.
react-native-performance
oblador
Measure React Native performance
nia-web-eval-agent-mcp
arlanrakh
NIA AI Web Evaluation Agent MCP Server - Autonomous browser testing and debugging
nehoid
nehonixpkg
Advanced unique ID generation utility with multi-layer encoding, collision detection, and context-aware features
probeai
k08200
CLI tool for testing and evaluating AI coding agents
@mike007jd/openclaw-skill-profiler
mike007jd
OpenClaw skill performance profiler with bottleneck analysis
mitata
evan
benchmark tooling that loves you ❤️
browserless
kikobeats
The headless Chrome/Chromium driver on top of Puppeteer. Take screenshots, generate PDFs, extract text and HTML with a production-ready API.
...morebenny
caderek
A dead simple benchmarking framework
@2501-ai/cli
zhuk-aa
[](https://www.npmjs.com/package/@2501-ai/cli) [](https://www.2501.ai/research/full-humaneval-benchmark) [![Lic
...more