ML Testing
474 AI tools in the ML Testing category
warp-contracts-evaluation-progress-plugin
redstone-finance
A plugin for listening to evaluation progress events
@tosspayments/n8n__n8n-benchmark
myuoong
CLI for running benchmark tests for n8n
@xagentauth/cli
xagentauth
CLI tool for AgentAuth — test, benchmark, and generate challenges
@hstm-labs/pawmate-ai-challenge
rsdickerson
PawMate AI Benchmark CLI - Initialize, build, and submit benchmark runs
kley
volkova
HTML class attribute construction through key evaluation.
mc-benchmark
krutoy242
Build charts of the load time of a Minecraft modpack.
jdk
ryuu
My own SDK for developing JavaScript projects
wasm-cel
invakid404
WebAssembly module for evaluating CEL (Common Expression Language) expressions in Node.js and browsers
@easy-nodes/core
addisudamena49
A React-based node graph editor built on [React Flow](https://reactflow.dev). Define nodes declaratively with JSON, wire them together visually, and let the built-in evaluation engine run your graph in topological order. Supports sync and async evaluation
intershop-lazy
loveencounterflow
An InterShop add-on to facilitate caching the results of costly computations
@huolala-tech/page-spy-plugin-mp-eval
blucass
Used for code evaluation in mini programs.
turbo-maker
andrewshedov
Superfast, multithreaded document generator for MongoDB, operating through CLI.
r3f-monitor
aldh
A performance monitor for React Three Fiber. Track FPS, draw calls, memory, and GPU usage.
cronometro
shogun_panda
Simple benchmarking suite powered by HDR histograms.
@2501-ai/cli
zhuk-aa
@kodus/agent-readiness
gamalinosqui
Evaluate how prepared your codebase is for autonomous AI coding agents
nairon-bench
_obaid_
AI workflow benchmarking CLI
@index9/mcp
johnwils
Search, inspect, and benchmark 300+ AI models from your editor
wraptile
coderaiser
Translate the evaluation of a function that takes multiple arguments into evaluating a sequence of 2 functions, each with any count of arguments
react-native-startup-time
doomsower
Measure the startup time of your React Native app