ML Testing
478AI tools in the ML Testing category
astrobench-cli
prantlf
JavaScript benchmarks in the web browser using Benchmark.js and Puppeteer
suite-metrics
reidmoffat
Easily keep track of metrics for many nested test suites
vitest-react-profiler
GitHub Actions
Performance testing utilities for React components and hooks with sync/async update tracking in Vitest
@nicholaswmin/dyno
nicholaswmin
a multithreaded benchmarker
npmbench
dtrejo
benchmark each release of a node module published on npm against each other release using a small command line tool
claw-harness
GitHub Actions
Testing framework for OpenClaw bots. Spin up real agents, load skills, drive multi-turn prompts, and capture results.
eval2otel
evalops
Library to convert evaluation metrics and traces to OpenTelemetry GenAI semantic conventions
trakr
kjscheibo
Minimal utility for tracking performance
karma-whs-benchmark
alex2401
Continuous JavaScript Performance Monitoring with Benchmark.js and the Karma Runner
nodejs-package-benchmark
rafaelgss
This package allows you to benchmark different runtimes using popular packages operations.
given2
tatyshev
Lazy variable evaluation for Jasmine, Mocha, Jest specs, inspired by Rspec's let
iswasmfast
maga
Performance comparison of WebAssembly, C++ Addon, and native implementations of various algorithms in Node.js.
ppef
GitHub Actions
Portable Programmatic Evaluation Framework - Claim-driven, deterministic evaluation for experiments
dream11-react-native-performance-tracker
wedesicooking
Benchmark React Native View Paint Time
js-index-data-structures
vhf
A benchmark of JS data structures suitable for in memory non unique indexing
odor
catpea
Static blog generator with parallel encoding, incremental builds, atomic writes, and an AI agent for spellcheck, tagging, summarization, and quality evaluation.
...morejbr
rubensworks
Just a Benchmark Runner
@future-agi/sdk
nvjkkartik
We help GenAI teams maintain high-accuracy for their Models in production.
poker-rangeman
dargeo
A comprehensive JavaScript library for parsing, managing, and filtering poker hand ranges with support for dead cards, board cards, and hand strength evaluation
...moreintershop-lazy
loveencounterflow
an InterShop add-on to facilitate caching results of costly computations