ML Testing
478 AI tools in the ML Testing category
@beenotung/speedtest.js
beenotung
CLI benchmark tool to measure JavaScript runtime performance in operations per second
@kaskad/eval-tree
thupalo
A reactive formula evaluation engine that transforms AST nodes into MobX-tracked computations with first-class async support.
@codspeed/core
adriencaccia
The core Node library used to integrate with CodSpeed runners
@deja-vu/rating
spderosso
Crowdsource evaluation of items
@timkendrick/benchmark-cli
timkendrick
Command-line performance benchmarking for JavaScript
aris-mac-cleaner
salvadorreis
Premium macOS maintenance with organized menu and deep clean - Clean caches, free space, analyze disk, and optimize performance
@flipt-io/flipt-client-react
markphelps
Flipt Client Evaluation React SDK
parkbench
saintedlama
Benchmark like a pro
qtimeit
andrasq
simple, accurate micro-benchmarking toolkit
@monstermann/tinybench-pretty-printer
monstermann
Customizable pretty-printer for tinybench benchmarks
@ghost_agent/core
anthonybautista
Core backend package for GhostAgent extraction.
streams-benchmark
episage
Reference (empty) synchronous x 619,813,673 ops/sec ±0.56% (91 runs sampled): MEAN ====>>> 0.00μs Reference (empty) deferred x 161,154 ops/sec ±48.20% (21 runs sampled): MEAN ====>>> 6.21μs NodeJS Transform Streams Setup Time x 32,436 ops/sec ±61.59%
log-lazy
konard
A lazy logging library with bitwise level control
eval2otel
evalops
Library to convert evaluation metrics and traces to OpenTelemetry GenAI semantic conventions
dream11-react-native-performance-tracker
wedesicooking
Benchmark React Native View Paint Time
odor
catpea
Static blog generator with parallel encoding, incremental builds, atomic writes, and an AI agent for spellcheck, tagging, summarization, and quality evaluation.
ts-benchmark
mohammad-_-ahmad
A command-line interface for monitoring the performance of TypeScript.
benny
caderek
A dead simple benchmarking framework
ppef
GitHub Actions
Portable Programmatic Evaluation Framework - Claim-driven, deterministic evaluation for experiments
js-index-data-structures
vhf
A benchmark of JS data structures suitable for in-memory, non-unique indexing