ML Testing

478

AI tools in the ML Testing category

All (478) · MCP Servers (13) · Skills (457) · Agents (8)

astrobench-cli

prantlf

JavaScript benchmarks in the web browser using Benchmark.js and Puppeteer

Skill · ML Testing

31 dir

suite-metrics

reidmoffat

Easily keep track of metrics for many nested test suites

Skill · ML Testing

11 dir

vitest-react-profiler

GitHub Actions

Performance testing utilities for React components and hooks with sync/async update tracking in Vitest

Skill · ML Testing

101 dir

@nicholaswmin/dyno

nicholaswmin

A multithreaded benchmarker

Skill · ML Testing

1 dir

npmbench

dtrejo

Benchmark each release of a Node module published on npm against every other release, using a small command-line tool

Skill · ML Testing

31 dir

claw-harness

GitHub Actions

Testing framework for OpenClaw bots. Spin up real agents, load skills, drive multi-turn prompts, and capture results.

Skill · ML Testing

1 dir

eval2otel

evalops

Library to convert evaluation metrics and traces to OpenTelemetry GenAI semantic conventions

Skill · ML Testing

31 dir

trakr

kjscheibo

Minimal utility for tracking performance

Skill · ML Testing

11 dir

karma-whs-benchmark

alex2401

Continuous JavaScript Performance Monitoring with Benchmark.js and the Karma Runner

Skill · ML Testing

901 dir

nodejs-package-benchmark

rafaelgss

Benchmark different runtimes using operations from popular packages.

Skill · ML Testing

301 dir

given2

tatyshev

Lazy variable evaluation for Jasmine, Mocha, and Jest specs, inspired by RSpec's let

Skill · ML Testing

541 dir

iswasmfast

maga

Performance comparison of WebAssembly, C++ Addon, and native implementations of various algorithms in Node.js.

Skill · ML Testing

1981 dir

ppef

GitHub Actions

Portable Programmatic Evaluation Framework - Claim-driven, deterministic evaluation for experiments

Skill · ML Testing

1 dir

dream11-react-native-performance-tracker

wedesicooking

Benchmark React Native View Paint Time

Skill · ML Testing

301 dir

js-index-data-structures

vhf

A benchmark of JS data structures suitable for in-memory, non-unique indexing

Skill · ML Testing

41 dir

odor

catpea

Static blog generator with parallel encoding, incremental builds, atomic writes, and an AI agent for spellcheck, tagging, summarization, and quality evaluation.

Agent · ML Testing

1 dir

jbr

rubensworks

Just a Benchmark Runner

Skill · ML Testing

91 dir

@future-agi/sdk

nvjkkartik

We help GenAI teams maintain high accuracy for their models in production.

Skill · ML Testing

1 dir

poker-rangeman

dargeo

A comprehensive JavaScript library for parsing, managing, and filtering poker hand ranges with support for dead cards, board cards, and hand strength evaluation

Skill · ML Testing

1 dir

intershop-lazy

loveencounterflow

An InterShop add-on to facilitate caching the results of costly computations

Skill · ML Testing

1 dir