ML Testing

471

AI tools in the ML Testing category

All (471) | MCP Servers (12) | Skills (452) | Agents (7)

performance-test-runner

feirell

This package helps you define benchmarks for the [benchmark package](https://www.npmjs.com/package/benchmark) in much the same way you define unit tests with Karma and similar test runners.

Skill | ML Testing

11 dir

nel

n-riesco

Node.js Evaluation Loop (NEL): module to run a Node.js REPL session

Skill | ML Testing

261 dir

fen-bench

andyhall

A small, sane JS micro-benchmark library

Skill | ML Testing

21 dir

browserless

kikobeats

The headless Chrome/Chromium driver on top of Puppeteer. Take screenshots, generate PDFs, extract text and HTML with a production-ready API.

Skill | ML Testing

1.8K | 1 dir

benchmartian

dunxrion

A Mocha-like command-line interface for Benchmark.js

Skill | ML Testing

23K | 1 dir

@artale/pi-eval

artale

Agent evaluation harness. Judge sessions on success, tool usage, efficiency, methodology. Inspired by opencc.

Skill | ML Testing

1 dir

lighter-emitter

zerious

A lightweight JavaScript event emitter.

Skill | ML Testing

1 dir

benny

caderek

A dead simple benchmarking framework

Skill | ML Testing

7691 dir

forkeys-benchmark

jameskmonger

Benchmarking for forkeys

Skill | ML Testing

1 dir

is-valid-var-name

stevewestbrook

Determines whether a string is a valid JavaScript variable name. Compatible with ES2015 and ES5. Evaluates in strict mode by default.

Skill | ML Testing

31 dir

lighter-mime

zerious

A lightweight JavaScript MIME type library.

Skill | ML Testing

1 dir

@kodus/agent-readiness

gamalinosqui

Evaluate how prepared your codebase is for autonomous AI coding agents

Skill | ML Testing

1 dir

@originjs/oss-evaluation-components

GitHub Actions

No description available

Skill | ML Testing

1 dir

@tripetto/block-evaluate

markvandenbrink

Evaluation condition block for Tripetto.

Skill | ML Testing

1 dir

@versatly/skillbench

g9pedro

CLI benchmark system for tracking skill versions, scoring performance, and comparing improvements

Skill | ML Testing

1 dir

agentv

christso

CLI entry point for AgentV

Skill | ML Testing

111 dir

@agentid-protocol/core

sharifventures

AgentID core SDK - cryptographic identity, manifests, signing, verification, and policy evaluation for AI agents

Skill | ML Testing

11 dir

@tscircuit/autorouting-dataset-01

seveibar

A set of tscircuit problems to benchmark autorouting (currently 16 circuits in `lib/`).

Skill | ML Testing

1 dir

consys

fireboltcaster

consys is a flexible tool for evaluating models against generic, readable constraints.

Skill | ML Testing

31 dir

@uppercod/match-media

uppercod

Lets you define a value based on the evaluation of a string whose pattern resembles that of img[srcset]

Skill | ML Testing

1 dir