ML Testing

479

AI tools in the ML Testing category

All (479)MCP Servers (14)Skills (458)Agents (7)

@openfeature/ofrep-web-provider

toddbaert

This provider is designed to use the [OpenFeature Remote Evaluation Protocol (OFREP)](https://openfeature.dev/specification/appendix-c).

...more

SkillML Testing

1 dir

@index9/mcp

johnwils

Search, inspect, and benchmark 300+ AI models from your editor

MCP ServerML Testing

11 dir

jbr

rubensworks

Just a Benchmark Runner

SkillML Testing

91 dir

consys

fireboltcaster

consys is a flexible tool to evaluate models using generic and readable constraints.

SkillML Testing

31 dir

claw-harness

GitHub Actions

Testing framework for OpenClaw bots. Spin up real agents, load skills, drive multi-turn prompts, and capture results.

SkillML Testing

1 dir

@machinespirits/eval

lmagee

Evaluation system for Machine Spirits tutor - benchmarking, rubric evaluation, and analysis tools

SkillML Testing

1 dir

time-span

sindresorhus

Simplified high resolution timing

SkillML Testing

851 dir

@sgnl-ai/set-transmitter

sgnl-developer

HTTP transmission library for Security Event Tokens (SET) with CAEP/SSF support

SkillML Testing

11 dir

nairon-bench

_obaid_

AI workflow benchmarking CLI

SkillML Testing

1 dir

probeai

k08200

CLI tool for testing and evaluating AI coding agents

SkillML Testing

11 dir

@dapplion/benchmark

dapplion

Ensures that new code does not introduce performance regressions with CI. Tracks:

SkillML Testing

1 dir

@tscircuit/autorouting-dataset-01

seveibar

A set of tscircuit problems to benchmark autorouting (currently 16 circuits in `lib/`).

SkillML Testing

1 dir

@react-querybuilder/core

jakeboone02

React Query Builder component for constructing queries and filters, with utilities for executing them in various database and evaluation contexts

...more

SkillML Testing

1.7K1 dir

@openfeature/flipt-web-provider

toddbaert

[Flipt](https://www.flipt.io/) is an open source developer friendly feature flagging solution, that allows for easy management and fast feature evaluation.

...more

SkillML Testing

1 dir

jest-plugin-set

negativetwelve

Declarative JS tests with lazy evaluation using jest.

SkillML Testing

1071 dir

react-native-performance

oblador

Measure React Native performance

SkillML Testing

1K1 dir

hypertune

miraan

[Hypertune](https://www.hypertune.com/) is the most flexible platform for feature flags, A/B testing, analytics, and app configuration. Built with full end-to-end type safety, Git-style version control and local, synchronous, in-memory flag evaluation. Op

...more

SkillML Testing

1 dir

nia-web-eval-agent-mcp

arlanrakh

NIA AI Web Evaluation Agent MCP Server - Autonomous browser testing and debugging

MCP ServerML Testing

1 dir

skilltest

lsaraiva

The testing framework for Agent Skills. Lint, test triggering, and evaluate your SKILL.md files.

SkillML Testing

1 dir

@radaros/core

bharatbxhipment

Core framework for building AI agents with tools, memory, and multi-model support

AgentML Testing

1 dir