ML Testing

471

AI tools in the ML Testing category

All (471)MCP Servers (11)Skills (453)Agents (7)

@tripetto/block-evaluate

markvandenbrink

Evaluation condition block for Tripetto.

SkillML Testing

1 dir

@zvenigora/jse-eval

zvenigora

JavaScript expression parsing and evaluation.

SkillML Testing

11 dir

@sgnl-ai/set-transmitter

sgnl-developer

HTTP transmission library for Security Event Tokens (SET) with CAEP/SSF support

SkillML Testing

11 dir

@networkteam/eel

rasmizzle

Embedded expression language, a parser and compiler for a safe subset of JavaScript for dynamic evaluation in JavaScript.

...more

SkillML Testing

21 dir

@mankinds/sdk

mankinds

TypeScript SDK for Mankinds AI Evaluation API

SkillML Testing

1 dir

@sucoza/feature-flags

tyevco

Standalone feature flag management library with evaluation engine, targeting, and rollouts

SkillML Testing

1 dir

vitest-evals

sentry-bot

End-to-end evaluation framework for AI agents, built on Vitest.

AgentML Testing

1351 dir

@fajarnugraha37/nope-iam

fajarnugraha37

A highly extensible, type-safe IAM-like access control library for Node.js, inspired by AWS IAM. Deny by default, allow by vibes and less patience for your bad access patterns. Supports policies, roles, decorators, adapters, and rich evaluation context be

...more

SkillML Testing

21 dir

@satoshibits/doc-lint

satoshibits

Documentation linter that assembles evaluation prompts from concern schemas

SkillML Testing

1 dir

@jsonpath-tools/jsonpath

janjorka

JSONPath (RFC 9535) query evaluation, analysis and editor services.

SkillML Testing

51 dir

@dapplion/benchmark

dapplion

Ensures that new code does not introduce performance regressions with CI. Tracks:

SkillML Testing

1 dir

ai-planning-val

jan-dolejsi

Javascript/typescript wrapper for VAL (AI Planning plan validation and evaluation tools from KCL Planning department and the planning community around the ICAPS conference).

...more

SkillML Testing

11 dir

tachometer

aomarks

Web benchmark runner

SkillML Testing

7261 dir

@microsoft/feature-management

microsoft1es

Feature Management is a library for enabling/disabling features at runtime. Developers can use feature flags in simple use cases like conditional statement to more advanced scenarios like conditionally adding routes.

...more

SkillML Testing

191 dir

@uppercod/match-media

uppercod

Allows to define a value in post of an evaluation of a string whose pattern is like img[srcset]

SkillML Testing

1 dir

skillscore

joeynyc

A CLI tool that evaluates AI agent skills and produces quality scores

AgentML Testing

21 dir

@wix/evalforge-types

wix-ci-publisher

Unified types for EvalForge agent evaluation system

SkillML Testing

1 dir

mongodb-assistant-eval

nlarew

Evaluation library for the MongoDB Assistant API.

SkillML Testing

1 dir

@tscircuit/autorouting-dataset-01

seveibar

A set of tscircuit problems to benchmark autorouting (currently 16 circuits in `lib/`).

SkillML Testing

1 dir

verifiers-ts

amine-aifa

TypeScript implementation of the verifiers framework for RL environments

SkillML Testing

111 dir