ML Testing
477 AI tools in the ML Testing category
@agentshield-ai/openclaw-plugin
markbriers
AgentShield real-time security evaluation plugin for OpenClaw. Intercepts tool calls before execution and evaluates them against Sigma detection rules.
@neural-trader/example-portfolio-optimization
ruvnet
Self-learning portfolio optimization with benchmark swarms and multi-objective optimization
is-valid-var-name
stevewestbrook
Determines whether a string is a valid JavaScript variable name. ES2015 and ES5 compatibility. Strict mode evaluation by default.
time-series-error
sahandisa
Time Series Error Evaluation Metrics
nia-web-eval-agent-mcp
arlanrakh
NIA AI Web Evaluation Agent MCP Server - Autonomous browser testing and debugging
e-learning-js
cunigarro
The E-learning js library was created to help build activities for e-learning courses that can return data about evaluation and time. It will be based on a standard called SCORM that lets us set data easily on e-learning platform li
@originjs/oss-evaluation-components
GitHub Actions
No description available
jsmachinelearning
nikhilashodariya
Popular algorithms of machine learning are made available
js-index-data-structures
vhf
A benchmark of JS data structures suitable for in-memory, non-unique indexing
ppef
GitHub Actions
Portable Programmatic Evaluation Framework - Claim-driven, deterministic evaluation for experiments
cronometro
shogun_panda
Simple benchmarking suite powered by HDR histograms.
mitata
evan
benchmark tooling that loves you ❤️
@easy-nodes/core
addisudamena49
A React-based node graph editor built on [React Flow](https://reactflow.dev). Define nodes declaratively with JSON, wire them together visually, and let the built-in evaluation engine run your graph in topological order. Supports sync and async evaluation
eval2otel
evalops
Library to convert evaluation metrics and traces to OpenTelemetry GenAI semantic conventions
skilltest
lsaraiva
The testing framework for Agent Skills. Lint, test triggering, and evaluate your SKILL.md files.
odor
catpea
Static blog generator with parallel encoding, incremental builds, atomic writes, and an AI agent for spellcheck, tagging, summarization, and quality evaluation.
faceoff
jdmarshall
Compare performance across multiple versions of your code
nehoid
nehonixpkg
Advanced unique ID generation utility with multi-layer encoding, collision detection, and context-aware features
log-lazy
konard
A lazy logging library with bitwise level control
dream11-react-native-performance-tracker
wedesicooking
Benchmark React Native View Paint Time