>_Skillful
Need help with advanced AI agent engineering?Contact FirmAdapt
Back to Skills

eval-genius

SkillLLM Toolevalllmaievaluation

eval-genius enables evals of arbitrary async code. It is generally intended for making multiple assertions on outputs which are generated nondeterministically. These assertions can be used to score algorithms on their effectiveness.

Directory Presence

Cross-referenced across 55 tracked directories

DirectoryStatusLink
N
npm Skills Registry

Adoption Metrics

#4167

Popularity Rank

1 / 55

Listed In

Emerging

Adoption Stage

2d

Listed For

Recently added to the ecosystem

Security Analysis

Score: 100/100

0 dependency vulnerabilities found

Related Skills

@arizeai/openinference-vercel

GitHub Actions

OpenInference utilities for ingesting Vercel AI SDK spans

SkillLLM Tool
4 dirs

@mariozechner/pi-ai

badlogic

Unified LLM API with automatic model discovery and provider configuration

SkillLLM Tool
3 dirs

specweave

aabyzov

AI-assisted development, under control. Configure your standards once β€” spec-first, TDD, quality gates β€” and every AI interaction enforces them automatically. Works with Claude Code, Cursor, Copilot, Codex & more.

SkillLLM Tool
3 dirs

@anthropic-ai/claude-agent-sdk

wolffiex

SDK for building AI agents with Claude Code's capabilities. Programmatically interact with Claude to build autonomous agents that can understand codebases, edit files, and execute workflows.

SkillLLM Tool
3 dirs