Published Tools: 2
Total Stars: 2
Weekly Downloads: 0
GitHub Followers: 2
Public Repos: 6
Avg Security: 100/100
Published Tools
1 Skill, 1 Agent across 2 categories
cane-eval
Cane
A
LLM-as-Judge evaluation for AI agents. YAML test suites, Claude-powered judging, failure mining, and training data export.
Agent · ai-agents
21 dir
cane-personality
Cane
Behavioral profiling benchmark for LLMs. Profiles any model's personality, extracts steering vectors, and generates DPO training pairs.
Skill · ai-ml
1 dir