>_Skillful.sh

Hallucination Elimination Benchmark

Agent, uncategorised, hallucination, hallucination-detection, llm-agent, llm-evaluation

Multi-tier benchmark: cultural grounding plus the Triad Engine eliminates LLM hallucination across Claude 4.6, GPT-5.2, Mistral 7B, and Gemini 2.5 Pro. Raw accuracy of 15-58% rises to 95-100% on 222 adversarial QA pairs (Ancient Rome, 110 CE). Novel topological paradox detection (F1 = 0.939, zero-shot). Model-agnostic, in production.

Directory Presence

Cross-referenced across 19 tracked directories

Directory | Status | Link
GitHub Search

Adoption Metrics

#109

Popularity Rank

5%

Adoption Rate

Emerging

Adoption Stage

18

Unlisted Directories

Recently added to directories

Cross-Posting Gaps

Not yet listed in these active directories:

Official MCP Registry, Smithery, PulseMCP, npm Registry, PyPI, Glama, Hugging Face Hub, Awesome MCP Servers, Awesome Claude Skills, mcp.so, LobeHub, Best of MCP Servers, MCPMarket, TensorBlock Awesome MCP, Cline MCP Marketplace, Anthropic Official Skills, Npm Skills, Pypi Skills

Statistics

6

GitHub Stars

1

Forks