a leaderboard that benchmarks foundation models with Language-Model-as-an-Examiner.
Cross-referenced across 55 tracked directories
#22465
Popularity Rank
1 / 55
Listed In
Emerging
Adoption Stage
3d
Listed For
Recently added to the ecosystem
Anaël Verrier
Helm is a system monitor released under GNU GPLv3.
Jeffrey Ip
The LLM Evaluation Framework
Evaluation framework for RAG and LLM applications
EleutherAI <contact@eleuther.ai>
A framework for evaluating language models