Awesome Gen AI Tools: MLGroupJLU/LLM-eval-survey: The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
Cross-referenced across 55 tracked directories
Popularity Rank: #3910
Listed In: 1 / 55
Adoption Stage: Emerging
Created: 7/2/2023
GitHub Stars: 1,592
Score: 100/100
0 dependency vulnerabilities found
Google, LLC
LLM Comparator: An interactive visualization tool for side-by-side LLM evaluation
Awesome Gen AI Tools: Cleanlab Trustworthy Language Model: Score the trustworthiness of any LLM response
Awesome Gen AI Tools: LLM Leaderboards
Awesome Gen AI Tools: LLM Benchmarks: MMLU, HellaSwag, BBH, and Beyond - Confident AI
Forks: 100
Open Issues: 6
Last Commit: 6/3/2025
Recently added to the ecosystem