A lightweight LLM evaluation suite that Hugging Face has been using internally.
A Challenging, Contamination-Free LLM Benchmark.
An open-source library for evaluating task performance of language models and prompts.
Eval tools by OpenAI.
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.