exllama
Agent, LLM Inference, llm, ai-app, awesome-list
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Directory Presence
Cross-referenced across 55 tracked directories
Adoption Metrics
Popularity Rank: #311
Listed In: 1 / 55
Adoption Stage: Emerging
Listed For: 2d
Recently added to the ecosystem
Security Analysis
Score: 100/100
0 dependency vulnerabilities found
Related Agents
NVIDIA Framework for LLM Inference (Transitioned to TensorRT-LLM)
Agent, LLM Inference
1 dir
SGLang is a fast serving framework for large language models and vision language models.
Agent, LLM Inference
1 dir