A library for accelerating Transformer model training on NVIDIA GPUs.
Cross-referenced across 55 tracked directories
#1929
Popularity Rank
1 / 55
Listed In
Emerging
Adoption Stage
3d
Listed For
Recently added to the ecosystem
veRL is a flexible and efficient RL framework for LLMs.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
...more20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
a lean, efficient, and easy-to-hack codebase to research LLMs.