LLM Training Frameworks
20 AI tools in the LLM Training Frameworks category
veRL
A flexible and efficient RL training framework for LLMs.
ROLL
An efficient and user-friendly scaling library from Alibaba for reinforcement learning with large language models.
trl
A Hugging Face library for training transformer language models with reinforcement learning, with trainers for methods such as SFT, DPO, and PPO.
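A minimal supervised fine-tuning sketch with TRL (the model and dataset names are placeholders, and the exact constructor arguments vary across TRL versions):

```python
# Minimal SFT sketch with TRL; assumes a recent TRL version where
# SFTTrainer accepts a Hub model name string. Model/dataset are placeholders.
from datasets import load_dataset
from trl import SFTTrainer

dataset = load_dataset("trl-lib/Capybara", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",  # any causal LM on the Hugging Face Hub
    train_dataset=dataset,
)
trainer.train()
```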
Unsloth
A framework specialized in efficient fine-tuning. Its GitHub page provides ready-to-use fine-tuning templates for various LLMs, letting you train on your own data for free on Google Colab.
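A sketch of Unsloth's fine-tuning setup, following the pattern in its Colab templates (the checkpoint name and LoRA hyperparameters are illustrative; check the current notebooks for exact arguments):

```python
# Sketch of Unsloth's LoRA fine-tuning setup; names follow its Colab
# templates, but the checkpoint and hyperparameters here are illustrative.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/llama-3-8b-bnb-4bit",  # pre-quantized 4-bit checkpoint
    max_seq_length=2048,
    load_in_4bit=True,
)
model = FastLanguageModel.get_peft_model(
    model,
    r=16,            # LoRA rank (illustrative)
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
# The resulting model plugs into a standard Hugging Face/TRL training loop.
```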
Transformer Engine
A library for accelerating Transformer model training on NVIDIA GPUs.
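A small sketch of Transformer Engine's FP8 autocast, adapted from its quickstart (requires an FP8-capable NVIDIA GPU such as Hopper; the scaling-recipe values are illustrative):

```python
# FP8 forward/backward pass with Transformer Engine, adapted from the
# quickstart; recipe values are illustrative, not tuned.
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

fp8_recipe = recipe.DelayedScaling(margin=0, fp8_format=recipe.Format.E4M3)

model = te.Linear(768, 768, bias=True).cuda()  # drop-in FP8-aware layer
inp = torch.randn(32, 768, device="cuda")

with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    out = model(inp)
out.sum().backward()
```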
Megatron-DeepSpeed
DeepSpeed version of NVIDIA's Megatron-LM, adding support for features such as MoE model training, curriculum learning, and 3D parallelism.
nanotron
A minimalistic library for 3D-parallel training of large language models.
torchtune
A native PyTorch library for LLM fine-tuning.
Axolotl
Open-source framework for fine-tuning and evaluating LLMs. It simplifies the process of experimenting with different training configurations and makes it easy to reproduce and share results, supporting features like LoRA, QLoRA, DeepSpeed, PEFT, and multi-GPU setups.
NeMo Framework
A generative AI framework built for researchers and PyTorch developers working on large language models (LLMs), multimodal models (MMs), automatic speech recognition (ASR), text-to-speech (TTS), and computer vision (CV).
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
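The core DeepSpeed pattern is to wrap an existing PyTorch model with deepspeed.initialize and drive training through the returned engine; a minimal sketch, assuming a toy model and an inline config (real runs are typically launched with the deepspeed CLI launcher and a tuned config):

```python
# Minimal DeepSpeed loop; the tiny Linear model and inline config are
# placeholders for a real LLM and a tuned ds_config.
import torch
import deepspeed

model = torch.nn.Linear(10, 1)  # stand-in for a real model

ds_config = {
    "train_batch_size": 8,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)

for _ in range(10):
    x = torch.randn(8, 10, device=model_engine.device)
    y = torch.randn(8, 1, device=model_engine.device)
    loss = torch.nn.functional.mse_loss(model_engine(x), y)
    model_engine.backward(loss)  # engine-managed backward (ZeRO, mixed precision)
    model_engine.step()          # optimizer step + LR scheduling
```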
BMTrain
Efficient training for big models.
GPT-NeoX
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
torchtitan
A native PyTorch library for large model training.
OpenRLHF
An easy-to-use, scalable, and high-performance RLHF framework (70B+ PPO full tuning, iterative DPO, LoRA, RingAttention, RFT).
LitGPT
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
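Alongside its CLI recipes, LitGPT exposes a small Python API; a sketch based on its README quickstart (the checkpoint name is illustrative and must first be downloaded via LitGPT):

```python
# LitGPT Python API sketch, following the README quickstart;
# the checkpoint name is illustrative.
from litgpt import LLM

llm = LLM.load("microsoft/phi-2")
text = llm.generate("Explain 3D parallelism in one sentence.", max_new_tokens=64)
print(text)
```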
Mesh TensorFlow
A language for distributed deep learning that makes model parallelism easier.
Meta Lingua
A lean, efficient, and easy-to-hack codebase for LLM research.
Megatron-LM
Ongoing research on training transformer models at scale.
maxtext
A simple, performant, and scalable JAX LLM.