Agents
2,229Autonomous AI agents that perform tasks independently
Open-Source Toolkit for Efficient Unstructured Data Processing with Pre-built Modules and Local to Cluster Scalability.
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
A powerful tool for creating high-quality training datasets for Large Language Models
a lightweight LLM evaluation suite that Hugging Face has been using internally.
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
a lean, efficient, and easy-to-hack codebase to research LLMs.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
DeepSpeed version of NVIDIA's Megatron-LM that adds additional support for several features such as MoE model training, Curriculum Learning, 3D Parallelism, and others.
Generative AI framework built for researchers and PyTorch developers working on Large Language Models (LLMs), Multimodal Models (MMs), Automatic Speech Recognition (ASR), Text to Speech (TTS), and Computer Vision (CV) domains.