 - A repo for distributed training of language models with Reinforcement Learning via Human Feedback. (RLHF)
Cross-referenced across 55 tracked directories
#315
Popularity Rank
1 / 55
Listed In
Emerging
Adoption Stage
3d
Listed For
Recently added to the ecosystem
 - A repository of Stanford Alpaca project, a model fine-tuned from the LLaMA 7B model on 52K instruction-following demonstrations.
 - The cli tool to run LLaMA on the local machine.
LLM application: Pythia-1|1.4|2.8|6.9|12B
 - A large language model trained on the Databricks Machine Learning Platform.