Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Cross-referenced across 55 tracked directories
#315
Popularity Rank
1 / 55
Listed In
Emerging
Adoption Stage
3d
Listed For
Recently added to the ecosystem
Score: 100/100
0 dependency vulnerabilities found
Open-Source Toolkit for Efficient Unstructured Data Processing with Pre-built Modules and Local to Cluster Scalability.
A powerful tool for creating high-quality training datasets for Large Language Models