malteos
@malteos
Research engineer: Datasets, information retrieval, representation learning, LLMs, scientific & legal document processing
@commoncrawl Berlin, Germany On GitHub since June 2014
1
Published Tools
64
Total Stars
0
Weekly Downloads
168
GitHub Followers
86
Public Repos
100/100
Avg Security
Published Tools
1 Skillacross 1 categoryllm-datasets
Malte Ostendorff <malte.ostendorff@dfki.de>
A
A collection of datasets for language model training including scripts for downloading, preprocesssing, and sampling.
Skilluncategorised
641 dir