Generate consolidated text files from websites for LLM training and inference – Powered by Firecrawl
Cross-referenced across 55 tracked directories
#3803
Popularity Rank
1 / 55
Listed In
Emerging
Adoption Stage
3/13/2026
First Seen
Recently added to the ecosystem
"Outclassing Frontier LLMs in Information Extraction"
A high-quality tool for convert PDF to Markdown and JSON
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
A document understanding API