Document Processing
2839 AI tools in the Document Processing category
@hypercard-ai/hyper-jump
GitHub Actions
Document viewer built for RAG
ai-pdf-builder
tytaninc7
AI-powered PDF generator for legal docs, pitch decks, and reports. Generates SAFEs, NDAs, term sheets, and whitepapers from Markdown. Works with Claude, Cursor, GPT, Copilot, and OpenClaw agents. Run with npx ai-pdf-builder
media2md
mrcv
Convert images to structured Markdown with AI-generated descriptions and extracted text
@liustack/markpress
liustack
CLI for AI agents to convert Markdown into WeChat MP-ready HTML with inline styles, base64 images, and tag sanitization
@dooor-ai/cortexdb
bruno353
Official TypeScript/JavaScript SDK for CortexDB - Multi-modal RAG Platform with advanced document processing
@cherrystudio/embedjs
cherry.ai
A Node.js RAG framework for working with LLMs and custom datasets
@letter-ai/lector
remi-letterai
Headless PDF viewer for React
@heripo/document-processor
kimhongyeon
Document processor with LLM-based analysis for heripo engine
react-native-pageindex
satyam_appscale
Vectorless, reasoning-based RAG — builds a hierarchical tree index from PDF, DOCX, CSV, XLSX or Markdown using any LLM. React Native compatible.
askprisma-skill
whiteboardmonk
Business data analysis skill for Claude Code, Gemini CLI, Codex, OpenCode and other AI coding agents.
coverme-scanner
slack-ai
AI-powered security scanner with 33 agents including AI-generated code detection. STRIDE/DREAD scoring, adversarial review, professional PDF reports.
@leo56/simple-redact-skill
leo56
Simple Redact - CLI tool for redacting sensitive information in PDFs
markpdfdown
jorbenzhu
A high-quality PDF-to-Markdown tool based on large language model visual recognition.
extract2md
hashangit
Client-side PDF-to-Markdown conversion with OCR and optional LLM rewrite. Core dependencies bundled for offline use.
zerox
tylermaran
OCR documents using gpt-4o-mini
hazo_llm_api
pubs12
Wrapper for calling different LLMs, with built-in prompt management
wkhtmltopdf
zxlin
A wrapper for wkhtmltopdf, the WebKit-based HTML-to-PDF converter
pdf-parse-new
simone.gosetto
Pure JavaScript cross-platform module to extract text from PDFs with AI-powered optimization and multi-core processing.
@liustack/modlens
liustack
CLI tool to provide visual understanding for non-vision LLMs
pdf-brain
joelhooks
Local PDF & Markdown knowledge base with semantic search, AI enrichment, and SKOS taxonomy