Document Processing
2876AI tools in the Document Processing category
@liustack/markpress
liustack
CLI for AI agents to convert Markdown into WeChat MP-ready HTML with inline styles, base64 images, and tag sanitization
react-native-vision-camera-ocr-plus
GitHub Actions
React Native Vision Camera plugin for on-device text recognition (OCR) and translation using ML Kit. Maintained fork of react-native-vision-camera-text-recognition
...moredocs2llm
al-ignat
Convert any document into LLM-ready text. PDF, DOCX, PPTX, XLSX, web pages, images, emails, and 75+ other formats — straight to clean Markdown you can paste into ChatGPT, Claude, or Gemini. Also converts Markdown back into DOCX, PPTX, or HTML.
...morecordova-plugin-ml-text
jatahworx
cordova plugin for mobile ocr text recognition
@juit/qrcode
GitHub Actions
A modern QR code generator for JavaScript
@ismaelmoreiraa/vision-camera-ocr
ismaelmoreiraa
VisionCamera Frame Processor Plugin to provide OCR support
@doclo/providers-llm
valdisbarrett
Core LLM provider utilities, types, and schema translation for Doclo SDK
expo-mlkit-ocr
henry3646
React Native module for text recognition using Google's ML Kit
@capacitor-community/image-to-text
robingenz
Image to Text (OCR) Plugin for Capacitor
@condorhero/vuepress-plugin-export-pdf-core
condorhero
The Core of VuePress and VitePress exports PDF plugin
vision-camera-ocr
aarongrider
VisionCamera Frame Processor Plugin to provide OCR support
ddddocr-node
GitHub Actions
The JS version of DdddOcr
@nova-mind-cloud/pdf-parser-mcp
gdm-pixel
MCP Server for PDF parsing and content extraction
md2pdf2
areai51
Convert Markdown to PDF using customizable templates
media2md
mrcv
Convert images to structured markdown with AI-generated descriptions and extracted text
document-generator-mcp
thiago-oliveira
MCP Server para gerar documentos Word e PDF a partir de requisições de agentes IA
@casemark/thurgood
max-casemark
Thurgood CLI - Legal engineer AI agent powered by Case.dev. Build legal applications with document processing, OCR, semantic search, vaults, and transcription APIs.
...moregemini-multimodal
nicopreme
Gemini multimodal skill for Claude Code - video, PDF, image analysis & generation via browser cookies
pdf-best-practices
mabreum
Comprehensive guidelines for creating HTML that renders perfectly as PDF documents
@cherrystudio/embedjs
cherry.ai
A NodeJS RAG framework to easily work with LLMs and custom datasets