>_Skillful

Document Processing

2858 AI tools in the Document Processing category

@lov3kaizen/agentsea-ingest

lov3kaizen

Comprehensive document processing pipeline for Node.js - PDF, DOCX, HTML, Markdown parsing with intelligent chunking, table/image extraction, and OCR

Skill | Document Processing
1 dir

exine

nicolumma

Universal Markdown extraction engine (CLI)

Skill | Document Processing
1 dir

@saemhco/nestjs-html-pdf

saemhco

A NestJS module to generate PDF files from HTML

Skill | Document Processing
24 | 1 dir

@cherrystudio/mac-system-ocr

dejeune

Node.js N-API native module for macOS Vision Framework OCR

Skill | Document Processing
20 | 1 dir

pdfjs-dist-dj

miraclesol

Generic build of Mozilla's PDF.js library.

Skill | Document Processing
1.3K | 1 dir

paperflow-mcp

davidson11

MCP server that lets Claude process PDFs through a self-hosted PaperFlow backend with smart parser selection, token-saving summaries, and structured Markdown output.

MCP Server | Document Processing
1 dir

pageindex-ts

tandava0060

LLM-agnostic document indexing for js/ts - bring your own LLM and text

Skill | Document Processing
5 | 1 dir

@frasma/extractify

frasma

Functional utilities to extract, transform and flow your data

Skill | Document Processing
1 dir

cordova-plugin-scanbot-sdk

scanbot

Cordova Plugin for the Scanbot Document and Barcode Scanner SDK

Skill | Document Processing
1 dir

suitest-js-api

GitHub Actions

Suitest is a test automation and device manipulation tool for living room devices and web browsers.

Skill | Document Processing
10 | 1 dir

node-tesseract-ocr

zapolnoch

A Node.js wrapper for the Tesseract OCR API

Skill | Document Processing
319 | 1 dir

ddddocr-node

GitHub Actions

The JS version of DdddOcr

Skill | Document Processing
38 | 1 dir

documentation-hub

diatech

A modern document processing and session management desktop application

Skill | Document Processing
1 dir

md-to-pdf

simonhaenisch

CLI tool for converting Markdown files to PDF.

Skill | Document Processing
1.7K | 1 dir

browse-the-web

asayman

AI Browser Automation API - Control Headless Chrome via RESTful HTTP endpoints. Perfect for web scraping, RPA, automated testing, and AI agent integration with 70+ endpoints including screenshots, PDF generation, network monitoring, and more.

Agent | Document Processing
1 | 1 dir

node-ts-ocr

nicolaspearson

A simple wrapper around command-line utils to assist in PDF / Image OCR (Optical Character Recognition) processing using Tesseract.

Skill | Document Processing
8 | 1 dir

undms

xcvzmoon

Text and Metadata Extraction Library for Document Files with Text Similarity Comparison

Skill | Document Processing
4 | 1 dir

webdriver-image-comparison

wdio-user

An image comparison module for Node.js test automation frameworks that support the WebDriver protocol

Skill | Document Processing
152 | 1 dir

makepdf

jcormont

Opinionated Markdown-to-PDF converter

Skill | Document Processing
4 | 1 dir

textract

dbashford

Extracts text from files of various types, including html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf, text/*, and various OpenOffice formats.

Skill | Document Processing
1.7K | 1 dir