>_Skillful

Document Processing

2859 AI tools in the Document Processing category

@nosferatu500/textract

nosferatu500

Extracts text from files of various types, including html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf, text/*, and various OpenOffice formats.

Skill · Document Processing
11 · 1 dir

pdfdataextract

lublak

Extract data from a PDF with pure JavaScript.

Skill · Document Processing
31 · 1 dir

pdf-parse-new

simone.gosetto

Pure JavaScript cross-platform module to extract text from PDFs with AI-powered optimization and multi-core processing.

Skill · Document Processing
20 · 1 dir

@trohde/excal-cli

trohde

Agent-first CLI for Excalidraw scene inspection, validation, and rendering

Skill · Document Processing
1 dir

@grapecity/activereports

mescius

ActiveReportsJS

Skill · Document Processing
1 dir

pdf_read_down_load_id_tell_you_i_love_you_but_then_id_

noahbrown64

Download or Read ePub/pdf EPUB [Download] I'd Tell You I Love You, But Then I'd Have to Kill You (Gallagher Girls, #1) By Ally Carter on Textbook New Volumes

Skill · Document Processing
1 dir

react-pdftotext

utkarsh212

A simple, lightweight React package to extract plain text from a PDF file.

Skill · Document Processing
24 · 1 dir

@cbcruk/vision-ocr

cbcruk

Extract text from images using macOS Vision Framework OCR

Skill · Document Processing
1 dir

flowsquire

miit-daga

Local-first automation platform for organizing files on your computer. No cloud, no AI, no subscriptions; just simple WHEN → DO workflows.

Skill · Document Processing
7 · 1 dir

@briansunter/z-cli

briansunter

Unified Z.AI CLI - image generation, OCR, and code research

MCP Server · Document Processing
7 · 1 dir

@gherk/requirements-extractor

formonkey

MCP server that extracts, classifies and generates structured requirements from PDF documents with heuristic scanning and active validation

MCP Server · Document Processing
1 dir

easy-pdf-parser

luochen1990

A lightweight, promise-style, functional wrapper around pdf2json for easily extracting text from PDFs.

Skill · Document Processing
4 · 1 dir

suitest-js-api

GitHub Actions

Suitest is a test automation and device manipulation tool for living room devices and web browsers.

Skill · Document Processing
10 · 1 dir

node-tesseract-ocr

zapolnoch

A Node.js wrapper for the Tesseract OCR API

Skill · Document Processing
319 · 1 dir

documentation-hub

diatech

A modern document processing and session management desktop application

Skill · Document Processing
1 dir

md-to-pdf

simonhaenisch

CLI tool for converting Markdown files to PDF.

Skill · Document Processing
1.7K · 1 dir

browse-the-web

asayman

AI Browser Automation API - Control Headless Chrome via RESTful HTTP endpoints. Perfect for web scraping, RPA, automated testing, and AI agent integration with 70+ endpoints including screenshots, PDF generation, network monitoring, and more.

Agent · Document Processing
1 · 1 dir

node-ts-ocr

nicolaspearson

A simple wrapper around command-line utils to assist in PDF / Image OCR (Optical Character Recognition) processing using Tesseract.

Skill · Document Processing
8 · 1 dir

undms

xcvzmoon

Text and Metadata Extraction Library for Document Files with Text Similarity Comparison

Skill · Document Processing
4 · 1 dir

webdriver-image-comparison

wdio-user

An image comparison module that can be used with different Node.js test automation frameworks that support the WebDriver protocol.

Skill · Document Processing
152 · 1 dir