Data Extraction
667AI tools in the Data Extraction category
node-html-crawler
safonovpro
Crawler (spider) of site web pages by domain name
n8n-nodes-evomi
evomi
n8n community node for Evomi Scraper API - Web scraping with intelligent mode selection
@stellarbeat/js-stellar-node-crawler
pieterjan84
Crawl the network for nodes
reviewbr-mcp
vic3m
MCP Server for Brazilian Academic Repositories (OAI-PMH, DSpace REST, HTML scraping) and PRISMA Systematic Reviews
json-web-crawler
knovour
Crawl website by json
evomi-client
evomi
JavaScript client for Evomi API
@hmb-research/x-ray-crawler
tsopic
x-ray's crawler
@askjo/camofox-browser
askjo
Headless browser automation server and OpenClaw plugin for AI agents - anti-detection, element refs, and session isolation
...morenest-crawler
saltyshiomix
An easiest crawling and scraping module for NestJS
@aduptive/instagram-scraper
aduptive
Modern TypeScript library for collecting public Instagram content with smart delays, mobile-first approach, and media support
...moretrawl-4
tudorilisoi
A full-fledged node.js web crawler with a MySQL backend
ayakashi
zisismaras
The next generation web scraping framework
fb-assistant-ts
toshiodev
a facebook puppeteer manipulate library
web-structure
kilicmu
A powerful and flexible web scraping library with concurrent processing and DOM hierarchy awareness
ai-search-indexer
cruonit
Website content indexer using Mozilla Readability and Playwright
rebrowser-patches
nwebson
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.
...moremcp-web-scrape
mukul975
Clean, cached web content for agents—Markdown + citations
unsurf
acoyfellow
Turn any website into a typed API
sl-dbmaria
putraadtya26
A powerful web scraping tool for everything
@screenshotbuddy/node-curl-impersonate
screenshotbuddy
A wrapper around cURL-impersonate, a binary which can be used to bypass TLS fingerprinting.