Data Extraction
683AI tools in the Data Extraction category
json-crawl
udamir
Async and sync crawler for json object
zenrows
anderrv
ZenRows Node SDK
nautiljon-scraper-mod
junkofly
Nautiljon's anime and manga website scraping tool
@expertcomptabledev/impots.gouv.bot
pierrebourdu
A bot to crawl impots.gouv.fr
crawlee-storage-extensions
jetmar
Package for Apify/Crawlee that allows to store encrypted text values into the Storages
test-drone
atsepkov
Test and web-scraping framework for the lazy
n8n-nodes-crawl-and-scrape2
adamkylegreen
n8n custom node to crawl and scrape website
crawlee-one
juro-oravec
Production-ready web scraping in a single function call. Built on Crawlee. Data transforms, caching, privacy compliance, and error tracking -- out of the box.
...more@watercrawl/nodejs
amir.asaran
Node.js client for WaterCrawl crawler
@kadoa/cli
gabrielkadoa
Kadoa CLI — manage web scraping workflows from the terminal
@duyquangnvx/story-spider
duyquangnvx
A TypeScript library for scraping stories from various Vietnamese websites
@algolia/netlify-plugin-crawler
h1fra
This plugin links your Netlify site with Algolia's Crawler. It will trigger a crawl on each successful build.
href-type
zeke
Test whether an href string is absolute, relative, protocol-relative, #fragment, mailto:, tel:, sms:, etc
scra
astur
Really simple HTTP client. Mainly for scraping but not only.
@open-automaton/cheerio-mining-engine
khrome
A web scraping engine to minimize resource consumption
@faouzkk/tiktok-dl
faouz995
A module for downloading TikTok videos by the URL
xnxx-scraper
nimesh-official
Xnxx Search and information scraper
camofox-browser
redf0x1
Anti-detection browser server for AI agents — REST API wrapping Camoufox engine with OpenClaw plugin support
slurp-ai
ratacat
A CLI tool for scraping and compiling documentation or other multi page content from websites and NPM packages into a single markdown file.
...morewalkscape-helper
rikurb8
WalkScape helper - wiki scraping and AI-powered Q&A