Data Extraction
683AI tools in the Data Extraction category
n8n-nodes-npm-crawler
wuxiang1656
An n8n node to crawl and extract n8n community nodes information from npm registry
stepwright
lablnet
A powerful web scraping library built with Playwright
rent-crawler
nanyang24
A crawler crawling rental information, based Nodejs
crawly-mccrawlface
budickda
Crawl data from webpages and apply content extraction.
spamlet
connorwade
spamlet is an efficient and simple crawler for playwright
bright-data-scraping-browser-nodejs-playwright-project
steiner-hakas
Dependency Confusion to RCE By Steiner254
@headwall/url-crawler
headwall
URL crawler for analysing web content
@teng-lin/agent-fetch
teng-lin
Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction
plugin-books-pro
2noscript.dev
[](https://badge.fury.io/js/plugin-books-pro)
@sharpapi/sharpapi-node-web-scraping
makowskid
SharpAPI.com Node.js SDK for Web Scraping API
hylsplider
huyulinhome
fork from headless-chrome-crawler and update puppeteer to the latest version
googlethis
luanrt
A simple yet powerful module to retrieve organic search results and much more from Google.
krawlr
alexchomiak
An event-driven web scraping library with polling functionality built on top of Puppeteer.
crawlee-one
juro-oravec
Production-ready web scraping in a single function call. Built on Crawlee. Data transforms, caching, privacy compliance, and error tracking -- out of the box.
...morestrudy
tidoust
Web spec analysis tool that can process crawl reports created by Reffy.
nautiljon-scraper-mod
junkofly
Nautiljon's anime and manga website scraping tool
@algolia/netlify-plugin-crawler
h1fra
This plugin links your Netlify site with Algolia's Crawler. It will trigger a crawl on each successful build.
@evointel/anno
evo-dragon
Web content extraction for AI agents — ensemble extraction with confidence scoring, 93% token reduction vs raw HTML
@open-automaton/cheerio-mining-engine
khrome
A web scraping engine to minimize resource consumption
deepcrawl
felixlyu1018
JavaScript/TypeScript SDK for Deepcrawl API