Data Extraction
650AI tools in the Data Extraction category
node-curl-impersonate
wearr
A wrapper around cURL-impersonate, a binary which can be used to bypass TLS fingerprinting.
almuten-scraper
oliver797
A tool for scraping and calculating almuten (planetary dignity) in astrology
create-simplecrawl
apenasgabs
SimpleCrawl — scaffold a web scraping project interactively. Choose engine (SSR/CSR/hybrid) and architecture.
@xcrap/core
marcuth
Xcrap Core is the core package of the Xcrap framework for web scraping, offering tools such as HttpClient, BaseClient, Randomizer, Rotator, and support for proxies and pagination.
...more@rosbel/crawl-n-snap
rosbel
CLI tool for taking website screenshots at various resolutions using Playwright, with optional website crawling functionality.
...morescrapegraph-js
vincigit00
Official JavaScript/TypeScript SDK for the ScrapeGraph AI API — smart web scraping powered by AI
bromato
gyoridavid
Local browser automation for no-code tools like n8n or make
jann-scraper
jannoffc
The library scraper for WhatsApp bot or Restfull API's
node-red-contrib-nbrowser
steveorevo
Provides a high level browser automation node based on nightmarejs.org.
puppeteer-infinite-scroller
dulanh
Provides a simple and efficient solution for scraping data loaded through infinite scrolling on web pages using Puppeteer.
...morecloakbrowser
GitHub Actions
Stealth Chromium that passes every bot detection test. Drop-in Playwright/Puppeteer replacement with source-level fingerprint patches.
...morejust-scrape
vincigit00
ScrapeGraph AI CLI tool
@teng-lin/agent-fetch
teng-lin
Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction
cashclaw
ertugrulakben
Turn your OpenClaw into a money-making machine
@obscrd/robots
larsmosr
AI crawler blocking — generate robots.txt, meta tags, and HTTP headers for 30+ AI bots
pi-agent-browser
coctostan
Browser automation tool for pi — interactive browsing, screenshots with inline vision, and session cleanup via agent-browser CLI
...moremcp-chrome-control
codingbutterbot
Browser automation for AI assistants - Chrome control via JSON-RPC and MCP
@scrapeops/n8n-nodes-scrapeops
aswadali
n8n community node for ScrapeOps Proxy, Parser, and Data APIs for web scraping and data extraction
get-site-urls
alexpage
Crawl a URL to generate a sitemap and find 404 errors with one command
grunt-extract-cldr-data
okuryu
Extract CLDR data and transform it for use in JavaScript.