Data Extraction
703 AI tools in the Data Extraction category
scrappey
demonmartin
Introducing Scrappey, your comprehensive website scraping solution provided by Scrappey.com. With Scrappey's powerful and user-friendly API, you can effortlessly retrieve data from websites, including those protected by Cloudflare. Join Scrappey today and ...
@teng-lin/agent-fetch
teng-lin
Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction
@obscrd/robots
larsmosr
AI crawler blocking — generate robots.txt, meta tags, and HTTP headers for 30+ AI bots
better-browse
mylesiyabor
Zero-dependency browser automation via Chrome DevTools Protocol with ARIA accessibility snapshots — 10-100x cheaper than vision-based approaches
the-a11y-machine
hywan
The A11y Machine is an automated accessibility testing tool which crawls and tests all pages of any website.
@omindu/scrapely
omindulk
Declarative web scraping toolkit with schema-driven extraction, pagination, caching, and data export
slurp-ai
ratacat
A CLI tool for scraping and compiling documentation or other multi-page content from websites and NPM packages into a single markdown file.
newspaperjs
flickzcode
News extraction and scraping. Article Parsing
walkscape-helper
rikurb8
WalkScape helper - wiki scraping and AI-powered Q&A
@evointel/anno
evo-dragon
Web content extraction for AI agents — ensemble extraction with confidence scoring, 93% token reduction vs raw HTML
cashclaw
ertugrulakben
Turn your OpenClaw into a money-making machine
@octivas/mcp
soeffing
MCP server for Octivas web scraping, crawling, and search API
@cle-does-things/scpr
cle-does-things
Simple and intuitive CLI tool and MCP server to perform web scraping operations.
proxys-site
urready
The official open-source codebase for Proxys.Site - A comprehensive proxy comparison tool and list.
maxun-sdk
karishmashukla
Maxun Node SDK for web scraping and data extraction
@cd39390/mcp-web-crawler
cd39390
An MCP server plugin to crawl all hyperlinks from a website for AI learning purposes.
devbridge-styleguide
devbproto
Styleguide automatization tool.
@askjo/camofox-browser
askjo
Headless browser automation server and OpenClaw plugin for AI agents - anti-detection, element refs, and session isolation
facebook-marketplace-cli
lotrez
CLI tool for Facebook Marketplace and Messenger automation
unfluff
ageitgey
A web page content extractor