Data Extraction
666 AI tools in the Data Extraction category
Spectrawl
FayAndXan
The unified web layer for AI agents. Search (8 engines), stealth browse, auth, and act on 24 platforms. One npm install, self-hosted.
mcp-web-content-pick
kilicmu
A tool for extracting structured content from web pages with customizable selectors and crawling options
mcp-fetch
matiasgf
A Model Context Protocol server providing tools for HTTP requests, GraphQL queries, WebSocket connections, and browser automation
@shyzus/mcp-scrapidou
shyzus
Scrapidou - MCP server for web scraping and URL fetching
barebrowse
hamr0
Authenticated web browsing for autonomous agents via CDP. URL in, pruned ARIA snapshot out.
mcp-server-dumplingai
dumplingai
MCP Server for Dumpling AI providing various data scraping, conversion, and extraction tools
n8n-nodes-headlessx
saifyxpro
n8n community node for HeadlessX v2 API - anti-detection web scraping with Camoufox
puppeteer-vision-mcp-server
djannot
MCP Server for scraping webpages and converting to markdown
@ticktockbent/charlotte
ticktockbent
Token-efficient browser MCP server — structured web pages for AI agents, not raw accessibility dumps
fetchserp-mcp-server
dm0lz
A Model Context Protocol (MCP) server that provides access to FetchSERP API for SEO analysis, SERP data, web scraping, and keyword research. Supports both stdio and HTTP transport modes.
scraperis-mcp
tuanvt
Model Context Protocol (MCP) integration for Scraper.is - A web scraping tool for AI assistants
agentql-mcp
GitHub Actions
Model Context Protocol (MCP) server that integrates AgentQL data extraction capabilities.
sequentum-mcp
sequentum.casey
MCP Server for Sequentum Web Scraping API - Enables AI assistants to interact with Sequentum agents
@promptcloud/n8n-nodes-scrapix
promptcloud
n8n node for Scrapix web scraping API with support for scraping, collecting, crawling, and AI-powered extraction
n8n-nodes-seleniumbase
tranlight
n8n community node for executing Python scraping scripts using SeleniumBase API
@mseep/mcp-smart-crawler
skydeckai
A command-line tool acting as an MCP (ModelContextProtocol) server, using Playwright to crawl web content for AI models.
mcp-smart-crawler
erikloo
A command-line tool acting as an MCP (ModelContextProtocol) server, using Playwright to crawl web content for AI models.
@mcptoolshop/websketch
mikefrilot
CLI for WebSketch IR - render, diff, and fingerprint web UI captures
@expandai/mcp-server
GitHub Actions
MCP server for expand.ai - Give AI agents access to the web
@apitap/core
nibynikt21
Intercept web API traffic during browsing. Generate portable skill files so AI agents can call APIs directly instead of scraping.