Data Extraction
668 AI tools in the Data Extraction category
@iflow-mcp/suthio-brave-deep-research-mcp
chatflowdev
DeepSearch MCP Server with Brave Search API and Puppeteer content extraction
fast-scrap
nicolas-dev-toolbox
A web scraping tool using Puppeteer for efficient HTML content extraction.
n8n-nodes-appmarketscraper
appmarketscraper
AppMarketScraper n8n community node for scraping Shopify App Store data
pi-agent-browser
coctostan
Browser automation tool for pi — interactive browsing, screenshots with inline vision, and session cleanup via agent-browser CLI
ytscr
mohtasimalam
YTSCR (YouTube Scraper) is a versatile tool designed for efficiently scraping YouTube channels' videos, streams, shorts, and video information.
instagram-scraping
rzlyp
npm module for loading media by hashtag without the Instagram API
tanto
cmoncrief
Lightweight web scraping library
webflow-engine
hal-crackbot
Declarative workflows for complex website interactions - The missing piece for AI agent automation
@mcptoolshop/ai-ui
mikefrilot
Automated design diagnostics for SPAs — crawl, diff, verify UI against docs
@houtini/seo-crawler-mcp
richardbaxterseo
Crawl and analyse websites for SEO errors and issues using Crawlee with SQLite storage
@directus-labs/ai-web-scraper-operation
phazonoverload
Use Firecrawl's Web Scraping API to extract data from websites.
nstbrowser-ai-agent
nstbrowser
Nstbrowser AI agent for browser automation with advanced fingerprinting
@crawlee/types
GitHub Actions
Shared types for the crawlee projects
@pixelcop/fetcher-mcp
csarva
MCP server for fetching web content using Playwright browser
metafetch
GitHub Actions
Metafetch fetches a given URL's title, description, images, links etc.
n8n-nodes-playwright-mcp
paolo-trivi
Complete n8n Playwright node with all Microsoft Playwright MCP tools and AI assistant support for advanced browser automation
fb-assistant-ts
toshiodev
A Puppeteer library for automating Facebook
ai-search-indexer
cruonit
Website content indexer using Mozilla Readability and Playwright
webcrawlerapi-js
niiotyo
JS client for WebcrawlerAPI
rebrowser-playwright
nwebson
A drop-in replacement for Playwright, patched with rebrowser-patches. It allows passing modern automation detection tests.