>_Skillful
Need help with advanced AI agent engineering?Contact FirmAdapt

Data Extraction

707

AI tools in the Data Extraction category

octagon-deep-research-mcp

octagonai

MCP server for Deep Research. Provides specialized AI-powered deep research capabilities with no rate limits - faster than ChatGPT Deep Research, more thorough than Grok DeepSearch or Perplexity Deep Research.

...more
MCP ServerData Extraction
841 dir

ai-sdk-agents-universal-scraper-tool

aisdkagents

AI SDK Agents Universal Scraper Tool

SkillData Extraction
41 dir

just-scrape

vincigit00

ScrapeGraph AI CLI tool

SkillData Extraction
1 dir

apify

GitHub Actions

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

...more
SkillData Extraction
1721 dir

scrapix-cli

simiokunowo

A TypeScript-based CLI Application for scraping Google images

SkillData Extraction
1 dir

rebrowser-patches

nwebson

Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.

...more
SkillData Extraction
1.3K1 dir

crawl-server

imike3049

Efficient SEO-focused server for Wasm-generated pages

SkillData Extraction
11 dir

mcp-web-scrape

mukul975

Clean, cached web content for agents—Markdown + citations

MCP ServerData Extraction
41 dir

nstbrowser-ai-agent

nstbrowser

Nstbrowser AI agent for browser automation with advanced fingerprinting

AgentData Extraction
11 dir

parallaxapis-sdk-ts

pxcaptcha

ParallaxAPIs SDK

SkillData Extraction
341 dir

theia-suite

reyhan6610

A large-scale web scraping library for Node.js.

SkillData Extraction
1 dir

playwrightium

analysta

Model Context Protocol server that exposes reusable Playwright actions.

MCP ServerData Extraction
1 dir

rebrowser-playwright-core

nwebson

A drop-in replacement for playwright-core patched with rebrowser-patches. It allows to pass modern automation detection tests.

...more
SkillData Extraction
61 dir

headsman

plenty-of-ish

Uses a headless browser to fully render a webpage and return the final html content.

SkillData Extraction
1 dir

clawpage-mcp

clawpage

MCP server for ClawPage web extraction API. Extract and structure any web page into clean JSON.

MCP ServerData Extraction
1 dir

@seaavey/scapers

seaavey

The Scapers is a collection of tools for scraping data from the web.

SkillData Extraction
1 dir

@apiverve/webimagescraper

charifield

Web Image Scraper is a simple tool for scraping images from a website. It returns the URLs of the images found on the website.

...more
SkillData Extraction
1 dir

@expandai/ai

jlipp

Vercel AI SDK integration for expand.ai - fetch and extract content from any URL

SkillData Extraction
1 dir

@pinkpixel/web-scout-mcp

sizzlebop

MCP server for web search and content extraction with multiple URL support and memory optimizations

MCP ServerData Extraction
1251 dir

@shepherd-terminal/reacher

GitHub Actions

CLI tool for scraping LinkedIn and Google Maps to find businesses by type and location

SkillData Extraction
1 dir