Data Extraction

646

AI tools in the Data Extraction category

All (646)MCP Servers (62)Skills (555)Agents (29)

@ptrumpis/snap-lens-web-crawler

ptrumpis

Crawl and download Snap Lenses from *lens.snapchat.com* with ease.

SkillData Extraction

101 dir

@hej-ai/crawler

glutch

Scrape any webpage into clean markdown

SkillData Extraction

1 dir

the-a11y-machine

hywan

The A11y Machine is an automated accessibility testing tool which crawls and tests all pages of any website.

SkillData Extraction

6321 dir

browserfabric

zomux

BrowserFabric TypeScript SDK - Cloud browser automation API client

AgentData Extraction

1 dir

crawl-cli

felipextrindade

A Node crawler/scrape for retrieving data from websites

SkillData Extraction

1 dir

grabit-engine

imroodydev

A plugin-based engine for scraping media streams and subtitles. Works in Node.js, browsers, React and React Native. Load plugins from GitHub, local files, or code — with caching, health tracking, and auto-updates built in.

...more

SkillData Extraction

1 dir

@aduptive/instagram-scraper

aduptive

Modern TypeScript library for collecting public Instagram content with smart delays, mobile-first approach, and media support

...more

SkillData Extraction

111 dir

@gatesolve/puppeteer-plugin

arson

Automatic CAPTCHA solving for Puppeteer. Detects Cloudflare Turnstile, reCAPTCHA, and hCaptcha challenges and solves them via GateSolve.

...more

AgentData Extraction

1 dir

@hardbulls/wbsc-crawler

arjanfrans

Tool to crawl events, leagues and statistics from WBSC based websites.

SkillData Extraction

1 dir

@hanivanrizky/nestjs-browser-action

hanivanrizky

Puppeteer-based browser automation module for NestJS

SkillData Extraction

1 dir

@teng-lin/agent-fetch

teng-lin

Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction

SkillData Extraction

2141 dir

axe-crawler

tjscollins

A highly configurable website crawler for automatically testing a website for accessibility issues using the axe-core library. Uses selenium and headless Chrome to load pages, inject axe-core, and run tests. Generates an html summary report in addition

...more

SkillData Extraction

261 dir

pinterest-djw

ondarion

Pinterest image search tool using web scraping

SkillData Extraction

1 dir

camofox-browser

redf0x1

Anti-detection browser server for AI agents — REST API wrapping Camoufox engine with OpenClaw plugin support

MCP ServerData Extraction

421 dir

open-web-unlocker

GitHub Actions

Fetch public web pages through a configurable fetch/browser pipeline and parse them into structured JSON or clean markdown.

...more

MCP ServerData Extraction

41 dir

pi-agent-browser

coctostan

Browser automation tool for pi — interactive browsing, screenshots with inline vision, and session cleanup via agent-browser CLI

...more

AgentData Extraction

71 dir

mcp-chrome-control

codingbutterbot

Browser automation for AI assistants - Chrome control via JSON-RPC and MCP

MCP ServerData Extraction

1 dir

@scrapeops/n8n-nodes-scrapeops

aswadali

n8n community node for ScrapeOps Proxy, Parser, and Data APIs for web scraping and data extraction

SkillData Extraction

11 dir

get-site-urls

alexpage

Crawl a URL to generate a sitemap and find 404 errors with one command

SkillData Extraction

351 dir

playread

lanmower

Web content extraction and automation via Playwright MCP

SkillData Extraction

1 dir