>_Skillful

Data Extraction

646 AI tools in the Data Extraction category

@ptrumpis/snap-lens-web-crawler

ptrumpis

Crawl and download Snap Lenses from *lens.snapchat.com* with ease.

Skill · Data Extraction
101 dir

@hej-ai/crawler

glutch

Scrape any webpage into clean markdown

Skill · Data Extraction
1 dir

the-a11y-machine

hywan

The A11y Machine is an automated accessibility testing tool which crawls and tests all pages of any website.

Skill · Data Extraction
6321 dir

browserfabric

zomux

BrowserFabric TypeScript SDK - Cloud browser automation API client

Agent · Data Extraction
1 dir

crawl-cli

felipextrindade

A Node crawler/scraper for retrieving data from websites

Skill · Data Extraction
1 dir

grabit-engine

imroodydev

A plugin-based engine for scraping media streams and subtitles. Works in Node.js, browsers, React and React Native. Load plugins from GitHub, local files, or code — with caching, health tracking, and auto-updates built in.

Skill · Data Extraction
1 dir

@aduptive/instagram-scraper

aduptive

Modern TypeScript library for collecting public Instagram content with smart delays, mobile-first approach, and media support

Skill · Data Extraction
111 dir

@gatesolve/puppeteer-plugin

arson

Automatic CAPTCHA solving for Puppeteer. Detects Cloudflare Turnstile, reCAPTCHA, and hCaptcha challenges and solves them via GateSolve.

Agent · Data Extraction
1 dir

@hardbulls/wbsc-crawler

arjanfrans

Tool to crawl events, leagues and statistics from WBSC-based websites.

Skill · Data Extraction
1 dir

@hanivanrizky/nestjs-browser-action

hanivanrizky

Puppeteer-based browser automation module for NestJS

Skill · Data Extraction
1 dir

@teng-lin/agent-fetch

teng-lin

Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction

Skill · Data Extraction
2141 dir

axe-crawler

tjscollins

A highly configurable website crawler for automatically testing a website for accessibility issues using the axe-core library. Uses Selenium and headless Chrome to load pages, inject axe-core, and run tests. Generates an HTML summary report in addition

...more
Skill · Data Extraction
261 dir

pinterest-djw

ondarion

Pinterest image search tool using web scraping

Skill · Data Extraction
1 dir

camofox-browser

redf0x1

Anti-detection browser server for AI agents — REST API wrapping Camoufox engine with OpenClaw plugin support

MCP Server · Data Extraction
421 dir

open-web-unlocker

GitHub Actions

Fetch public web pages through a configurable fetch/browser pipeline and parse them into structured JSON or clean markdown.

MCP Server · Data Extraction
41 dir

pi-agent-browser

coctostan

Browser automation tool for pi — interactive browsing, screenshots with inline vision, and session cleanup via agent-browser CLI

Agent · Data Extraction
71 dir

mcp-chrome-control

codingbutterbot

Browser automation for AI assistants - Chrome control via JSON-RPC and MCP

MCP Server · Data Extraction
1 dir

@scrapeops/n8n-nodes-scrapeops

aswadali

n8n community node for ScrapeOps Proxy, Parser, and Data APIs for web scraping and data extraction

Skill · Data Extraction
11 dir

get-site-urls

alexpage

Crawl a URL to generate a sitemap and find 404 errors with one command

Skill · Data Extraction
351 dir

playread

lanmower

Web content extraction and automation via Playwright MCP

Skill · Data Extraction
1 dir