Data Extraction

667

AI tools in the Data Extraction category

All (667)MCP Servers (77)Skills (560)Agents (30)

web-structure

kilicmu

A powerful and flexible web scraping library with concurrent processing and DOM hierarchy awareness

SkillData Extraction

1 dir

ai-search-indexer

cruonit

Website content indexer using Mozilla Readability and Playwright

SkillData Extraction

1 dir

n8n-nodes-scraper

oxsr

n8n node for advanced web scraping with multiple extraction modes

SkillData Extraction

1 dir

zenrows

anderrv

ZenRows Node SDK

SkillData Extraction

171 dir

n8n-nodes-bozonx-page-scraper-microservice

bozonx

n8n node for Page Scraper microservice - extract structured content, retrieve HTML, and process URLs in batches

SkillData Extraction

1 dir

n8n-nodes-scrapingfish

maciejw94

n8n community node for Scrapingfish web scraping API

SkillData Extraction

1 dir

walkscape-helper

rikurb8

WalkScape helper - wiki scraping and AI-powered Q&A

SkillData Extraction

1 dir

n8n-nodes-exa-websets

virul

n8n node for Exa Websets API - Create, manage, and query structured datasets from web sources

SkillData Extraction

1 dir

@mihnea.dev/webscraper

mihnea.dev

A robust web scraping library using Playwright for Node.js. This library provides an easy-to-use API for automating web interactions, extracting data, and handling various web scraping tasks efficiently.

...more

SkillData Extraction

21 dir

n8n-nodes-video-crawler

tanmi0609

An n8n node to search and crawl popular short videos from platforms like Douyin

SkillData Extraction

1 dir

crawlee-one

juro-oravec

Production-ready web scraping in a single function call. Built on Crawlee. Data transforms, caching, privacy compliance, and error tracking -- out of the box.

...more

SkillData Extraction

361 dir

rebrowser-patches

nwebson

Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.

...more

SkillData Extraction

1.3K1 dir

mcp-web-scrape

mukul975

Clean, cached web content for agents—Markdown + citations

MCP ServerData Extraction

41 dir

unsurf

acoyfellow

Turn any website into a typed API

SkillData Extraction

81 dir

sl-dbmaria

putraadtya26

A powerful web scraping tool for everything

SkillData Extraction

1 dir

webscrape-gbn

jitu1612

A simple web scraping module. Supported websites for web scraping are BigBasket, Grofers and Natures Basket.

SkillData Extraction

1 dir

component-search2

timaschew

search through crawl components

SkillData Extraction

91 dir

@teng-lin/agent-fetch

teng-lin

Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction

SkillData Extraction

2141 dir

cloudbypass-skill

cloudbypass

穿云API的OpenClaw技能实现，用于绕过Cloudflare等反爬虫保护

SkillData Extraction

1 dir

top-user-agents

kikobeats

An always up-to-date list of the top 100 most common browser user-agents for HTTP requests

AgentData Extraction

3231 dir