Data Extraction
700AI tools in the Data Extraction category
@docchi/scraping-anime-websites-poland
xanax_
Moduł do pobierania linków z popularnych polskich strony z anime
n8n-nodes-video-crawler
tanmi0609
An n8n node to search and crawl popular short videos from platforms like Douyin
omnifetch-lib
visy_ani
Universal content extraction library with tiered fetching strategies
scraping-bee-mcp
zneutro
ScrapingBee MCP server for testing web scraping extract rules
astro-mail-obfuscation
andreas-brunner
Protect email addresses, phone numbers and other sensitive data from bots scraping the source code of your Astro app.
n8n-nodes-scrapingfish
maciejw94
n8n community node for Scrapingfish web scraping API
crawl-fn
vkolluru1974
A utility package for telecom automation and integration. Includes telecom-mas-agent and other useful libraries.
n8n-nodes-web-session-manager
hichamchar
n8n custom node for managing authenticated web sessions with cookie-based authentication
humanoid-js
evyatarmeged
Node.js package to bypass WAF anti-bot JavaScript challenges
@adogrove/adonis-cap
GitHub Actions
Adonis integration for Cap, a lightweight, modern open-source CAPTCHA alternative designed using SHA-256 PoW.
arcfetch
briansunter
Fetch URLs, extract clean article content, and cache as markdown. Supports automatic JavaScript rendering via Playwright.
...morecrawl-obj
vkolluru1974
A utility package for telecom automation and integration. Includes telecom-mas-agent and other useful libraries.
cdp-client-tool
ranmeizi
一个客户端程序,设计了一套socket.io与http接口的通信协议,可以用于控制浏览器,抓取数据等
scrappey
demonmartin
Introducing Scrappey, your comprehensive website scraping solution provided by Scrappey.com. With Scrappey's powerful and user-friendly API, you can effortlessly retrieve data from websites, including those protected by Cloudflare. Join Scrappey today and
...more@coya/web-scraper
coya
Web scraper on top of PhantomJS or Chromium
@mihnea.dev/webscraper
mihnea.dev
A robust web scraping library using Playwright for Node.js. This library provides an easy-to-use API for automating web interactions, extracting data, and handling various web scraping tasks efficiently.
...morexpcrawl
girard.xyz
A versatile Puppeteer-based web crawler with XPath support, pagination, piping, and stealth capabilities.
liquicode_dreamscrape
agbowlin
A dream R&D tool for web scraping.
webscrape-gbn
jitu1612
A simple web scraping module. Supported websites for web scraping are BigBasket, Grofers and Natures Basket.
camofox-browser
redf0x1
Anti-detection browser server for AI agents — REST API wrapping Camoufox engine with OpenClaw plugin support