Data Extraction
651AI tools in the Data Extraction category
scrappey-wrapper
dormic97
Official Node.js wrapper for the Scrappey web scraping API. Bypass Cloudflare, Datadome, PerimeterX, and other antibot protections. Solve captchas automatically.
...moreinstagram-scraping
rzlyp
NPM module for loading media by hashtag without instagram API
opensteer
timjang3
Open-source browser automation SDK and CLI that lets AI agents build complex scrapers directly in your codebase.
dom-parser
ershov-konst
Fast dom parser based on regexps
webcrawlerapi-js
niiotyo
JS client for WebcrawlerAPI
@thinkbrowse/cli
derivativelabs
CLI for controlling browsers via ThinkBrowse cloud and local infrastructure
@activepieces/piece-browserless
abdul_activepiecer
Browserless is a cloud-based browser automation platform that allows you to run full Chrome sessions remotely for tasks like taking screenshots, scraping data, converting pages to PDFs, and more without writing scraping code or managing servers.
...morereviewbr-mcp
vic3m
MCP Server for Brazilian Academic Repositories (OAI-PMH, DSpace REST, HTML scraping) and PRISMA Systematic Reviews
ag-webscrape
GitHub Actions
TypeScript web scraper with Playwright fallback for anti-scraping protection
maxun-sdk
karishmashukla
Maxun Node SDK for web scraping and data extraction
@supadata/mcp
rafalzawadzki
MCP server for Supadata video & web scraping integration. Features include YouTube, TikTok, Instagram, Twitter, and file video transcription, web scraping, batch processing and structured data extraction.
...moreayakashi
zisismaras
The next generation web scraping framework
curl-cffi
tocha688
A powerful HTTP client for Node.js based on libcurl with browser fingerprinting capabilities.
@teng-lin/agent-fetch
teng-lin
Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction
theia-suite
reyhan6610
A large-scale web scraping library for Node.js.
chromancer
johnlindquist
A powerful command-line interface for automating Chrome browser using Playwright. Perfect for web scraping, automation, testing, and browser workflows.
...morereviewweb-cli
mrgoonie
CLI tool for ReviewWeb.site API - create reviews, scrape websites, extract data, SEO insights and more
web-fetch-mcp
mountaintop
MCP server for web content fetching, summarizing, comparing, and extracting information
imperium-crawl
imperiumhub
Open-source CLI tool for web scraping, crawling, search, and custom skills
@tabstack/pilo
tabstack
AI-powered web automation library and CLI tool