Data Extraction
662 AI tools in the Data Extraction category
anchorbrowser
GitHub Actions
The official TypeScript library for the Anchorbrowser API
@emircansahin/ghostfetch
emircansahin
Resilient HTTP client with CycleTLS, proxy rotation, smart error classification, and per-site interceptors
@activepieces/piece-browserless
abdul_activepiecer
Browserless is a cloud-based browser automation platform that allows you to run full Chrome sessions remotely for tasks like taking screenshots, scraping data, converting pages to PDFs, and more without writing scraping code or managing servers.
browser-driver-cli
linc.3395
TypeScript-native browser automation tool for AI agents
headless-chrome-crawler
yuji.isobe
Distributed web crawler powered by Headless Chrome
@tavily/core
guyhartstein
Official JavaScript library for Tavily.
news-extractor-node
siping
A Node.js library for extracting news content from HTML pages using a text density algorithm
@ptrumpis/snap-lens-web-crawler
ptrumpis
Crawl and download Snap Lenses from *lens.snapchat.com* with ease.
@praveen030686/data-apis-mcp
praveen030686
MCP server for x402-powered Crypto, Finance, and Web Extract APIs. 22 tools for AI agents with USDC micropayments on Base.
ag-webscrape
GitHub Actions
TypeScript web scraper with Playwright fallback for anti-scraping protection
crawl-cli
felipextrindade
A Node crawler/scraper for retrieving data from websites
qserp
bijikyu
Robust Node.js module for Google Custom Search with rate limiting, error handling, and offline testing capabilities. Supports parallel searches and comprehensive result formatting.
boloto
orzv
Node.js web crawler
tradingview-scraper
imxeno
A gateway to TradingView's data for your Node.js application!
@ogulcancelik/pi-web-browse
ogulcancelik
Web search and content extraction skill for pi-coding-agent. Search the web and fetch pages via a real headless browser (CDP). Works on Linux, macOS, and Windows.
nstbrowser-ai-agent
nstbrowser
Nstbrowser AI agent for browser automation with advanced fingerprinting
rebrowser-playwright
nwebson
A drop-in replacement for Playwright patched with rebrowser-patches. It lets you pass modern automation detection tests.
sl-dbmaria
putraadtya26
A powerful web scraping tool for everything
component-search2
timaschew
search through crawl components
@mobileproxy/sdk
speedmeter
Official Node.js SDK for MobileProxy.Space API — private mobile proxies on real GSM devices