Data Extraction
700 AI tools in the Data Extraction category
gurkha
monitz87
Data extraction module
@dominusnode/openclaw-plugin
0xcircuitbreaker
Dominus Node proxy plugin for OpenClaw — route web requests through rotating proxy networks
@keak/webmcp-core
eamonnkeak
Auto-generate WebMCP tool definitions from any website
@browserbasehq/convex-stagehand
GitHub Actions
Convex component for AI-powered browser automation with Stagehand
headsman
plenty-of-ish
Uses a headless browser to fully render a webpage and return the final HTML content.
@seaavey/scapers
seaavey
The Scapers is a collection of tools for scraping data from the web.
fb-assistant-ts
toshiodev
A Puppeteer library for manipulating Facebook
@sapkotamadan/cache-server
sapkotamadan
CacheServer is an efficient web page extractor that uses Puppeteer to launch a headless browser and fetch web page content.
@apiverve/webimagescraper
charifield
Web Image Scraper is a simple tool for scraping images from a website. It returns the URLs of the images found on the website.
component-search2
timaschew
search through crawl components
@rtrvr-ai/core
bhavanikalisetty
Core runtime and API client primitives for rtrvr CLI/SDK
web-content-extract
amoyensis
A library and command-line tool to extract clean content from web pages using Mozilla Readability and convert it to Markdown or JSON.
@shepherd-terminal/reacher
GitHub Actions
CLI tool for scraping LinkedIn and Google Maps to find businesses by type and location
magpie-html
anonyfox
Modern TypeScript library for scraping web content with isomorphic support
@brightdata/ai-sdk
brd-cholpon
Bright Data tools for Vercel AI SDK - scrape, search, and dataset collection
@activepieces/piece-scrapegrapghai
abdul_activepiecer
ScrapeGraphAI is a powerful web scraping and content extraction API. This piece enables integration with ScrapeGraphAI's API to perform smart scraping, local scraping, and markdown conversion.
soupselect
Adds CSS selector support to htmlparser for scraping activities - port of soupselect (python)
@docchi/scraping-anime-websites-poland
xanax_
Module for fetching links from popular Polish anime websites
omnifetch-lib
visy_ani
Universal content extraction library with tiered fetching strategies
node-metainspector
gabceb
npm package for web scraping. Give it a URL, and it lets you easily get its title, links, images, description, keywords, meta tags