Data Extraction
683AI tools in the Data Extraction category
@monostate/browsernative-client
andrewmonostate
Browser Native client SDK for web scraping and content extraction API
@keak/webmcp-core
eamonnkeak
Auto-generate WebMCP tool definitions from any website
@browserbasehq/convex-stagehand
GitHub Actions
Convex component for AI-powered browser automation with Stagehand
@activepieces/piece-scrapegrapghai
abdul_activepiecer
## Description ScrapeGraphAI is a powerful web scraping and content extraction API. This piece enables integration with ScrapeGraphAI's API to perform smart scraping, local scraping, and markdown conversion.
...moremagpie-html
anonyfox
Modern TypeScript library for scraping web content with isomorphic support
scrapeyard
anasouardini
A scraping library that saves you from writing a lot of boiler-plate every time you lunch a new project. It also helps you manage multiple projects in one place.
...moreoxylabs-ai-studio
oxybrain
JavaScript SDK for Oxylabs AI Studio API services
peviitor_jsscraper
lalalaurentiu
Lightweight library intended for scraping and interfacing with peviitor.ro
@pinkpixel/web-scout-mcp
sizzlebop
MCP server for web search and content extraction with multiple URL support and memory optimizations
deepspider
pony-ma
智能爬虫工程平台 - 基于 DeepAgents + Patchright 的 AI 爬虫 Agent
@seaavey/scapers
seaavey
The Scapers is a collection of tools for scraping data from the web.
web-content-extract
amoyensis
A library and command-line tool to extract clean content from web pages using Mozilla Readability and convert it to Markdown or JSON.
...more@shepherd-terminal/reacher
GitHub Actions
CLI tool for scraping LinkedIn and Google Maps to find businesses by type and location
@dominusnode/openclaw-plugin
0xcircuitbreaker
Dominus Node proxy plugin for OpenClaw — route web requests through rotating proxy networks
soupselect
Adds CSS selector support to htmlparser for scraping activities - port of soupselect (python)
@brightdata/ai-sdk
brd-cholpon
Bright Data tools for Vercel AI SDK - scrape, search, and dataset collection
raggle-js
raggle_npm
JavaScript client for Raggle API
crawlio-browser
rashidazarang
MCP server with 100 CDP-backed tools for browser automation — screenshots, DOM, network capture, framework detection, cookies, storage, session recording, structured data extraction, performance metrics via Chrome
...morearcfetch
briansunter
Fetch URLs, extract clean article content, and cache as markdown. Supports automatic JavaScript rendering via Playwright.
...morescrappey
demonmartin
Introducing Scrappey, your comprehensive website scraping solution provided by Scrappey.com. With Scrappey's powerful and user-friendly API, you can effortlessly retrieve data from websites, including those protected by Cloudflare. Join Scrappey today and
...more