Data Extraction
671 AI tools in the Data Extraction category
pi-agent-browser
coctostan
Browser automation tool for pi — interactive browsing, screenshots with inline vision, and session cleanup via agent-browser CLI
scrapegraph-js
vincigit00
Official JavaScript/TypeScript SDK for the ScrapeGraph AI API — smart web scraping powered by AI
rebrowser-playwright-core
nwebson
A drop-in replacement for playwright-core patched with rebrowser-patches. It makes it possible to pass modern automation detection tests.
nstbrowser-ai-agent
nstbrowser
Nstbrowser AI agent for browser automation with advanced fingerprinting
@tavily/core
guyhartstein
Official JavaScript library for Tavily.
@bigknoxy/exa-cli
bigknoxy
CLI wrapper for Exa MCP tools - search, crawl, and research from the command line
better-browse
mylesiyabor
Zero-dependency browser automation via Chrome DevTools Protocol with ARIA accessibility snapshots — 10-100x cheaper than vision-based approaches
@teng-lin/agent-fetch
teng-lin
Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction
open-web-unlocker
GitHub Actions
Fetch public web pages through a configurable fetch/browser pipeline and parse them into structured JSON or clean markdown.
maxun-sdk
karishmashukla
Maxun Node SDK for web scraping and data extraction
top-user-agents
kikobeats
An always up-to-date list of the top 100 most common browser user-agents for HTTP requests
rebrowser-puppeteer-core
nwebson
A drop-in replacement for puppeteer-core patched with rebrowser-patches. It makes it possible to pass modern automation detection tests.
x-crawl
coderhxl
x-crawl is a flexible Node.js AI-assisted crawler library.
opencode-agent-browser
crottolo
OpenCode plugin for agent-browser - browser automation with persistent cookies and dev tools
facebook-marketplace-cli
lotrez
CLI tool for Facebook Marketplace and Messenger automation
@obscrd/robots
larsmosr
AI crawler blocking — generate robots.txt, meta tags, and HTTP headers for 30+ AI bots
online-audit
omc345
MCP server for auditing a person's public online presence — Google search, GitHub, Reddit, web scraping
@elizaos/plugin-browser
shawticus
Plugin for browser actions and web scraping
@hyperbrowser/agent
leoscope
Hyperbrowser's Web Agent
@askjo/camofox-browser
askjo
Headless browser automation server and OpenClaw plugin for AI agents - anti-detection, element refs, and session isolation