Data Extraction
667AI tools in the Data Extraction category
quickscrape
blahah
A command line scraping tool for the modern web
clawpage-mcp
clawpage
MCP server for ClawPage web extraction API. Extract and structure any web page into clean JSON.
@monostate/browsernative-client
andrewmonostate
Browser Native client SDK for web scraping and content extraction API
@evointel/anno
evo-dragon
Web content extraction for AI agents — ensemble extraction with confidence scoring, 93% token reduction vs raw HTML
scrapeyard
anasouardini
A scraping library that saves you from writing a lot of boiler-plate every time you lunch a new project. It also helps you manage multiple projects in one place.
...morescrapegraph-js
vincigit00
Official JavaScript/TypeScript SDK for the ScrapeGraph AI API — smart web scraping powered by AI
@olib-ai/owl-browser-sdk
ahstanin
Node.js SDK for Owl Browser automation - Async-first with dynamic OpenAPI method generation
@algolia/netlify-plugin-crawler
h1fra
This plugin links your Netlify site with Algolia's Crawler. It will trigger a crawl on each successful build.
@xcrap/extractor
marcuth
Xcrap Extractor is a package of the Xcrap framework, it was developed to take care of the data extraction part of text files (currently supporting only HTML, JSON and Markdown) using declarative models.
...morepeviitor_jsscraper
lalalaurentiu
Lightweight library intended for scraping and interfacing with peviitor.ro
@radaros/core
bharatbxhipment
Core framework for building AI agents with tools, memory, and multi-model support
h56-github-scrapper
hasyim56
GitHub user scraper
@activepieces/piece-scrapegrapghai
abdul_activepiecer
## Description ScrapeGraphAI is a powerful web scraping and content extraction API. This piece enables integration with ScrapeGraphAI's API to perform smart scraping, local scraping, and markdown conversion.
...more@seaavey/scapers
seaavey
The Scapers is a collection of tools for scraping data from the web.
deepspider
pony-ma
智能爬虫工程平台 - 基于 DeepAgents + Patchright 的 AI 爬虫 Agent
@octivas/mcp
soeffing
MCP server for Octivas web scraping, crawling, and search API
@cle-does-things/scpr
cle-does-things
Simple and intuitive CLI tool and MCP server to perform web scraping operations.
gurkha
monitz87
Data extraction module
@sapkotamadan/cache-server
sapkotamadan
CacheServer is an efficient web page extractor that uses Puppeteer to launch a headless browser and fetch web page content.
...more@rtrvr-ai/core
bhavanikalisetty
Core runtime and API client primitives for rtrvr CLI/SDK