Data Extraction

650

AI tools in the Data Extraction category

All (650)MCP Servers (63)Skills (557)Agents (30)

node-curl-impersonate

wearr

A wrapper around cURL-impersonate, a binary which can be used to bypass TLS fingerprinting.

SkillData Extraction

1 dir

almuten-scraper

oliver797

A tool for scraping and calculating almuten (planetary dignity) in astrology

SkillData Extraction

11 dir

create-simplecrawl

apenasgabs

SimpleCrawl — scaffold a web scraping project interactively. Choose engine (SSR/CSR/hybrid) and architecture.

SkillData Extraction

71 dir

@xcrap/core

marcuth

Xcrap Core is the core package of the Xcrap framework for web scraping, offering tools such as HttpClient, BaseClient, Randomizer, Rotator, and support for proxies and pagination.

...more

SkillData Extraction

11 dir

@rosbel/crawl-n-snap

rosbel

CLI tool for taking website screenshots at various resolutions using Playwright, with optional website crawling functionality.

...more

SkillData Extraction

1 dir

scrapegraph-js

vincigit00

Official JavaScript/TypeScript SDK for the ScrapeGraph AI API — smart web scraping powered by AI

SkillData Extraction

691 dir

bromato

gyoridavid

Local browser automation for no-code tools like n8n or make

SkillData Extraction

121 dir

jann-scraper

jannoffc

The library scraper for WhatsApp bot or Restfull API's

SkillData Extraction

1 dir

node-red-contrib-nbrowser

steveorevo

Provides a high level browser automation node based on nightmarejs.org.

SkillData Extraction

341 dir

puppeteer-infinite-scroller

dulanh

Provides a simple and efficient solution for scraping data loaded through infinite scrolling on web pages using Puppeteer.

...more

SkillData Extraction

21 dir

cloakbrowser

GitHub Actions

Stealth Chromium that passes every bot detection test. Drop-in Playwright/Puppeteer replacement with source-level fingerprint patches.

...more

AgentData Extraction

7361 dir

just-scrape

vincigit00

ScrapeGraph AI CLI tool

SkillData Extraction

1 dir

@teng-lin/agent-fetch

teng-lin

Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction

SkillData Extraction

2141 dir

cashclaw

ertugrulakben

Turn your OpenClaw into a money-making machine

AgentData Extraction

1011 dir

@obscrd/robots

larsmosr

AI crawler blocking — generate robots.txt, meta tags, and HTTP headers for 30+ AI bots

SkillData Extraction

141 dir

pi-agent-browser

coctostan

Browser automation tool for pi — interactive browsing, screenshots with inline vision, and session cleanup via agent-browser CLI

...more

AgentData Extraction

71 dir

mcp-chrome-control

codingbutterbot

Browser automation for AI assistants - Chrome control via JSON-RPC and MCP

MCP ServerData Extraction

1 dir

@scrapeops/n8n-nodes-scrapeops

aswadali

n8n community node for ScrapeOps Proxy, Parser, and Data APIs for web scraping and data extraction

SkillData Extraction

11 dir

get-site-urls

alexpage

Crawl a URL to generate a sitemap and find 404 errors with one command

SkillData Extraction

351 dir

grunt-extract-cldr-data

okuryu

Extract CLDR data and transform it for use in JavaScript.

SkillData Extraction

71 dir