Data Extraction

655

AI tools in the Data Extraction category

All (655)MCP Servers (65)Skills (559)Agents (31)

plato-cli

shinpads

A client for the Plato API

SkillData Extraction

1 dir

playwright-afp

paleksic

Stop website fingerprinting techniques playwright edition

SkillData Extraction

161 dir

spider-browser

jeffmendez

Browser automation client for Spider's pre-warmed browser fleet with smart retry and browser switching

SkillData Extraction

1 dir

request-group-puppeteer

gabrielenunez

Simplifies requesting for puppeteer instances and sending mulitple puppeteer request at the same time

SkillData Extraction

1 dir

postcss-obfuscator

n4j!b-r4ch!d

PostCSS plugin that helps you protect your CSS code by obfuscating class names and ids. with customizable configuration.

SkillData Extraction

1861 dir

instagram-profilecrawl

nacimgoura

Quickly crawl the information (e.g. followers, tags, etc...) of an instagram profile. No login required!

SkillData Extraction

1251 dir

crawly-ai

mateosanchezl

A simple, lightweight AI web scraping tool.

SkillData Extraction

21 dir

puppeteer-dsl

311ecode

An intuitive DSL for Puppeteer, simplifying web automation and testing. Currently in alpha, subject to changes.

SkillData Extraction

1 dir

@hillwoodpark/gcp-logger

timjohns

Logger that creates messages in a format that is roughly compatible with Google Cloud Platform log-scraping in App Engine, Google Cloud Functions, and probably several other services

...more

SkillData Extraction

11 dir

puppeteer-afp-with-vendor

xiloe

Stop website fingerprinting techniques

SkillData Extraction

1 dir

unfluffjs

yknx4

A web page content extractor

SkillData Extraction

21 dir

terminal-scrapearange

wolfram77

Terminal interface implementation for ranged web scraping.

SkillData Extraction

1 dir

just-scrape

vincigit00

ScrapeGraph AI CLI tool

SkillData Extraction

1 dir

@teng-lin/agent-fetch

teng-lin

Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction

SkillData Extraction

2141 dir

cloudbypass-skill

cloudbypass

穿云API的OpenClaw技能实现，用于绕过Cloudflare等反爬虫保护

SkillData Extraction

1 dir

@ghx-dev/core

GitHub Actions

GitHub execution router for AI agents with deterministic routing and normalized output.

SkillData Extraction

61 dir

@tabstack/pilo

tabstack

AI-powered web automation library and CLI tool

SkillData Extraction

1 dir

getcontentapi

stabem

Official TypeScript/Node.js SDK for ContentAPI — extract content from any URL

SkillData Extraction

1 dir

scrapix-cli

simiokunowo

A TypeScript-based CLI Application for scraping Google images

SkillData Extraction

1 dir

@dmsdc-ai/aigentry-dustcraw

duckyoung_kim

Airborne signal absorber — collects floating public data (RSS/API/web) and feeds aigentry-brain

SkillData Extraction

1 dir