>_Skillful
Need help with advanced AI agent engineering?Contact FirmAdapt

Data Extraction

667

AI tools in the Data Extraction category

@xcrap/extractor

marcuth

Xcrap Extractor is a package of the Xcrap framework, it was developed to take care of the data extraction part of text files (currently supporting only HTML, JSON and Markdown) using declarative models.

...more
SkillData Extraction
11 dir

crawl-obj

vkolluru1974

A utility package for telecom automation and integration. Includes telecom-mas-agent and other useful libraries.

SkillData Extraction
1 dir

nautiljon-scraper-mod

junkofly

Nautiljon's anime and manga website scraping tool

SkillData Extraction
161 dir

crawl-dir

vkolluru1974

A utility package for telecom automation and integration. Includes telecom-mas-agent and other useful libraries.

SkillData Extraction
1 dir

@cle-does-things/scpr

cle-does-things

Simple and intuitive CLI tool and MCP server to perform web scraping operations.

MCP ServerData Extraction
101 dir

transparent-proxy

gr3p

Real transparent HTTP-Proxy-Server. Upstream your requests whatever you want!

SkillData Extraction
1 dir

deepcrawl

felixlyu1018

JavaScript/TypeScript SDK for Deepcrawl API

SkillData Extraction
5591 dir

browser-tls-fetch

dan1ve

fetch-compatible HTTP client with TLS fingerprinting

SkillData Extraction
1 dir

gurkha

monitz87

Data extraction module

SkillData Extraction
51 dir

koonjs

scrapehub

Browser-impersonating HTTP client with TLS/HTTP2 fingerprint spoofing

SkillData Extraction
1 dir

@rahulxf/random-user-agent

rahulxf

Generate random user agent

AgentData Extraction
41 dir

hrequests-js

jwriter20

TypeScript port of hrequests library - Full-featured HTTP client with TLS fingerprinting and browser automation

SkillData Extraction
1 dir

x-crawl

coderhxl

x-crawl is a flexible Node.js AI-assisted crawler library.

SkillData Extraction
1.8K1 dir

sl-dbmaria

putraadtya26

A powerful web scraping tool for everything

SkillData Extraction
1 dir

@absahmad/wreq-js

GitHub Actions

Node.js/TypeScript HTTP client with browser TLS fingerprint impersonation (JA3/JA4). Bypass Cloudflare and anti-bot detection. Rust-powered, fetch()-compatible.

...more
SkillData Extraction
1 dir

component-search2

timaschew

search through crawl components

SkillData Extraction
91 dir

@teng-lin/agent-fetch

teng-lin

Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction

SkillData Extraction
2141 dir

cloudbypass-skill

cloudbypass

穿云API的OpenClaw技能实现,用于绕过Cloudflare等反爬虫保护

SkillData Extraction
1 dir

top-user-agents

kikobeats

An always up-to-date list of the top 100 most common browser user-agents for HTTP requests

AgentData Extraction
3231 dir

rebrowser-playwright-core

nwebson

A drop-in replacement for playwright-core patched with rebrowser-patches. It allows to pass modern automation detection tests.

...more
SkillData Extraction
61 dir