>_Skillful
Need help with advanced AI agent engineering?Contact FirmAdapt

Data Extraction

698

AI tools in the Data Extraction category

@tavily/core

guyhartstein

Official JavaScript library for Tavily.

SkillData Extraction
781 dir

supapup

onepointfour-packs

⚡ Lightning-fast MCP browser dev tool. Navigate → Get instant structured data. No screenshots needed! Puppeteer: 📸 → CSS selectors → JS eval. Supapup: semantic IDs ready to use. 10x faster, 90% fewer tokens.

...more
MCP ServerData Extraction
1 dir

@mdream/nuxt

GitHub Actions

Nuxt module for converting HTML pages to Markdown using mdream

SkillData Extraction
8221 dir

dom-parser

ershov-konst

Fast dom parser based on regexps

SkillData Extraction
1111 dir

ts-web-scraper

chrisbreuer

A powerful web scraper for both static and client-side rendered sites using only Bun native APIs

SkillData Extraction
101 dir

@bigknoxy/exa-cli

bigknoxy

CLI wrapper for Exa MCP tools - search, crawl, and research from the command line

SkillData Extraction
1 dir

reviewbr-mcp

vic3m

MCP Server for Brazilian Academic Repositories (OAI-PMH, DSpace REST, HTML scraping) and PRISMA Systematic Reviews

MCP ServerData Extraction
1 dir

machinepack-http

eashaw

Send HTTP requests, scrape webpages, and stream data in your JavaScript/Node.js/Sails.js app with a simple, `jQuery.get()`-like interface for sending HTTP requests and processing server responses.

...more
SkillData Extraction
51 dir

open-web-unlocker

GitHub Actions

Fetch public web pages through a configurable fetch/browser pipeline and parse them into structured JSON or clean markdown.

...more
MCP ServerData Extraction
41 dir

better-browse

mylesiyabor

Zero-dependency browser automation via Chrome DevTools Protocol with ARIA accessibility snapshots — 10-100x cheaper than vision-based approaches

...more
SkillData Extraction
1 dir

ayakashi

zisismaras

The next generation web scraping framework

SkillData Extraction
2171 dir

the-a11y-machine

hywan

The A11y Machine is an automated accessibility testing tool which crawls and tests all pages of any website.

SkillData Extraction
6321 dir

maxun-sdk

karishmashukla

Maxun Node SDK for web scraping and data extraction

SkillData Extraction
1 dir

proxys-site

urready

The official open-source codebase for Proxys.Site - A comprehensive proxy comparison tool and list.

SkillData Extraction
1 dir

@cd39390/mcp-web-crawler

cd39390

An MCP server plugin to crawl all hyperlinks from a website for AI learning purposes.

MCP ServerData Extraction
1 dir

@teng-lin/agent-fetch

teng-lin

Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction

SkillData Extraction
2141 dir

@obscrd/robots

larsmosr

AI crawler blocking — generate robots.txt, meta tags, and HTTP headers for 30+ AI bots

SkillData Extraction
141 dir

facebook-marketplace-cli

lotrez

CLI tool for Facebook Marketplace and Messenger automation

SkillData Extraction
1 dir

devbridge-styleguide

devbproto

Styleguide automatization tool.

SkillData Extraction
1.4K1 dir

unfluff

ageitgey

A web page content extractor

SkillData Extraction
2.2K1 dir