>_Skillful

Data Extraction

683

AI tools in the Data Extraction category

n8n-nodes-npm-crawler

wuxiang1656

An n8n node to crawl and extract n8n community nodes information from npm registry

Skill | Data Extraction
1 dir

stepwright

lablnet

A powerful web scraping library built with Playwright

Skill | Data Extraction
51 dir

rent-crawler

nanyang24

A crawler for rental information, based on Node.js

Skill | Data Extraction
21 dir

crawly-mccrawlface

budickda

Crawl data from webpages and apply content extraction.

Skill | Data Extraction
11 dir

spamlet

connorwade

spamlet is an efficient and simple crawler for Playwright

Skill | Data Extraction
1 dir

bright-data-scraping-browser-nodejs-playwright-project

steiner-hakas

Dependency Confusion to RCE By Steiner254

Skill | Data Extraction
1 dir

@headwall/url-crawler

headwall

URL crawler for analysing web content

Skill | Data Extraction
1 dir

@teng-lin/agent-fetch

teng-lin

Full-content web fetcher with Chrome TLS fingerprinting and multi-strategy content extraction

Skill | Data Extraction
2141 dir

plugin-books-pro

2noscript.dev

Skill | Data Extraction
1 dir

@sharpapi/sharpapi-node-web-scraping

makowskid

SharpAPI.com Node.js SDK for Web Scraping API

Skill | Data Extraction
1 dir

hylsplider

huyulinhome

Fork of headless-chrome-crawler with Puppeteer updated to the latest version

Skill | Data Extraction
1 dir

googlethis

luanrt

A simple yet powerful module to retrieve organic search results and much more from Google.

Skill | Data Extraction
3691 dir

krawlr

alexchomiak

An event-driven web scraping library with polling functionality built on top of Puppeteer.

Skill | Data Extraction
1 dir

crawlee-one

juro-oravec

Production-ready web scraping in a single function call. Built on Crawlee. Data transforms, caching, privacy compliance, and error tracking -- out of the box.

Skill | Data Extraction
361 dir

strudy

tidoust

Web spec analysis tool that can process crawl reports created by Reffy.

Skill | Data Extraction
111 dir

nautiljon-scraper-mod

junkofly

Nautiljon's anime and manga website scraping tool

Skill | Data Extraction
161 dir

@algolia/netlify-plugin-crawler

h1fra

This plugin links your Netlify site with Algolia's Crawler. It will trigger a crawl on each successful build.

Skill | Data Extraction
2631 dir

@evointel/anno

evo-dragon

Web content extraction for AI agents — ensemble extraction with confidence scoring, 93% token reduction vs raw HTML

Agent | Data Extraction
1 dir

@open-automaton/cheerio-mining-engine

khrome

A web scraping engine to minimize resource consumption

Skill | Data Extraction
1 dir

deepcrawl

felixlyu1018

JavaScript/TypeScript SDK for Deepcrawl API

Skill | Data Extraction
5591 dir