Data Extraction
682AI tools in the Data Extraction category
n8n-nodes-scrapingfish
maciejw94
n8n community node for Scrapingfish web scraping API
@coya/web-scraper
coya
Web scraper on top of PhantomJS or Chromium
xpcrawl
girard.xyz
A versatile Puppeteer-based web crawler with XPath support, pagination, piping, and stealth capabilities.
@mihnea.dev/webscraper
mihnea.dev
A robust web scraping library using Playwright for Node.js. This library provides an easy-to-use API for automating web interactions, extracting data, and handling various web scraping tasks efficiently.
...moreliquicode_dreamscrape
agbowlin
A dream R&D tool for web scraping.
crawlio-browser
rashidazarang
MCP server with 100 CDP-backed tools for browser automation — screenshots, DOM, network capture, framework detection, cookies, storage, session recording, structured data extraction, performance metrics via Chrome
...morewalkscape-helper
rikurb8
WalkScape helper - wiki scraping and AI-powered Q&A
@omindu/scrapely
omindulk
Declarative web scraping toolkit with schema-driven extraction, pagination, caching, and data export
webscrape-gbn
jitu1612
A simple web scraping module. Supported websites for web scraping are BigBasket, Grofers and Natures Basket.
@kadoa/cli
gabrielkadoa
Kadoa CLI — manage web scraping workflows from the terminal
@xcrap/extractor
marcuth
Xcrap Extractor is a package of the Xcrap framework, it was developed to take care of the data extraction part of text files (currently supporting only HTML, JSON and Markdown) using declarative models.
...morescra
astur
Really simple HTTP client. Mainly for scraping but not only.
crawl-cli-tool
abdo-el-mobayad
A CLI tool for web crawling with auto-discovery, recursive crawling, and markdown output
mdsecure
modderlls
ModderSecure SDK for secure data and backend encryption and decryption. Provides robust AES-256 GCM encryption, secure key management, and premium features for enhanced API security and data privacy.
...morezenrows
anderrv
ZenRows Node SDK
krawlr
alexchomiak
An event-driven web scraping library with polling functionality built on top of Puppeteer.
strudy
tidoust
Web spec analysis tool that can process crawl reports created by Reffy.
@algolia/netlify-plugin-crawler
h1fra
This plugin links your Netlify site with Algolia's Crawler. It will trigger a crawl on each successful build.
newspaperjs
flickzcode
News extraction and scraping. Article Parsing
nautiljon-scraper-mod
junkofly
Nautiljon's anime and manga website scraping tool