>_Skillful
Need help with advanced AI agent engineering?Contact FirmAdapt

Data Extraction

654

AI tools in the Data Extraction category

@harvestapi/scraper

xorcuit

HarvestAPI provides LinkedIn data scraping tools for real-time, high-performance scraping at a low cost. API allows to search for Linkedin `jobs`, `companies`, `profiles`, and `posts` using a wide range of filters.

...more
SkillData Extraction
1 dir

node-web-crawler

jaykshah

Node Web Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they are downloaded, asynchronously. Scraping should be simple and fun!

...more
SkillData Extraction
1 dir

@ogulcancelik/pi-web-browse

ogulcancelik

Web search and content extraction skill for pi-coding-agent. Search the web and fetch pages via a real headless browser (CDP). Works on Linux, macOS, and Windows.

...more
SkillData Extraction
191 dir

codehs_grades

randomdevd3v

This is a [NodeJS](https://nodejs.org/) tool using the [Puppeteer](https://developers.google.com/web/tools/puppeteer) headless browser to crawl the [CodeHS](https://codehs.com) code teaching platform for a teacher's students' grades.

...more
SkillData Extraction
1 dir

@xcrap/got-scraping-client

marcuth

Xcrap Got Scraping Client is a package of the Xcrap framework that implements an HTTP client using the Got Scraping library.

...more
SkillData Extraction
21 dir

site-crawl

raj1000

A CLI tool to recursively crawl websites and download content

SkillData Extraction
1 dir

crawly-mccrawlface

budickda

Crawl data from webpages and apply content extraction.

SkillData Extraction
11 dir

mycrawl

zunkun

craw a definite web

SkillData Extraction
1 dir

@cap.js/widget

tiagozip

Client-side widget for Cap, a lightweight, modern open-source CAPTCHA alternative designed using SHA-256 PoW.

SkillData Extraction
5.1K1 dir

twitter-crawler

herchu

NodeJS Crawler for Twitter

SkillData Extraction
101 dir

adex-linkedin-scrapper

adefemigreat

Flexible linkedin scrapper developed by Adefemigreat

SkillData Extraction
1 dir

@hyperbrowser/sdk

leoscope

Node SDK for Hyperbrowser API

SkillData Extraction
1 dir

rebrowser-puppeteer-core

nwebson

A drop-in replacement for puppeteer-core patched with rebrowser-patches. It allows to pass modern automation detection tests.

...more
SkillData Extraction
331 dir

getcontentapi

stabem

Official TypeScript/Node.js SDK for ContentAPI — extract content from any URL

SkillData Extraction
1 dir

apify

GitHub Actions

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

...more
SkillData Extraction
1721 dir

@mihnea.dev/recaptcha-solver

mihnea.dev

[![npm version](https://img.shields.io/npm/v/@mihnea.dev/recaptcha-solver.svg)](https://www.npmjs.com/package/@mihnea.dev/recaptcha-solver) [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

...more
SkillData Extraction
331 dir

@letsscrapedata/scraper

letsscrapedata

Web scraper that scraping web pages by LetsScrapeData XML template

SkillData Extraction
31 dir

@nampham1106/search-cli

nampham1106

A modern TypeScript CLI tool for web search and content fetching powered by DuckDuckGo

SkillData Extraction
1 dir

scrapix-cli

simiokunowo

A TypeScript-based CLI Application for scraping Google images

SkillData Extraction
1 dir

camofox-browser

redf0x1

Anti-detection browser server for AI agents — REST API wrapping Camoufox engine with OpenClaw plugin support

MCP ServerData Extraction
421 dir