Search results

1745 packages found

crawler

exact match

Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they are downloaded, asynchronously

published 1.5.0 3 months ago
M
Q
P

Search for anything on web.

published 1.1.3 8 years ago
M
Q
P

Stealth mode: Applies various techniques to make detection of headless puppeteer harder.

published 2.11.2 a year ago
M
Q
P

A library for efficiently walking a directory recursively

published 2.0.0 9 months ago
M
Q
P

This library provides support for traversing objects and their values while providing information on the traversal state, pathing to target values, and the ability to manipulate said pathing to easily move to related values.

published 1.2.0 4 months ago
M
Q
P

A snazzy light Node.js image crawler laced with TypeScript goodness! 🕵️🦾

published 1.2.8 10 months ago
M
Q
P

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published 3.7.3 3 months ago
M
Q
P

protect your content from scraping

published 2.1.1 2 months ago
M
Q
P

This repository contains a list of of HTTP user-agents used by robots, crawlers, and spiders as in single JSON file.

published 1.0.132 7 days ago
M
Q
P

Pure javascript cross-platform module to extract text from PDFs.

published 1.2.0 9 months ago
M
Q
P

[![NPM](https://nodei.co/npm/botium-crawler.png?downloads=true&downloadRank=true&stars=true)](https://nodei.co/npm/botium-crawler/)

published 0.0.23 a month ago
M
Q
P

xvideos.com api implementation.

published 1.6.4 5 days ago
M
Q
P

An attestate crawler strategy to download and transform Ethereum block event logs

published 0.4.4 7 days ago
M
Q
P

🤖/👨‍🦰 Recognise bots/crawlers/spiders using the user agent string.

published 5.1.4 14 days ago
M
Q
P

Lightweight async scraper for Google News

published 1.2.2 a month ago
M
Q
P

a web crawler based on crawlee, use file to cache result. Easy to maintain as Singleton Service.

published 1.1.0 8 months ago
M
Q
P

Pure javascript cross-platform module to extract text from PDFs.

published 1.0.0 3 months ago
M
Q
P

A tiny node module to detect spiders/crawlers quickly and comes with optional middleware for ExpressJS

published 2.0.1 10 months ago
M
Q
P

Pure javascript cross-platform module to extract text from PDFs.

published 1.1.1 5 years ago
M
Q
P

crawl youtube without api key (search videos channels or get all channel/playlist's videos)

published 3.3.3 a month ago
M
Q
P