Search results

7 packages found

simple polite crawling of the web.

published 5.1.2 8 years ago
M
Q
P

A web crawler for Nodejs.

published 0.8.2 10 years ago
M
Q
P

SyphonX is a tool that extracts data from HTML data, transforming it into JSON of any shape or size. It combines the power of CSS Selectors and jQuery, Regular Expressions, and Javascript into a declarative template format to elegantly solve the simplest

published 1.2.66 3 months ago
M
Q
P

Streaming pdf fetcher for academic papers.

published 0.0.3 11 years ago
M
Q
P
M
Q
P

A 2nd generation spider to crawl any article site, automatic reading title and content.

published 0.0.7 8 years ago
M
Q
P

Scalable, extensible, web crawler framework.

published 0.0.0 11 years ago
M
Q
P