Search results
157 packages found
Distributed web crawler powered by Headless Chrome
Moving or backing up your WordPress site to Blogger
Node.js module for crawling the web
A damn simple tool to extract JSON-LD metadata from a webpage using a jQuery-like API (jQuery, Cheerio, CashDOM, ...).
A way to make your web application crawlable, so it can be well referenced on the web.
plosone.org scraper
- papermonk
- plos
- plos one
- plosone.org
- public library of science
- papers
- pdfs
- academic articles
- academic papers
- scholarly articles
- scholarly papers
- journals
- scraping
Runs crawling routines defined in a JSON file using XPath, accepting for each step a callback function that receives the value and can pass it on to the next step.
One API to scrape All the Web.
crawlx is a Lightweight web crawler with powerful plugins!
A simple web scraping tool built for developers that can be utilized on both the client and server.
A Node.js scraping framework built on puppeteer-extra (to use a headless Chrome/Chromium browser). Has the ability to solve reCaptcha
Lightweight crawler written in TypeScript using ES6 generators.
A lightweight and simple API for web crawling built on chromium puppeteer
Generate a sitemap javascript object from the folder structure crawling HTML files only.
A Wight backend for fetching static web pages
tiny-crawler is a web crawler.
Providers are the core of applications, where the subtitles are collected. Each provider exports a unique strategy for gathering data. From legendastv's web scraping to opensubtitle API usage, you can collect subtitles from your favorite tv shows and mo
Site content parser for popular websites with fallback to Open Graph and Twitter Cards