Search results
46 packages found
Node.js client library for PDFDATA.io (PDF data extraction as a service)
Turn links into beautiful previews.
A command line tool to extract data from SQL Server and save each row as a JSON file
Extracts data from a markdown document.
A web scraper built with OpenAI GPT to extract structured data from HTML pages based on a given schema.
simple-link-previewer is a Node package that retrieves data from a link, such as its title, description, and images, as a preview of the page's content.
This module should be installed in any webmiddle application, as it provides the machinery to parse JSX, access to the rootContext and other goodies.
NPM Module and Command Line Tool for HTML-data Extraction
> Component that transforms a JSON resource into another JSON resource by using JSONSelect.
> Component that transforms an HTML or XML resource into a JSON resource by using Cheerio.
> Component built on top of the request library, used to perform HTTP requests.
> Executes a sequence of tasks, piping the result of a task to the next task.
> Executes multiple tasks concurrently.
Parse files and metadata using Tika.
- data
- documents
- documents parsing
- pdf extraction
- sugarcube
- sugarcube plugin
- sugarcube-plugin
- tika
- transformation
> Wrapper on top of the tough-cookie library, acts as a cookie jar.
Get preview data (title, description, image, domain name, favicon) from a URL. The library uses a Puppeteer headless browser to scrape the website.
> Similar to the HttpRequest component, but it uses Puppeteer to fetch HTML pages.
> A component that makes a task resumable by caching the result.