doc-recog

1.1.6 • Public • Published

Text-based document recognition.

This project recognizes documents and extracts data from them using predefined templates. Each template defines key elements that, when found in the document text, help determine the document type. Once a document has been recognized, it is possible to extract specific information from it, or to tag it (using a QR code, for example).
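The template idea described above can be illustrated with a minimal sketch. The template shape (`type`, `elements`), the function names `scoreTemplate` and `recognize`, and the 0.5 threshold are all assumptions made for illustration; they are not doc-recog's actual template format or API.

```javascript
// Hypothetical template shape, for illustration only (not doc-recog's real format).
const templates = [
  { type: "invoice", elements: ["invoice number", "total due", "bill to"] },
  { type: "receipt", elements: ["receipt", "cash", "change"] },
];

// Fraction of a template's key elements found in the text (case-insensitive).
function scoreTemplate(text, template) {
  const haystack = text.toLowerCase();
  const hits = template.elements.filter((el) => haystack.includes(el.toLowerCase()));
  return hits.length / template.elements.length;
}

// Return the best-scoring template, or null if none reaches the threshold.
function recognize(text, candidates, threshold = 0.5) {
  let best = null;
  for (const template of candidates) {
    const score = scoreTemplate(text, template);
    if (score >= threshold && (best === null || score > best.score)) {
      best = { type: template.type, score };
    }
  }
  return best;
}

const sample = "ACME Corp\nInvoice Number: 1042\nBill To: Jane Doe\nTotal Due: $99.00";
console.log(recognize(sample, templates));
```

Scoring by the share of matched elements is only one possible design; a real implementation might weight elements differently or use regular expressions instead of substring checks.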

If the input is a PDF file, doc-recog processes the text output of pdftotext. If no text is found, or if the input is an image file, the text produced by OCR with tesseract is used as input for the whole process instead.
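The fallback described above could be sketched roughly as follows. This is not doc-recog's own code: the function names `pdfToText`, `ocrToText`, and `extractText` are hypothetical, and the sketch assumes the `pdftotext` and `tesseract` command-line tools are installed and on the PATH.

```javascript
const { execFile } = require("node:child_process");
const { promisify } = require("node:util");
const path = require("node:path");

const run = promisify(execFile);

// Run `pdftotext <file> -`, which writes the extracted text to stdout.
async function pdfToText(file) {
  const { stdout } = await run("pdftotext", [file, "-"]);
  return stdout;
}

// Run `tesseract <file> stdout`, which writes the OCR result to stdout.
// Assumption: the file is an image tesseract can read; a scanned PDF would
// have to be rasterized first, which this sketch does not do.
async function ocrToText(file) {
  const { stdout } = await run("tesseract", [file, "stdout"]);
  return stdout;
}

// Try pdftotext for PDFs; fall back to OCR when no text is found
// or when the input is not a PDF. The extractors are injectable for testing.
async function extractText(file, { pdf = pdfToText, ocr = ocrToText } = {}) {
  if (path.extname(file).toLowerCase() === ".pdf") {
    const text = await pdf(file);
    if (text.trim().length > 0) return text;
  }
  return ocr(file);
}

// Usage (requires the external tools):
// extractText("scan.png").then((text) => console.log(text));
```

Passing the extractors as options keeps the fallback logic separate from the external tools, so it can be exercised without pdftotext or tesseract being installed.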

Install: npm i doc-recog
Weekly Downloads: 5
Version: 1.1.6
License: ISC
Unpacked Size: 105 kB
Total Files: 16
Collaborators: zepspaiva