Search results
134 packages found
Setup to parse data from uploaded OCR text
Extracts all the tr rows for a given page of the bills table on the Hess Energy website https://www.hessenergy.com
logins for the DocParse system
mark a scraped document as parsed after the supplier specific parsing completes
Generate a unique _id for a bill to use when saving to couchdb
Generate a unique _id for invoices by taking a sha1 hash of the supplierCode, billingSupplierCode, billID, fromDate, and toDate fields of an invoice
Test if the current cheerio loaded html refers to the account homepage on the NStar supplier website
Get the current bill indices like "1-7 of 514 Natural Gas Invoices" in the header of the bills table on the Hess Energy website https://hessenergy.com
get existing bills in database
Test if the current cheerio parsed html is the bills table page on the Hess Energy website https://hessenergy.com
Extract all profile values that exist for a given customer login on the https://hessenergy.com website
Load and scrape all bills across all pages of the bills table for a single utility account on the Hess Energy website https://hessenergy.com
Test if there are still more pages to scrape in the bills table (i.e. a next link appears at the bottom of the table)
fetch details about a pdf document using the sha1 hash value
check if a given bill already exists in the docparse database using the docparse server REST api
mongoose database connection for the docparse project
after parsing an upload and finding all matching bills, store the discovered relationships in the database
save scraper data to a new invoice document in the database
allow node based scrapers to add new data via the docparse api
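
The invoice _id description above (a sha1 hash of the supplierCode, billingSupplierCode, billID, fromDate, and toDate fields) could be sketched in Node.js roughly as follows. This is only an illustration, not the listed package's actual code: the function name generateInvoiceId, the field order, the "|" separator, and the sample values are all assumptions.

```javascript
const crypto = require('crypto');

// Fields named in the package description; the order used here is an assumption.
const ID_FIELDS = ['supplierCode', 'billingSupplierCode', 'billID', 'fromDate', 'toDate'];

// Hypothetical helper: builds a deterministic _id from the invoice fields.
function generateInvoiceId(invoice) {
  const parts = ID_FIELDS.map((name) => {
    if (invoice[name] === undefined || invoice[name] === null) {
      throw new Error('missing invoice field: ' + name);
    }
    return String(invoice[name]);
  });
  // The "|" separator is an assumption; it keeps adjacent fields from running together.
  return crypto.createHash('sha1').update(parts.join('|'), 'utf8').digest('hex');
}

// Made-up sample invoice, for illustration only.
const exampleInvoice = {
  supplierCode: 'HESS',
  billingSupplierCode: 'HESS',
  billID: '12345',
  fromDate: '2013-01-01',
  toDate: '2013-01-31',
};

console.log(generateInvoiceId(exampleInvoice));
```

Because the hash depends only on these five fields, scraping the same invoice twice yields the same _id, which is what makes it usable for spotting bills that are already in the database.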