We (IT Startup) have already implemented Automated Text Extraction Solution/Tool from pdfs/images( Invoices/POs, statements, etc) which include tables/line items. Our solution also provides User Interface as well as the API with endpoints to generate JSON We can integrate API with your website from where invoices will be loaded into AI OCR engine and processed. The whole API solution will reside on your infra under your ownership.
Our solution -
- Hybrid Docker Solution works on Cloud or hosted on any web platform
- API or UI or Batch(backgrd) Processing workflow.
- API or UI can be used to submit multiple docs/images for both modes - AL/ML based algo(Auto flow) as well as Template based (Semi-Auto flow).
- Supports multiple input file formats (pdf, gif, tif, etc)
- Supports output file format like json, csv,xml as well save to database
- Output Viewer/Dashboard to allow user to see the processed data extract as well as original docs and for verification, edits if any and final confirmation.
- User Access Mgmt
Technology - Tesseract/Google Vision for OCR, OpenCV, Python, ReactJS, NodeJS, Docker/Container, Google Colab or any GPU platform for ML Model training /AI, Yolo deep learning algo
We can show you demo of it or look at this link -
Brief - [login to view URL]
Detail - [login to view URL]
Similar Solution given to clients in Europe, Middleeast, India, etc. Reference can be provided.