Looking for someone to build Python software that takes an old scanned PDF book containing a mix of multi-language text and images and runs it through Google Vision OCR to digitize all the text.
* Software must be able to perform OCR on a whole book or on just a few pages for testing.
* Software needs to be multi-threaded and have the option to bulk OCR all files in a folder.
* Software must be able to handle possible interruptions and continue the work from where it stopped.
* Software must be able to keep the structure of the book, for example paragraphs and image locations.
* Software must be able to reduce the size of the book, since it is digitizing all the text and not just adding a text layer.
* Finally, a GUI would be preferred.
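
As a rough illustration of the multi-threading, bulk-folder, few-pages-for-testing, and resume requirements above, here is a minimal sketch and not a finished design. It assumes each page has already been exported as its own image file in one folder, and that the output goes to a separate folder. The names `ocr_folder`, `ocr_page`, and `done.json` are hypothetical; `ocr_page` stands in for whatever function actually sends the page bytes to Google Vision and returns the recognized text.

```python
import json
import threading
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def load_done(checkpoint):
    """Return the names of files finished in an earlier run (empty if none)."""
    if not checkpoint.exists():
        return set()
    return set(json.loads(checkpoint.read_text(encoding="utf-8")))


def save_done(checkpoint, done):
    """Write to a temp file, then rename, so an interruption cannot corrupt it."""
    tmp = checkpoint.with_name(checkpoint.name + ".tmp")
    tmp.write_text(json.dumps(sorted(done)), encoding="utf-8")
    tmp.replace(checkpoint)


def ocr_folder(folder, out_dir, ocr_page, workers=4, limit=None):
    """OCR every not-yet-processed file in `folder` using a thread pool.

    ocr_page: hypothetical callable taking page bytes and returning text.
    limit:    process only the first N pending files (for test runs).
    Returns the sorted names of the files processed in this run.
    """
    folder, out_dir = Path(folder), Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    checkpoint = out_dir / "done.json"  # hypothetical checkpoint file name
    done = load_done(checkpoint)

    # Skip anything recorded as finished, so a restart resumes the work.
    pending = [p for p in sorted(folder.iterdir())
               if p.is_file() and p.name not in done]
    if limit is not None:
        pending = pending[:limit]

    lock = threading.Lock()

    def work(path):
        text = ocr_page(path.read_bytes())
        (out_dir / (path.name + ".txt")).write_text(text, encoding="utf-8")
        # Record progress only after the output for this page is on disk.
        with lock:
            done.add(path.name)
            save_done(checkpoint, done)
        return path.name

    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sorted(pool.map(work, pending))
```

Calling `ocr_folder("pages", "out", ocr_page, limit=5)` would process only five pages as a trial; calling it again without `limit` would pick up the remaining pages and skip the five already recorded. Threads suit this job because the time is spent waiting on network calls to the OCR service, not on local computation.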
12 freelancers have bid an average of ₹27542 for this job
Greetings, I have read the project description. I recently worked on a similar project ("project for BPO") and I am interested in the work. Open a chat to discuss the requirements in detail.