Including Text Recognition in App

rylodom · November 7, 2018, 12:38pm

Hey guys!
So my plan is to create a real-time text recognition app.

To start out, I am currently trying to make a simple on-device (not via a cloud API) OCR app which uses a photo taken from the CameraView, extracts the text and then displays it on the screen. I looked into possible solutions a lot and came to the conclusion that Tesseract might be the best choice. Now, since I am coming from the web dev world, I have no expertise in working with foreign code. With ObjC I did not even know where to start. It feels like I would need to spend much, much more time learning ObjC and Uno in order to solve a very rare issue. Being optimistic, I believe there has to be a better and more efficient way to do this, and that’s why I am asking you for help.

I found out that Tesseract actually also provides a JS version: https://github.com/naptha/tesseract.js#tesseractjs
The problem is that I am not really sure how to use npm packages in Fuse, or if it’s even gonna work anyway, since it probably depends strongly on Node.js core features…

Would love to hear your thoughts! Thanks

aeq · November 7, 2018, 9:26pm

Just had a quick look at this now to see how far I’d get, here’s where I got to:
OCRApp.zip (161.1 KB)

I was basically trying to process a base64 image as a test. If you can get that going, then getting an image from the CameraView isn’t a big stretch.

Got stuck on the worker.js part, but hopefully this gives you a boost to get started. All the best and lemme know how it goes.

I see someone also asked a similar Q: Writing app with stylus input and OCR

aeq · November 8, 2018, 5:33am

This will probably be better and more performant: https://firebase.google.com/docs/ml-kit/recognize-text

remnnis_scion · November 8, 2018, 5:41pm

The new Firebase ML Kit is definitely the way to go. Tesseract has long since been deprecated.

rylodom · November 8, 2018, 5:59pm

Thanks so much! Unfortunately I was not able to get it working yet…

rylodom · November 8, 2018, 6:05pm

Thanks. Thought about that too. The problem is that ML Kit only provides Swift/ObjC code, and judging by some YouTube tutorials you also have to do quite a lot in Xcode. Tbh it is very hard for me to get into ObjC, let alone implementing it via Uno in Fuse…

aeq · November 8, 2018, 6:21pm

ML Kit does Java too, but you’re probably looking for a very high-level way to go about it, right?

rylodom · November 8, 2018, 6:39pm

Yes, I saw that too. Indeed, I am.

aeq · November 8, 2018, 7:04pm

Ok, in that case, I would POST my image to Google’s Vision API directly for ML analysis. Their huge server farms give you more performance than a single device could ever yield…
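Roughly what that could look like in JS, as a minimal sketch rather than tested app code: it assumes `fetch` is available in your JS environment, that you already have the photo as a base64 string, and that you have your own API key. The helper names (`buildOcrRequest`, `extractText`, `recognizeText`) are made up for illustration.

```javascript
// Sketch only: the API key and the base64 image are placeholders you supply.
var VISION_URL = "https://vision.googleapis.com/v1/images:annotate";

// Build the JSON body for a single TEXT_DETECTION request.
function buildOcrRequest(base64Image) {
  return {
    requests: [
      {
        image: { content: base64Image },
        features: [{ type: "TEXT_DETECTION" }]
      }
    ]
  };
}

// Pull the recognized text out of the response, or "" if nothing was found.
function extractText(visionResponse) {
  var first = (visionResponse.responses || [])[0] || {};
  if (first.fullTextAnnotation && first.fullTextAnnotation.text) {
    return first.fullTextAnnotation.text;
  }
  var annotations = first.textAnnotations || [];
  return annotations.length > 0 ? annotations[0].description : "";
}

// POST the image and resolve with the recognized text.
function recognizeText(base64Image, apiKey) {
  return fetch(VISION_URL + "?key=" + encodeURIComponent(apiKey), {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(buildOcrRequest(base64Image))
  })
    .then(function (res) { return res.json(); })
    .then(extractText);
}
```

You would call something like `recognizeText(myBase64Photo, MY_API_KEY).then(function (text) { /* show it in the UI */ })`, where both arguments are your own values.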

rylodom · November 9, 2018, 8:40am

Yes, you are right! However, I do not want to use the API. If the number of API requests exceeds a certain amount, Google will charge. On-device OCR would be enough for my needs, and I would not have to think about what happens once more people than I expected use the app… Hm, I might have to move to NativeScript for that project, which is kinda sad.

aeq · November 9, 2018, 9:13am

Yeah, I see, offline usage and high level are key for you, so yeah, one of the hybrids is probably better for ya, but you’ll be missing out on the awesome performance and dev experience. The native integration is really just connecting the inputs and outputs from the module (ObjC & Java) to Uno, and from there to UX or JavaScript. Anyways, all the best with your mission man.
