Hi Erick,
Thanks for the help! I will take a look at it.
Martin Frank Hansen, Senior Data Analyst
Data, IM & Analytics
Lautrupparken 40-42, DK-2750 Ballerup
E-mail m...@kmd.dk Web www.kmd.dk
Mobile +4525571418
-Original message-
From: Erick Erickson
Sent: 21 October 2018 2
Hi Gus,
Thank you so much! I will definitely take a look at it during the day.
Martin Frank Hansen,
-Original message-
From: Gus Heck
Sent: 22 October 2018 00:06
To: solr-user@lucene.apache.org
Subject: Re: Tesseract language
Hi Martin,
I wrote a framework (https://github.com/nso
Hi Alex,
Thanks again for your reply, much appreciated.
Martin Frank Hansen, Senior Data Analyst
-Original message-
From: Alexandre Rafalovitch
Sent: 21 October
Hi Alexandre,
Thanks for your reply.
Yes, right now it is just for testing the possibilities of Solr and Tesseract.
I will take a look at the Tika documentation to see if I can make it work.
You said that DIH is not recommended for production usage. What is the
recommended method (or methods) to upload
Hi again,
Does anyone have experience using Tesseract’s OCR module within Solr?
The files I am trying to read into Solr are Danish TIFF documents.
Martin Frank Hansen, Senior Data Analyst
Data, IM & Analytics
Lautrupparken 40-42, DK-2750