For Bengali, you need to train the LSTM model. Legacy model training won't
work.

On Thu, Jan 28, 2021, 22:32 Boring Guy69 <dafarh...@gmail.com> wrote:

>
> Hello i am new to tesseract. i am working on bengali language [kalpurush
> font].
> I got lots of error when i make TR files. if i describe my work flow
> At first i create text file in utf-8 format. in those text file i put some
> Bengali word which is obviously in kalpurush font.
> then i create box files and tif files with help of Jtessboxeditor.
> then when i execute this command [ tesseract ben.kalpurush.exp0.tif
> ben.kalpurush.exp0 box.train ] it gives me error like......could not find a
> matching blob......box failed resegmentation. Suppose in my file there is
> 600 word it found only 300 good blobs.
> i attached a screenshot.
> Do i have to change any config for Bengali language. Can anyone tell me or
> suggest me what to do. i cant find any way to resolve this problem?
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/0b64b093-fbad-46b7-b604-56b4fb51c9e1n%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/0b64b093-fbad-46b7-b604-56b4fb51c9e1n%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduVxF5GEJHMM0unmOUobJTMHewVnRHOYSf3QEbWQ5RA8Cw%40mail.gmail.com.

Reply via email to