For Bengali, you need to train the LSTM model. Legacy model training (the box.train step you are running) won't work.
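As a rough sketch of what the LSTM route could look like, assuming you use the tesstrain Makefile workflow (my assumption, not something stated in this thread): tesstrain expects line images paired with `.gt.txt` transcription files, and is driven by `make training` with variables such as `MODEL_NAME` and `START_MODEL`. The helper names `write_ground_truth` and `tesstrain_command`, the `line_NNNN` file naming, and the iteration count below are hypothetical and only for illustration. The matching line images still have to be rendered separately.

```python
from pathlib import Path


def write_ground_truth(lines, out_dir, prefix="line"):
    """Write one UTF-8 .gt.txt transcription file per non-empty text line.

    Each file must later be paired with a line image of the same base name
    (for example line_0000.tif), which this sketch does not create.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    kept = [t.strip() for t in lines if t.strip()]  # skip blank lines
    written = []
    for i, text in enumerate(kept):
        path = out / f"{prefix}_{i:04d}.gt.txt"
        path.write_text(text + "\n", encoding="utf-8")
        written.append(path)
    return written


def tesstrain_command(model_name, start_model, tessdata_dir, max_iterations=10000):
    """Build the argument list for a tesstrain fine-tuning run (not executed here)."""
    return [
        "make",
        "training",
        f"MODEL_NAME={model_name}",
        f"START_MODEL={start_model}",    # existing model to fine-tune from
        f"TESSDATA={tessdata_dir}",      # directory holding the start model
        f"MAX_ITERATIONS={max_iterations}",
    ]
```

For example, `write_ground_truth(open("words.txt", encoding="utf-8"), "data/kalpurush-ground-truth")` would write the transcriptions, and `tesstrain_command("kalpurush", "ben", "/path/to/tessdata_best")` gives the command to run from the tesstrain checkout. The paths here are placeholders.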
On Thu, Jan 28, 2021, 22:32 Boring Guy69 <dafarh...@gmail.com> wrote:

> Hello, I am new to Tesseract. I am working on the Bengali language
> [kalpurush font].
> I get lots of errors when I make the TR files. My workflow is as follows.
> First I create a text file in UTF-8 format. In that text file I put some
> Bengali words, which are in the kalpurush font.
> Then I create box files and TIF files with the help of jTessBoxEditor.
> Then, when I execute this command [ tesseract ben.kalpurush.exp0.tif
> ben.kalpurush.exp0 box.train ], it gives me errors like......could not find a
> matching blob......box failed resegmentation. Suppose my file contains
> 600 words; it finds only 300 good blobs.
> I attached a screenshot.
> Do I have to change any config for the Bengali language? Can anyone tell me or
> suggest what to do? I can't find any way to resolve this problem.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduVxF5GEJHMM0unmOUobJTMHewVnRHOYSf3QEbWQ5RA8Cw%40mail.gmail.com.