Hi Shree, thanks for replying.
So shall I remove them from the text file and create the unicharset file after that, or do I have to do something while creating the lstmf files? Also, will this affect the training if I don't remove them? I saw that training continued, but the best char error was still 100 even after 5000 iterations and only went down to 96 after 7800 iterations. Weird. :-\

On Thursday, 16 April 2020 19:26:15 UTC+5:30, shree wrote:
> U+0965 ॥ e0 a5 a5 DEVANAGARI DOUBLE DANDA
>
> On Thu, Apr 16, 2020, 19:25 Shree Devi Kumar <[email protected]> wrote:
>> U+200D e2 80 8d ZERO WIDTH JOINER
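To make the first question concrete, this is roughly what I mean by removing them from the text file. It is only a sketch: the helper names and the sample string are made up for illustration, and whether stripping these two code points is actually the right thing to do is exactly what I am asking.

```python
import unicodedata

# The two code points quoted in this thread.
SUSPECT = {"\u200d", "\u0965"}  # ZERO WIDTH JOINER, DEVANAGARI DOUBLE DANDA


def count_suspects(text):
    """Return {char: count} for each suspect code point present in text."""
    return {ch: text.count(ch) for ch in SUSPECT if ch in text}


def strip_suspects(text):
    """Return text with every suspect code point removed."""
    return "".join(ch for ch in text if ch not in SUSPECT)


# Hypothetical sample line; a real run would read the training text file.
sample = "क\u200dष \u0965 राम"
counts = count_suspects(sample)
cleaned = strip_suspects(sample)

for ch, n in sorted(counts.items()):
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}: {n}")
```

The cleaned text would then be what I feed into the unicharset and lstmf steps, if that is the recommended order.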

