I'm going to be using tesseract 4 and using the tesstrain.sh script. If I come across things that improve accuracy though I will let you know.
Where did you find 1300 handwriting fonts? On Tuesday, June 19, 2018 at 5:19:54 PM UTC+1, Navaneetha Bitla wrote: > > serak trainer using training tesseract 3.5. > > > > On Tue, Jun 19, 2018 at 9:29 PM, James Q <james.qu...@taina.tech > <javascript:>> wrote: > >> Hi Navaneetha >> I am also looking to start training tesseract using handwritten fonts and >> am about to start setting up my training environment. Are you training >> tesseract 4 by following the guide at >> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00 ? >> >> If so are you fine tuning the existing english model, retraining just the >> top layer(s) or training from scratch with your additional fonts? >> >> Thanks >> Jim >> >> On Tuesday, June 19, 2018 at 10:30:30 AM UTC+1, Navaneetha Bitla wrote: >>> >>> Hi, this is Navaneetha >>> >>> i'm working in hand written character recognition project. >>> >>> I have trained 1300 different hand written fonts of english and moved >>> the files into tessdata directory. >>> >>> tested tesseract using the below commands: >>> >>> $convert -density 300 input.png -depth 8 -strip -background white -alpha >>> off out.tiff >>> >>> $tesseract out.tiff eng >>> >>> The input.png is of Alanis Handa font and i have trained this font but >>> i'm not getting atleast 40% accuracy. >>> >>> Can someone help me. >>> >>> >>> Thanks in advance. >>> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-oc...@googlegroups.com <javascript:>. >> To post to this group, send email to tesser...@googlegroups.com >> <javascript:>. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> >> For more options, visit https://groups.google.com/d/optout. >> > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/29a1bc53-d127-407b-8611-0652821a0707%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.