[tesseract-ocr] Incremental Training Tesseract 4.0+ for fraktur

Val LNB Tue, 28 Jan 2020 09:17:23 -0800

*How to perform incremental training on Tesseract 4.0+?*


I want to improve the existing fraktur (frk) model with some 6000 hand 
curated lines from our library. 

Ground truth for these lines has 10 new unicode characters not present in 
German fraktur model.


How can I continue training from the existing German fraktur model without 
full retraining?


Progress so far:


   - Following information on https://github.com/tesseract-ocr/tesstrain
   - My script created the .tif and gt.txt files based on examples provided 
   in 
   https://github.com/tesseract-ocr/tesstrain/blob/master/ocrd-testset.zip
   - Now makefile 
   https://github.com/tesseract-ocr/tesstrain/blob/master/Makefile has 
   space for START_MODEL 


What/if anything do I enter into START_MODEL?


It would be fantastic to see an example CLI command used for your 
incremental training. :)













-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/1e79c1d6-de0c-4c87-b07c-9455b90cfef4%40googlegroups.com.

[tesseract-ocr] Incremental Training Tesseract 4.0+ for fraktur

Reply via email to