subject:"[tesseract-ocr] Re: Change unicharset"

Re: [tesseract-ocr] Re: Change unicharset

2018-04-12 Thread ShreeDevi Kumar

1. Concatenate the two training texts:
   cat ./langdata/kor/kor.training_text ./langdata/chi_tra/chi_tra.training_text > ./langdata/kor/kor-chi_tra.training_text
2. Run tesstrain.sh (update for your paths; as a test, run with just one font that supports both languages):
   $tesstrain_dir/tesstrai
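Before moving on to the tesstrain.sh step, it may help to confirm that the concatenation really produced a file containing both texts. The following is only a sketch: it uses a scratch directory and tiny made-up sample files (all hypothetical) instead of the real ./langdata/ paths from the message.

```shell
# Hypothetical scratch directory and sample content; in the thread the real
# files live under ./langdata/kor/ and ./langdata/chi_tra/.
workdir=$(mktemp -d)
printf '한국어 문장\n두 번째 줄\n' > "$workdir/kor.training_text"
printf '繁體中文句子\n' > "$workdir/chi_tra.training_text"

# Same operation as step 1: concatenate the two training texts.
cat "$workdir/kor.training_text" "$workdir/chi_tra.training_text" \
  > "$workdir/kor-chi_tra.training_text"

# Sanity check: the merged file should have as many lines as both inputs together.
# tr strips the padding that some wc implementations add.
kor_lines=$(wc -l < "$workdir/kor.training_text" | tr -d ' ')
chi_lines=$(wc -l < "$workdir/chi_tra.training_text" | tr -d ' ')
merged_lines=$(wc -l < "$workdir/kor-chi_tra.training_text" | tr -d ' ')
echo "kor=$kor_lines chi_tra=$chi_lines merged=$merged_lines"
```

If the merged count is lower than the sum, one of the input paths was probably wrong or the redirect overwrote an input file.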

[tesseract-ocr] Re: Change unicharset

2018-04-12 Thread Fanatico

And if I look at the "kor.unicharset" created after executing "training/tesstrain.sh", it contains only the Korean characters, even after I changed "kor.lstm-unicharset" from the "kor.traineddata".
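One way to check which characters ended up in a generated unicharset is to look for specific entries. This is a rough sketch, assuming that the first line of the file is an entry count and that each following line starts with the character followed by whitespace. The sample file below is a simplified, made-up stand-in for kor.unicharset (real entries carry more property fields), and has_unichar is a hypothetical helper name.

```shell
# Hypothetical, simplified stand-in for a generated kor.unicharset.
unicharset=$(mktemp)
printf '3\nNULL 0 Common 0\n한 1 Hangul 1\n국 1 Hangul 2\n' > "$unicharset"

# Succeeds if some entry line (after the count line) has the given
# character as its first whitespace-separated field.
has_unichar() {
  awk -v c="$1" 'NR > 1 && $1 == c { found = 1 } END { exit !found }' "$2"
}

# Probe one Korean and one Chinese character.
for ch in 한 中; do
  if has_unichar "$ch" "$unicharset"; then
    echo "$ch: present"
  else
    echo "$ch: missing"
  fi
done
```

Running the same check against the real file with a character that appears only in the chi_tra training text would show whether the merged text was actually picked up.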