1. concatenate the two training texts
cat ./langdata/kor/kor.training_text
./langdata/chi_tra/chi_tra.training_text >
./langdata/kor/kor-chi_tra.training_text
2. run tesstrain.sh with (update for your paths, run with just one font
which supports both languages as a test)
$tesstrain_dir/tesstrai
And if I look at the "kor.unicharset" created after executing
"training/tesstrain.sh" it only contains the korean characters, even after
I changing "kor.lstm-unicharset" from the "kor.traineddata"
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" grou
2 matches
Mail list logo