The correct repo for the language data is: https://github.com/tesseract-ocr/tessdata
3.04 added 39 new languages including: amh, asm, aze_cyrl, bod, bos, ceb, cym, dzo, fas, gle, guj, hat, iku, jav, kat, kat_old, kaz, khm, kir, kur, lao, lat, mar, mya, nep, ori, pan, pus, san, sin, srp_latn, syr, tgk, tir, uig, urd, uzb, uzb_cyrl, yid There are a total of 107 languages supported now. On Thursday, February 25, 2016 at 3:41:00 PM UTC-5, Tom Morris wrote: > > On Thursday, February 25, 2016 at 3:32:10 AM UTC-5, 기옥주 wrote: >> >> I wonder what is improved ver. 3.04. more detail especially these list. >> > > 3.04 compared to what? You can see the changes from 3.03 by using Github > with a URL like this: > > https://github.com/tesseract-ocr/tesseract/compare/3.03-rc1...3.04.00 > > One of the biggest changes though is not in Tesseract itself, but the > associated language data. Almost all languages were updated and many > languages added. > > There are release notes with a summary of the changes here: > > https://github.com/tesseract-ocr/tesseract/wiki/ReleaseNotes#tesseract-release-notes-july-11-2015---v30400 > > Tom > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/cd52fc18-b091-4714-8525-34b961eb8bab%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

