I have a similar problem with the breton language, the lib does not recognize the verbal particle o and replace it by a zero 0 . oa which mean "was' in english becomes 0a philippe
Le jeudi 29 février 2024 à 09:45:53 UTC+1, Iman Firouzian a écrit : > Hi again, > I've tested it on windows and pycharm. > the tesseract version is tesseract v5.0.0-alpha.20200328 > > the result is roughly the same. > it would recognize correctly when numbers are mixed with letters. > Any specific confugurations needed? > > thanks for helping > On Thursday, February 29, 2024 at 9:50:33 AM UTC+3:30 Iman Firouzian wrote: > >> I've installed it using: >> !sudo apt install tesseract-ocr >> in Google Colab. >> >> it says it's the latest version: tesseract-ocr is already the newest >> version (4.1.1-2.1build1). >> >> and the language model is "fas" and is installed by: >> !sudo apt install tesseract-ocr-fas >> >> thanks for helping >> >> On Thursday, February 29, 2024 at 1:50:50 AM UTC+3:30 tfmo...@gmail.com >> wrote: >> >>> On Wednesday, February 28, 2024 at 3:28:51 AM UTC-5 Iman Firouzian wrote: >>> >>> >>> Please help me with this >>> >>> >>> Please include more details about what version of the software you are >>> using and which language (or script) model(s). >>> >>> Tom >>> >> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/5c3c102a-87b4-4700-bd17-a64f4525f2bcn%40googlegroups.com.