I found the root of the problem: correct is "-l deu" but I used "-l=deu".
See https://stackoverflow.com/a/50601305/633961 And I created an issue: https://github.com/tesseract-ocr/tesseract/issues/1616 Am Donnerstag, 24. Mai 2018 13:35:55 UTC+2 schrieb Thomas Güttler: > > I use tesseract 4.0 via docker (tesseractshadow/tesseract4re) > > Very often tesseract detects "StraBe" instead of "Straße". > > Yes, I use -l=deu > > The word "Straße" is very common in german. It means "street". > > Since "StraBe" makes no sense I would like to improve this. > > What do you suggest? > > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/616553aa-2ae2-4354-82c3-b366ce102d31%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

