For multiple languages the standard invocation is to use the two language
codes with + sign.

Eg. -l ara+eng or -l eng+jpn

Alternately you can also try the script traineddata files eg. Devanagari
includes eng+hin+san+mar+nep

However, multiple languages recognition takes more time and is not perfect.

On Wed, Aug 19, 2020, 13:20 Pankaj Gupta <pan...@gaurishiv.org> wrote:

> Dear Team,
>
> Waiting for your suggestions.  Need your help.
>
> Thank you in advance.
>
> Regards,
> Pankaj
>
> On Friday, August 14, 2020 at 12:45:05 AM UTC+5:30 Pankaj Gupta wrote:
>
>> Dear Team,
>>
>> Me and team is developing a tool that extract the text from the given
>> images (containing data related to single language) using tesseract/ The
>> tool is able to extract the text in 14 different languages with a higher
>> accuracy greater than 95%.
>>
>> We have got a new challenge in the development that there are images that
>> contain text in more than one language (Japanese - English or Arabic -
>> English). due to copyright issues, I am not able to attach the original
>> image, A sample image is attached along with this thread which contains
>> text in Japanese and English depicting the actual scenarios. Request your
>> support in identifying the technique to extract the text accurately in both
>> the language.
>>
>> I am using Python 3+, open CV, and tesseract for development.
>>
>> Thanks in advance.
>>
>> Regards,
>> Pankaj Gupta
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/cc03edb3-b96b-477f-9b31-fe7e4a0ccb4cn%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/cc03edb3-b96b-477f-9b31-fe7e4a0ccb4cn%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXUJk7axvHAb8NLPcuJYbC832SNcmsmzgpaLPLrmBd1DA%40mail.gmail.com.

Reply via email to