Hello,

I'm using Tesseract OCR for Urdu Nastalique script, and many similar-looking
characters are being misrecognized. I'm sure there must be some flags in
Tesseract that would solve my problem. An example of the misrecognition is
given below: for the character on the left, Tesseract returns the character
on the right as its recognition output:


<https://lh3.googleusercontent.com/-pOaY13abZs8/UMg3xu4kpjI/AAAAAAAAAB0/Q1Az-HsMU5M/s1600/G_HFL_%D8%B3%D8%B9%D8%A8%DB%81_368_F14_CC_1.bmp><https://lh6.googleusercontent.com/-DhzQOI7DQ8w/UMg3-jDbd6I/AAAAAAAAAB8/5_Wxe_Ueuss/s1600/G_HFL_%D9%85%D8%B9%D9%85%DB%81_4316_F14_L_1_CC_1.bmp>
Can anyone please guide me on which flags can be changed in order to
overcome this problem?
