[tesseract-ocr] Re: How to train Tesseract to recognize raster fonts?

2015-05-11 Thread smwikipedia smwikipedia
To find the possible font, I will use: https://www.myfonts.com/WhatTheFont/ 在 2015年5月12日星期二 UTC+8上午9:27:36,smwikipedia smwikipedia写道: > > I am using Tesseract to do OCR for some screenshots. The characters in > screenshots are in raster fonts. But Tesseract requires True Type Font or > Open Typ

[tesseract-ocr] Re: How to train Tesseract to recognize raster fonts?

2015-05-11 Thread smwikipedia smwikipedia
I just read this thread: https://groups.google.com/forum/#!msg/tesseract-ocr/ZsYvAIHWumA/XVMhN7j6__sJ They mentioned not to use tesseract for raster font recognition. Anyway, *as a workaround*, I will try to find a TrueType font that is close to my raster fonts. And still train tesseract with th

[tesseract-ocr] Re: OCR on Nintendo game screenshots

2015-05-11 Thread smwikipedia smwikipedia
Hi Leah, I am having a similar issue to recognize raster fonts. Could you share your progress? Thanks! 在 2015年4月23日星期四 UTC+8下午2:38:29,Leah Siddall写道: > > Hi all! > > I am not having luck with tesseract and the fonts used in NES games like > Super Mario Bros. 3. ( i've attached an example scree

[tesseract-ocr] How should I make Tesseract to support multiple fonts?

2015-05-11 Thread smwikipedia smwikipedia
I am trying to use Tesseract to do OCR for some screenshots. But the characters on screen can be of multiple fonts. I see that in the latest 3.03 version, the training tool `text2image` can easily generate training tif/box pair from training text and font files. If I want to support multiple fo

[tesseract-ocr] How to train Tesseract to recognize raster fonts?

2015-05-11 Thread smwikipedia smwikipedia
I am using Tesseract to do OCR for some screenshots. The characters in screenshots are in raster fonts. But Tesseract requires True Type Font or Open Type Font file for training? So how to train Tesseract to recognize raster fonts? -- You received this message because you are subscribed to

Re: [tesseract-ocr] Improve Text reading on image

2015-05-11 Thread Allistair
I guess I am not seeing the problem. Your pipeline is raw image -> image preprocess -> tesseract psm 6 -> postprocess to find likely numeric string Cheers On 11 May 2015 at 12:46, Nicholas Chew wrote: > Hi Allistair > Thanks for your reply. I had used Tesseract only. I need to process many > i

Re: [tesseract-ocr] Improve Text reading on image

2015-05-11 Thread Nicholas Chew
Hi Allistair Thanks for your reply. I had used Tesseract only. I need to process many images and i need the numbers on the images to be captured. I am using Imagemagick to process my image before running Tess. Also i am using -psm 4 where i set the tesseract to read the bottom part. On Mond

[tesseract-ocr] Some questions about tesseract 3.0x.

2015-05-11 Thread smwikipedia smwikipedia
1. For tesseract 3.02, after installation I see there's a pre-trained *eng.traineddata* file in the tessdata folder. How is this file generated? What font does it target? Can I blindly use it for my OCR application? 2. For tesseract 3.03, I see there's a new option "--print-parameters" for th

Re: [tesseract-ocr] Improve Text reading on image

2015-05-11 Thread Allistair
OK - so what OCR have you tried so far? I got (psm 6) ... I-" ” " ’ I I“ 1""? 1' '. _ % % 92907120 W% A%'% % On 11 May 2015 at 09:12, Nicholas Chew wrote: > Hi > I need help to process this image to read the text. I had tried the > command below but Tesseract still cant read it. What else di