RE: extracting text from image using pdfbox

Kishore Babu Mon, 15 Oct 2012 00:36:23 -0700

Thanks you very much.

Regards,



Kishore Babu I Developer 
email: [email protected]
office: 040.66417681
www.envistacorp.com
Subscribe to enVista's Newsletter!
      










-----Original Message-----
From: [email protected] 
[mailto:[email protected]] On Behalf Of Peter Murray-Rust
Sent: Monday, 15 October, 2012 11:46 AM
To: [email protected]
Subject: Re: extracting text from image using pdfbox

On Mon, Oct 15, 2012 at 6:17 AM, Kishore Babu <[email protected]> wrote:

> Hi Peter,
> Thank you very much for the reply. Unfortunately, the image I am 
> dealing are the scanned one.
>
> I will update my result if I succeed in using the mentioned line 
> detection algorithms.
>
> There is an excellent explanation of everything involved in OCR at :
http://sudokugrab.blogspot.com.au/2009/07/how-does-it-all-work.html . This is a 
hard problem but the better your scan the easier it becomes. if you have good 
contrast, modern typefaces, no/little skewing, and a well understood character 
set then you have a chance.

--
Peter Murray-Rust
Reader in Molecular Informatics
Unilever Centre, Dep. Of Chemistry
University of Cambridge
CB2 1EW, UK
+44-1223-763069

RE: extracting text from image using pdfbox

Reply via email to