Re: OCR in Linux

2010-01-10 Thread Pia
Wow awesome!  I am wondering why I don't have it in Lenny when I search 
for it.  I guess only one person meant tesseract then.  Thank you very 
much for your explanation because now I know there is a deb package.  I 
just have to figure out why I am not seeing it in my version of the distro 
and maybe download the version from a newer version now.  Thanks again for 
the good details.

On Sun, 10 Jan 2010, Kenny Hitt wrote:

> Hi.  I've been using ocropus for at least 2 years.  I built the first release 
> from source, but have used Debian packages ever since.
> Not sure why it isn't part of Lenny, but it is definitely part of Sid.
>
> ke...@blackbox:~$ apt-cache search ocropus
> ocropus - document analysis and OCR system
> ocropus-data - document analysis and OCR system --- data files
> ke...@blackbox:~$ apt-cache show ocropus
> Package: ocropus
> Status: install ok installed
> Priority: optional
> Section: graphics
> Installed-Size: 3732
> Maintainer: Jeffrey Ratcliffe 
> Architecture: i386
> Version: 0.3.1-2
> Depends: libc6 (>= 2.4), libgcc1 (>= 1:4.1.1), libiulib0, libjpeg62, 
> liblua5.1-0, libpng12-0 (>= 1.2.13-4), libstdc++6 (>= 4.1.1), libtiff4, 
> zlib1g (>= 1:1.1.4), ocropus-data (= 0.3.1-2)
> Recommends: tesseract-ocr (>= 2.03-2)
> Breaks: ocrodjvu (<< 0.3)
> Description: document analysis and OCR system
> OCRopus(tm) is a state-of-the-art document analysis and Optical
> Character Recognition (OCR) system, featuring
> pluggable layout analysis, pluggable character recognition, statistical
> natural language modeling, and multi-lingual capabilities.
> .
> The OCRopus engine is based on two research projects: a high-performance
> handwriting recognizer developed in the mid-90's and deployed by the US Census
> bureau, and novel high-performance layout analysis methods.
> .
> OCRopus development is sponsored by Google and is initially intended for
> high-throughput, high-volume document conversion efforts. It
> will also be an excellent OCR system for many other applications.
> Homepage: http://code.google.com/p/ocropus/
>
> I knew you were talking about ocropus and not tesseract.
>
>  Kenny
>
> On Sun, Jan 10, 2010 at 10:56:50PM -0500, pmik...@comcast.net wrote:
>>> Hi.  It's part of Debian.  I am running Sid, but it's been a Debian package 
>>> for a few years.
>>
>>
>> Huh?  I thought ocropus just got created via google summer of code not
>> long ago.  You and others must have thought I meant tesseract which is
>> not the email I replied to because you are the second person who emailed
>> me something about it instead of ocropus to my query about ocropus here.
>> The other guy who emailed me I am even more sure meant tesseract because
>> he actually included a link to it.  I appreciate everyone's kindness to
>> answer questions, but please read the message people are replying to
>> before replying.  Thanks.
>>
>>
>> --
>> Ubuntu-accessibility mailing list
>> Ubuntu-accessibility@lists.ubuntu.com
>> https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility
>
> -- 
> Ubuntu-accessibility mailing list
> Ubuntu-accessibility@lists.ubuntu.com
> https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility
>

-- 
Ubuntu-accessibility mailing list
Ubuntu-accessibility@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility


Re: OCR in Linux

2010-01-10 Thread Kenny Hitt
Hi.  I've been using ocropus for at least 2 years.  I built the first release 
from source, but have used Debian packages ever since.
Not sure why it isn't part of Lenny, but it is definitely part of Sid.

ke...@blackbox:~$ apt-cache search ocropus
ocropus - document analysis and OCR system
ocropus-data - document analysis and OCR system --- data files
ke...@blackbox:~$ apt-cache show ocropus
Package: ocropus
Status: install ok installed
Priority: optional
Section: graphics
Installed-Size: 3732
Maintainer: Jeffrey Ratcliffe 
Architecture: i386
Version: 0.3.1-2
Depends: libc6 (>= 2.4), libgcc1 (>= 1:4.1.1), libiulib0, libjpeg62, 
liblua5.1-0, libpng12-0 (>= 1.2.13-4), libstdc++6 (>= 4.1.1), libtiff4, zlib1g 
(>= 1:1.1.4), ocropus-data (= 0.3.1-2)
Recommends: tesseract-ocr (>= 2.03-2)
Breaks: ocrodjvu (<< 0.3)
Description: document analysis and OCR system
 OCRopus(tm) is a state-of-the-art document analysis and Optical
 Character Recognition (OCR) system, featuring
 pluggable layout analysis, pluggable character recognition, statistical
 natural language modeling, and multi-lingual capabilities.
 .
 The OCRopus engine is based on two research projects: a high-performance
 handwriting recognizer developed in the mid-90's and deployed by the US Census
 bureau, and novel high-performance layout analysis methods.
 .
 OCRopus development is sponsored by Google and is initially intended for
 high-throughput, high-volume document conversion efforts. It
 will also be an excellent OCR system for many other applications.
Homepage: http://code.google.com/p/ocropus/

I knew you were talking about ocropus and not tesseract.

  Kenny

On Sun, Jan 10, 2010 at 10:56:50PM -0500, pmik...@comcast.net wrote:
> > Hi.  It's part of Debian.  I am running Sid, but it's been a Debian package 
> > for a few years.
> 
> 
> Huh?  I thought ocropus just got created via google summer of code not 
> long ago.  You and others must have thought I meant tesseract which is 
> not the email I replied to because you are the second person who emailed 
> me something about it instead of ocropus to my query about ocropus here. 
> The other guy who emailed me I am even more sure meant tesseract because 
> he actually included a link to it.  I appreciate everyone's kindness to 
> answer questions, but please read the message people are replying to 
> before replying.  Thanks.
> 
> 
> -- 
> Ubuntu-accessibility mailing list
> Ubuntu-accessibility@lists.ubuntu.com
> https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility

-- 
Ubuntu-accessibility mailing list
Ubuntu-accessibility@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility


Re: OCR in Linux

2010-01-10 Thread pmikeal
> Hi.  It's part of Debian.  I am running Sid, but it's been a Debian package 
> for a few years.


Huh?  I thought ocropus just got created via google summer of code not 
long ago.  You and others must have thought I meant tesseract which is 
not the email I replied to because you are the second person who emailed 
me something about it instead of ocropus to my query about ocropus here. 
The other guy who emailed me I am even more sure meant tesseract because 
he actually included a link to it.  I appreciate everyone's kindness to 
answer questions, but please read the message people are replying to 
before replying.  Thanks.


-- 
Ubuntu-accessibility mailing list
Ubuntu-accessibility@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility


Re: OCR in Linux

2010-01-10 Thread Pia
Wow, where did you find it, in Debian or as a Debian package?  I am 
running Lenny and don't see it.  I tried to compile it from source and it 
keeps failing.  Thanks!

On Sat, 9 Jan 2010, Kenny Hitt wrote:

> Hi.  Have you tried ocropus?
>
> Don't know if it's in Ubuntu, but it's definitely a Debian package.
>
>  Kenny
> On Mon, Jan 04, 2010 at 04:15:42PM -0500, Pia wrote:
>> Does anyone have an opinion about what the best OCR software is for Linux
>> for blind people that talks well?  I would prefer a command line package,
>> but if there is only a GUI one that works OK, knowing about it would be
>> good to know too.
>>
>> --
>> Ubuntu-accessibility mailing list
>> Ubuntu-accessibility@lists.ubuntu.com
>> https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility
>
> -- 
> Ubuntu-accessibility mailing list
> Ubuntu-accessibility@lists.ubuntu.com
> https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility
>

-- 
Ubuntu-accessibility mailing list
Ubuntu-accessibility@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility


Re: OCR in Linux

2010-01-09 Thread Matt Barnes
tesseract was good back when I tried it:
http://code.google.com/p/tesseract-ocr/


On Sat, Jan 9, 2010 at 11:50 PM, Kenny Hitt  wrote:

> Hi.  Have you tried ocropus?
>
> Don't know if it's in Ubuntu, but it's definitely a Debian package.
>
>  Kenny
> On Mon, Jan 04, 2010 at 04:15:42PM -0500, Pia wrote:
> > Does anyone have an opinion about what the best OCR software is for Linux
> > for blind people that talks well?  I would prefer a command line package,
> > but if there is only a GUI one that works OK, knowing about it would be
> > good to know too.
> >
> > --
> > Ubuntu-accessibility mailing list
> > Ubuntu-accessibility@lists.ubuntu.com
> > https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility
>
> --
> Ubuntu-accessibility mailing list
> Ubuntu-accessibility@lists.ubuntu.com
> https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility
>



-- 
"I cannot teach anybody anything, I can only make them think."
-Socrates
-- 
Ubuntu-accessibility mailing list
Ubuntu-accessibility@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility


Re: OCR in Linux

2010-01-09 Thread Kenny Hitt
Hi.  Have you tried ocropus?

Don't know if it's in Ubuntu, but it's definitely a Debian package.

  Kenny
On Mon, Jan 04, 2010 at 04:15:42PM -0500, Pia wrote:
> Does anyone have an opinion about what the best OCR software is for Linux 
> for blind people that talks well?  I would prefer a command line package, 
> but if there is only a GUI one that works OK, knowing about it would be 
> good to know too.
> 
> -- 
> Ubuntu-accessibility mailing list
> Ubuntu-accessibility@lists.ubuntu.com
> https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility

-- 
Ubuntu-accessibility mailing list
Ubuntu-accessibility@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility


OCR in Linux

2010-01-04 Thread Pia
Does anyone have an opinion about what the best OCR software is for Linux 
for blind people that talks well?  I would prefer a command line package, 
but if there is only a GUI one that works OK, knowing about it would be 
good to know too.

-- 
Ubuntu-accessibility mailing list
Ubuntu-accessibility@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-accessibility