Re: [CODE4LIB] Open source workstations in academic libraries
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hi John, I'm poking around to find the right person. I'll get back to you shortly. James John Houser wrote: > James, > I'm just following up on this issue. (Conference season intervened.) I > would appreciate a tip on who to talk to at NC about their laptops. > > Thanks! > > J > > > -- > John Houser > Senior Technology Consultant > PALINET > 3000 Market St, Suite 200 > Philadelphia, PA 19104 > 215-382-7031 ext. 1222 > [EMAIL PROTECTED] > IM: ClarkHouser > http://blog.palinet.org/dt > -Original Message- > From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf Of > James Tuttle > Sent: Wednesday, October 22, 2008 1:15 PM > To: CODE4LIB@LISTSERV.ND.EDU > Subject: Re: [CODE4LIB] Open source workstations in academic libraries > > NC State provides laptops for student check-out that dual boot Linux and > Windows. I'm not an authority on it, but if you're interested, could > point you to someone. > > James > > > John Houser wrote: >> I'm in the process of writing an article about open source >> workstations and am looking for an academic library example to use as > a case study. >> Does anyone know of one? This could be a library using Linux or BSD on > >> public workstations or using the Linux Terminal Server (LTS). Any help > >> would be greatly appreciated. > >> J > >> -- >> John Houser >> Senior Technology Consultant >> PALINET >> 3000 Market St, Suite 200 >> Philadelphia, PA 19104 >> 215-382-7031 ext. 1222 >> [EMAIL PROTECTED] >> IM: ClarkHouser >> http://blog.palinet.org/dt > >> Register today for the PALINET08 CONFERENCE + VENDOR FAIR, October 27 >> & >> 28 at the Sheraton University City, Philadelphia, PA at >> www.palinet.org/2008conference. > - -- - --- James Tuttle Digital Repository Librarian NCSU Libraries, Box 7111 North Carolina State University Raleigh, NC 27695-7111 [EMAIL PROTECTED] (919)513-0651 Phone (919)515-3031 Fax -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.9 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iEYEARECAAYFAkkcK8cACgkQKxpLzx+LOWN2fwCeLdVd7XbJPVKpuHhhUBGPbPmU tUIAoIbmoS2tCwE0EM17czNwOaLNIHY5 =Pmd7 -END PGP SIGNATURE-
Re: [CODE4LIB] Open source workstations in academic libraries
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 NC State provides laptops for student check-out that dual boot Linux and Windows. I'm not an authority on it, but if you're interested, could point you to someone. James John Houser wrote: > I'm in the process of writing an article about open source workstations > and am looking for an academic library example to use as a case study. > Does anyone know of one? This could be a library using Linux or BSD on > public workstations or using the Linux Terminal Server (LTS). Any help > would be greatly appreciated. > > J > > -- > John Houser > Senior Technology Consultant > PALINET > 3000 Market St, Suite 200 > Philadelphia, PA 19104 > 215-382-7031 ext. 1222 > [EMAIL PROTECTED] > IM: ClarkHouser > http://blog.palinet.org/dt > > Register today for the PALINET08 CONFERENCE + VENDOR FAIR, October 27 & > 28 at the Sheraton University City, Philadelphia, PA at > www.palinet.org/2008conference. - -- - --- James Tuttle Digital Repository Librarian NCSU Libraries, Box 7111 North Carolina State University Raleigh, NC 27695-7111 [EMAIL PROTECTED] (919)513-0651 Phone (919)515-3031 Fax -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFI/1+IKxpLzx+LOWMRAgq+AKCEUMaWON+LphW6Akvh7RJNuw2jKACgg6TG FT4j0d1urw+Jngxck94qGEU= =yj9s -END PGP SIGNATURE-
Re: [CODE4LIB] OCR PDFs
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Thanks for the tip. Especially the part where you make it clear that OmniPage doesn't really work. Back to Acrobat, I guess. Thanks all! Jonathan Brinley wrote: > This is somewhat off-topic, since you asked for something you can use > on Linux. In any case... > > I've been using OmniPage 16, and I'm sorry to say I can't recommend > it. You can't run it from the command line, so you can't really > integrate it into a script. It does have a batch manager, so you can > set it to do whole folders at a time. Just make sure your folder's not > too large; it crashes fairly reliably after about 10-40 pages. > > If you do use OmniPage to make your PDFs, I've found that it works > best to convert a single TIFF into a single-page PDF, then use > pdftk[1] (along with a [language of your choice] script) to put those > PDFs together however you want them. > > Have a nice day, > Jonathan > > [1] http://www.accesspdf.com/pdftk/ > - -- - --- James Tuttle Digital Repository Librarian NCSU Libraries, Box 7111 North Carolina State University Raleigh, NC 27695-7111 [EMAIL PROTECTED] (919)513-0651 Phone (919)515-3031 Fax -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFI+QviKxpLzx+LOWMRAp1gAJ9ipNqWDxNPubPIl9qoo00XWqrn0gCgkR1R fDkLic6eBVmRr6G4rvVSU3s= =ySuL -END PGP SIGNATURE-
Re: [CODE4LIB] OCR PDFs
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Yes, I've tried tesseract and found it to be pretty accurate, but I don't believe there is a way to integrate the text back into the PDF. It's easy to pull text out of image-based PDFs, but not to put the text back in. Driving me crazy... Thanks for tips, James Bridger Dyson-Smith wrote: > If you haven't already, take a look at tesseract ( > http://code.google.com/p/tesseract-ocr/). There's some discussion of using > tesseract and shell scripting to work with tiffs to pdfs to ocr'd text, > which isn't exactly what you're wanting to do, I know, but may prove helpful > (http://www.groklaw.net/articlebasic.php?story=20061210115516438). > Cheers! > Bridger Dyson-Smith > > > On Fri, Oct 17, 2008 at 8:28 AM, Terry Harrison <[EMAIL PROTECTED]> wrote: > >> You might want to look at ABBYY Fine Reader 9.0 Professional, which can be >> driven from the command line. Fine Reader is used at the Library of >> Congress. Here is a info link to get you started (search "command"): >> >> >> http://www.scanstore.com/Scanning/Document_Imaging/Software/OCR_Software/Nuance/omnipage_review.asp >> >> Regards, >> Terry >> >> >> Terry Harrison >> Project Manager >> CACI >> 5505 Robin Hood Road, Suite F >> Norfolk, Va. 23508 >> Ph: 757.321.9120 x232 >> Fax: 757.321.8797 >> [EMAIL PROTECTED] >> - -- - --- James Tuttle Digital Repository Librarian NCSU Libraries, Box 7111 North Carolina State University Raleigh, NC 27695-7111 [EMAIL PROTECTED] (919)513-0651 Phone (919)515-3031 Fax -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFI+QuEKxpLzx+LOWMRAhSyAJ9+lQ/1J5SP/23XQrVrlsoNRZyKxQCfYTGw qUBK6A9mkiLy88buUz7Wngg= =DyZk -END PGP SIGNATURE-
[CODE4LIB] OCR PDFs
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 I wonder if any of you might have experience with creating text PDFs from TIFFs. I've been using tiffcp to stitch TIFFs together into a single image and then using tiff2pdf to generate PDFs from the single TIFF. I've had to pass this image-based PDF to someone with Acrobat to use it's batch processing facility to OCR the text and save a text-based PDF. I wonder if anyone has suggestions for software I can integrate into the script (Python on Linux) I'm using. Thanks, James - -- - ------- James Tuttle Digital Repository Librarian NCSU Libraries, Box 7111 North Carolina State University Raleigh, NC 27695-7111 [EMAIL PROTECTED] (919)513-0651 Phone (919)515-3031 Fax -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFI+H1zKxpLzx+LOWMRAgxIAJwNXyeMJbk6r6hmHpNAdEvWIQbCVgCgp8JR nyS3WZ4UuRbU/6DTH7ohe/M= =mT2T -END PGP SIGNATURE-
Re: [CODE4LIB] CODE4LIB Digest - 13 Jun 2008 to 14 Jun 2008 (#2008-132)
Trying to port my Linux experience to Solaris 10 makes my brain bleed some days. I'd recommend, and this is probably too onerous for the original poster, installing OpenSolaris in a virtual machine if the installation route seems viable. The differences between Solaris (or [EMAIL PROTECTED]@# Solaris!, as we affectionately call it) and Linux are pretty great. However, installing a distro inside VirtualBox is probably a little more advanced than the OP may be prepared for. Jim > -- > > Date:Sat, 14 Jun 2008 20:02:39 +0100 > From:Tim Hodson <[EMAIL PROTECTED]> > Subject: Re: Unix training options? > > As most linux distros and unix systems share a common history, many > commands are similar but have more or less options. If you want to > experiment with linux distro's, I can recommend trying out VirtualBox > [1] , which is now distributed by sun. It is free for non-comercial > use (teaching yourself sounds non commercial to me), and gives you a > chance to try installing several operating systems without having to > worry about trashing your existing (host) system. > > In terms of the absolute basics for moving round the system and seeing > what is going on, I would recommend the following commands > > bash - use a bash shell which has handy command history and command > completion with the tab key > cd - change directory > ls - list the contents of a direcory > vi - to read, create and edit files. > less - view even very big files easily, and uses standard vi commands > to navigate > > The easiest way to learn is through doing, playing and making > mistakes. - and being forced to learn because you HAVE to do something > is a great catalyst to knew knowledge. :) > > Tim > informationtakesover.co.uk > colourphon.co.uk > > [1] http://www.virtualbox.org/ -- --- Jim Tuttle Geospatial Data Librarian NCSU Libraries, Box 7111 North Carolina State University Raleigh, NC 27695-7111 [EMAIL PROTECTED] (919)513-0651 Phone (919)515-3031 Fax