George,
     You should be able to switch to xpdf2text with little effort.  It saved us 
loads of time trying to figure out why filter-media was taking so long to run 
(ours sometimes ran for days at a time) and it now successfully filters 100% of 
our pdf documents except for those that are truly corrupt (so it also helps us 
identify corrupt docs in our repository).  You can find installation 
documentation at 
https://jira.duraspace.org/secure/attachment/10527/xpdf-filters.html 
Good luck,
Sue



Sue Walker-Thornton
Software Developer/Database Administrator
NASA Langley Research Center|LITES Contract
(757) 224-4074


-----Original Message-----
From: George Stanley Kozak [mailto:g...@cornell.edu] 
Sent: Monday, December 13, 2010 9:52 AM
To: Thornton, Susan M. (LARC-B702)[LITES]; Sean Carte
Cc: dspace-tech@lists.sourceforge.net
Subject: RE: [Dspace-tech] Question about filter-media hanging

Sue and Sean:

Thanks very much.  I will look into xpdf2text.

George Kozak
Digital Library Specialist
Cornell University Library Information Technologies (CUL-IT)
501 Olin Library
Cornell University
Ithaca, NY 14853
607-255-8924

-----Original Message-----
From: Thornton, Susan M. (LARC-B702)[LITES] [mailto:susan.m.thorn...@nasa.gov] 
Sent: Monday, December 13, 2010 8:54 AM
To: Sean Carte; George Stanley Kozak
Cc: dspace-tech@lists.sourceforge.net
Subject: RE: [Dspace-tech] Question about filter-media hanging

We had lots of problems with filter-media until we changed pdfbox to xpdf2text.



Sue Walker-Thornton
Software Developer/Database Administrator
NASA Langley Research Center|LITES Contract
(757) 224-4074



-----Original Message-----
From: Sean Carte [mailto:sean.ca...@gmail.com] 
Sent: Monday, December 13, 2010 4:27 AM
To: George Stanley Kozak
Cc: dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Question about filter-media hanging

On 10 December 2010 21:22, George Stanley Kozak <g...@cornell.edu> wrote:
> Hi.
>
>
>
> I am running DSpace 1.6.2.  Last week we batch loaded about 1500 PDF files
> and this week I loaded about 2300 images (mostly Jpegs).  I noticed today
> that the thumbnails hadn't been generated by the filter-media program (which
> runs nightly).  When I went to look, I discovered several filter-media
> programs running.  It looks like the jobs were hanging up and then the next
> night, a new one started up and that one got hung up, etc.
>
>
>
> I have tried running filter-media in verbose mode, but I am not seeing
> anything in particular that is causing the hang up.  No java errors.it just
> seems to hang.
>
>
>
> Does anyone have any suggestions as to what I should next?
>
>
>
> George Kozak

Have a look at the suggestions in this thread:

http://old.nabble.com/-Dspace-tech--filter-media-hanging-td29158622.html#a29158622

Updating pdfbox worked for me.

Sean
-- 
Sean Carte
esAL Library Systems Manager
+27 72 898 8775
+27 31 373 2490
fax: 0866741254
http://esal.dut.ac.za/

------------------------------------------------------------------------------
Oracle to DB2 Conversion Guide: Learn learn about native support for PL/SQL,
new data types, scalar functions, improved concurrency, built-in packages, 
OCI, SQL*Plus, data movement tools, best practices and more.
http://p.sf.net/sfu/oracle-sfdev2dev 
_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech

------------------------------------------------------------------------------
Lotusphere 2011
Register now for Lotusphere 2011 and learn how
to connect the dots, take your collaborative environment
to the next level, and enter the era of Social Business.
http://p.sf.net/sfu/lotusphere-d2d
_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to