Hi,
I have successfully implemented xpdf pdftotext (replacing PDFBox) in
DSpace 1.5.1. It works GREAT and so far has filtered 100% of our documents,
even the ones PDFBox found to be "unfilterable". Now I'm trying to get
pdftoppm to work and I'm getting this error:
Applying Media Filters
The following MediaFilters are enabled:
Full Filter Name: org.dspace.app.mediafilter.HTMLFilter
org.dspace.app.mediafilter.HTMLFilter
Full Filter Name: org.dspace.app.mediafilter.WordFilter
org.dspace.app.mediafilter.WordFilter
Full Filter Name: org.dspace.app.mediafilter.JPEGFilter
org.dspace.app.mediafilter.JPEGFilter
Full Filter Name: org.dspace.app.mediafilter.XPDF2Text
org.dspace.app.mediafilter.XPDF2Text
Full Filter Name: org.dspace.app.mediafilter.XPDF2Thumbnail
org.dspace.app.mediafilter.XPDF2Thumbnail
FILTERED: bitstream 443 and created 'CA029045.pdf.txt'
ERROR filtering, skipping bitstream:
Item Handle: 2121/169228
Bundle Name: ORIGINAL
File Size: 2064225
Checksum: 4216969d76a86e6c9c169bbe0a3cff7d (MD5)
Asset Store: 0
javax.imageio.IIOException: Can't read input file!
javax.imageio.IIOException: Can't read input file!
    at javax.imageio.ImageIO.read(ImageIO.java:1275)
    at org.dspace.app.mediafilter.XPDF2Thumbnail.getDestinationStream(XPDF2Thumbnail.java:229)
    at org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:668)
    at org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:570)
    at org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:520)
    at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:488)
    at org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:379)
Wrote Item: 2121/169228 to Index at Thu Jul 23 12:16:16 EDT 2009
I think it is complaining because XPDF2Thumbnail needs to know where the input
file is and where to put the output file(s). Can anyone help with this?
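To make my guess concrete, here is a minimal sketch. It is my own assumption about the cause, not code from DSpace. As far as I can tell, ImageIO.read(File) throws exactly this IIOException when the File it is handed does not exist or cannot be read, which is what I would expect if pdftoppm never wrote its output where the filter looks for it. The class name ThumbnailInputCheck and the sample path are hypothetical.

```java
import java.io.File;
import java.io.IOException;
import javax.imageio.IIOException;
import javax.imageio.ImageIO;

public class ThumbnailInputCheck {

    // Returns true if ImageIO.read(File) rejects the file with the same
    // message that appears in the stack trace above.
    public static boolean failsWithCantReadInput(File f) {
        try {
            ImageIO.read(f);
            return false;
        } catch (IIOException e) {
            return "Can't read input file!".equals(e.getMessage());
        } catch (IOException e) {
            // Some other I/O problem, not the one from the trace.
            return false;
        }
    }

    public static void main(String[] args) {
        // Hypothetical path: substitute the file pdftoppm is expected to create.
        File expected = new File(args.length > 0 ? args[0] : "/tmp/no-such-thumbnail.ppm");
        System.out.println(expected + " exists=" + expected.exists()
                + " canRead=" + expected.canRead());
        System.out.println("Same failure as in the trace: "
                + failsWithCantReadInput(expected));
    }
}
```

If the check reports that the file is missing, that would point at the pdftoppm step (command path, output location, or permissions) rather than at ImageIO itself.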
Thanks,
Sue
Sue Walker-Thornton
ConITS Contract
NASA Langley Research Center
Integrated Library Systems Application & Database Administrator
130 Research Drive
Hampton, VA 23666
Office: (757) 224-4074
Fax: (757) 224-4001
Pager: (757) 988-2547
Email: [email protected]