We are currently importing a large number of pdf files into Fedora 2.2.3
using Muradora 1.33 and the Drama Solr plug-in for Fedora that goes with
it.
Pdfs that I have created using PDF Converter Professional import fine.
Files coming from Computer Science are suffering an 80% failure rate.
Solr claims they contain invalid hex characters. We have a worrying
suspicion that these latter files have been created with the Word 2007
pdf plug-in. We can't be sure because the person who created them has
left the University.
Has anyone else encountered a similar problem and/or know if there is a
simple fix?
Best
Richard
___________________________________________________________________
Richard Green
Manager, RepoMMan, RIDIR and REMAP Projects
e-Services Integration Group
www.hull.ac.uk/esig/repomman
www.hull.ac.uk/ridir
www.hull.ac.uk/remap
edocs.hull.ac.uk
*****************************************************************************************
To view the terms under which this email is distributed, please go to
http://www.hull.ac.uk/legal/email_disclaimer.html
*****************************************************************************************
-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Fedora-commons-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users