We are currently importing a large number of PDF files into Fedora 2.2.3
using Muradora 1.33 and the Drama Solr plug-in for Fedora that goes with
it.

PDFs that I have created using PDF Converter Professional import fine.
Files coming from Computer Science are suffering an 80% failure rate:
Solr claims they contain invalid hex characters.  We have a worrying
suspicion that the failing files were created with the Word 2007
PDF plug-in, but we can't be sure because the person who created them
has left the University.
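
One way to test the invalid-character theory would be to strip the
characters that XML 1.0 forbids from the extracted PDF text before it is
sent for indexing. The Python sketch below is only an illustration under
assumptions: it assumes the Solr error refers to characters outside the
XML 1.0 "Char" range, and the function name strip_invalid_xml_chars is
hypothetical rather than part of Fedora, Muradora, Drama or Solr.

```python
import re

# Characters allowed by the XML 1.0 "Char" production:
#   #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
# Everything else (most control characters, surrogates, U+FFFE/U+FFFF)
# is matched by this negated character class.
_INVALID_XML_CHARS = re.compile(
    "[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def strip_invalid_xml_chars(text: str) -> str:
    """Return text with characters that are illegal in XML 1.0 removed.

    Hypothetical helper: assumes the indexing failure is caused by such
    characters appearing in the text extracted from the PDF.
    """
    return _INVALID_XML_CHARS.sub("", text)


def count_invalid_xml_chars(text: str) -> int:
    """Count offending characters, to confirm whether a file is affected."""
    return len(_INVALID_XML_CHARS.findall(text))
```

Running count_invalid_xml_chars over the text extracted from a failing
file and from a working file would show whether such characters are
really what separates the two groups.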

Has anyone else encountered a similar problem, or does anyone know of a
simple fix?

Best

Richard

___________________________________________________________________

Richard Green
Manager, RepoMMan, RIDIR and REMAP Projects
e-Services Integration Group

www.hull.ac.uk/esig/repomman
www.hull.ac.uk/ridir
www.hull.ac.uk/remap
edocs.hull.ac.uk

_______________________________________________
Fedora-commons-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
