[CPS-users-fr] ProtalTransforms pdf->text ne trouve pas 'pdftohtml'

Winterflood, Jonathan Mon, 16 Apr 2007 07:17:59 -0700

Bonjour,

J'utilise les PortalTransforms pour convertir un document pdf attaché à un 
CPSDocument au format texte pour indexation par le ZCatalog.


Sur windows (machine de 'dev), après installation de pdftotext, tout marche 
très bien
Par contre, sur linux (machine de production), PortalTransforms lève une 
exception

MissingBinary: Unable to find binary "pdftohtml"

Voici la fin du traceback:

#  Module Products.CPSDocument.CPSDocument, line 76, in validate
# Module Products.CPSDocument.FlexibleTypeInformation, line 811, in 
validateObject
# Module Products.CPSDocument.FlexibleTypeInformation, line 822, in _commitDM
# Module Products.CPSSchemas.DataModel, line 337, in _commit
# Module Products.CPSSchemas.DataModel, line 359, in _commitData
# Module Products.CPSSchemas.BasicFields, line 568, in computeDependantFields
# Module Products.CPSSchemas.FileUtils, line 71, in convertFileToText
# Module Products.CPSSchemas.FileUtils, line 57, in _convertFileToMimeType
# Module Products.PortalTransforms.TransformEngine, line 97, in convertTo
# Module Products.PortalTransforms.TransformEngine, line 248, in _findPath
# Module Products.PortalTransforms.TransformEngine, line 270, in _getPaths
# Module ZODB.Connection, line 732, in setstate
# Module ZODB.Connection, line 786, in _setstate
# Module ZODB.serialize, line 605, in setGhostState
# Module Products.PortalTransforms.zope.Transform, line 105, in __setstate__
# Module Products.PortalTransforms.zope.Transform, line 111, in _tr_init
__traceback_info__: ('Products.PortalTransforms.transforms.pdf_to_html',)
# Module Products.PortalTransforms.zope.Transform, line 145, in _load_transform
# Module Products.PortalTransforms.transforms.pdf_to_html, line 62, in register
# Module Products.PortalTransforms.transforms.pdf_to_html, line 25, in __init__
# Module Products.PortalTransforms.libtransforms.commandtransform, line 19, in 
__init__
# Module Products.PortalTransforms.libtransforms.utils, line 25, in bin_search
MissingBinary: Unable to find binary "pdftohtml"

Je ne comprends pas pourquoi c'est pdf_to_html qui est utilisé, vu que c'est 
une conversion texte simple qui est demandée...(convertFileToText demande le 
type text/plain)

Le binaire pdftotext est bien disponible sur la machine linux, et pdftohtml, 
sur aucune des deux.

Est-ce que quelqu'un a deja rencontré un probleme de ce type?
Par où devrais-je chercher?

Merci d'avance pour vos suggestions,

Jonathan Winterflood


This message contains information that may be privileged or confidential and is 
the property of the Capgemini Group. It is intended only for the person to whom 
it is addressed. If you are not the intended recipient,  you are not authorized 
to read, print, retain, copy, disseminate,  distribute, or use this message or 
any part thereof. If you receive this  message in error, please notify the 
sender immediately and delete all  copies of this message.

_______________________________________________
cps-users-fr 
Adresse de la liste : [email protected]
Gestion de l'abonnement : <http://lists.nuxeo.com/mailman/listinfo/cps-users-fr>

[CPS-users-fr] ProtalTransforms pdf->text ne trouve pas 'pdftohtml'

Répondre à