Hi Greg

Let me know if you can solve this. Our org is looking at doing something 
similar - so we would REALLY like to use any solutions you can come
up with.

>From my preliminary analysis, I found the problem not insignificant and highly 
>dependent on how structured the Indesign typography was i.e.
tags al a CSS.
 
I was looking at a combination of parsing both html and xml exports (Indesign 
CS2 and CS4)

The html gives the images nicely whereas in xml images are encoded (MIME 
/base64 style)
The xml has better record of actual document structure layout
 
In summary, if the original Indesign doc is not structured i.e. custom layout 
and not a formal DTP type tagguing typography, the task is
rather diffcult to achieve accurate parsing. A good test is whether the 
Indesign can export to ePUb with relative accuracy

PS Adobe - evade the issue on this one - their idea of Indesign to Web is via a 
Flash doc

andre

 
 
-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of 
gregor brabyn
Sent: Thursday, 5 August 2010 11:36
To: [email protected]
Subject: [phpug]Moving Indesign file information into PHP application

Hi 

I am quoting on making a document management system that has highly hierarchial 
structured data. The document is mostly text but does
contain images and data in tables.

Currently the document is maintained in Indesign CS2 and I understand that 
Indesign can export the data as XML or as a PDF.

The client has asked for a quote, among things, on transferring the document 
from Indesign to the new PHP system I would build. 

Has anyone had any experience with anything like this. With an XML version I am 
not sure how I would deal with loading the images. With a
PDF version I am not sure if there are any PHP based PDF parsers that would 
keep the hierarchial structure intact. Not even sure if the PDF
internally keeps this data structure.

Does anyone have any advice on this. It is currently a large document that in 
PDF format has something like one and a half thousand pages.
Is it going to be better to just use a data entry operator?

Greg


      

-- 
NZ PHP Users Group: http://groups.google.com/group/nzphpug
To post, send email to [email protected]
To unsubscribe, send email to
[email protected]
please consider the environment before you print this email

##########################################################################################

The views expressed in this e-mail and any corresponding attachments do not 
necessarily
reflect those of the Health Research Council of New Zealand. This e-mail 
together with any
accompanying attachments may be confidential and subject to legal privilege. If 
you have
received this message in error, please notify the sender immediately and note 
that you may
not copy, disclose or use the content in any way

Thank You.
##########################################################################################

-- 
NZ PHP Users Group: http://groups.google.com/group/nzphpug
To post, send email to [email protected]
To unsubscribe, send email to
[email protected]

Reply via email to