I have been using htdig with great success for about 6 months on an
        internal web server, shcih serves up some vendors documentation.

        Now I have a set of files from one vendor which consiit of some (I
        think) failry complex pdf documents. They for example have a lot of
        multi-page documents, and coullums.

        I have tried using acrobat 4, the conv_doc.pl script, and the
        parse_doc.pl script, all with thier own sets of problems. With acrobat4
        I get what appears to be good extraction for some files, and nothing
        whatsover for others. I have observed that one of the ones I am not
        getting naything on is a multipage document.

        With parse_doc I ge errors like:

        External parser error in line:   without disrupting the other modules
        in the system  

        with conv_doc, I get errors about some close faiures (?).

        Can anyone give me some advice on how to make this work?

        Thanks.

-- 
Stan Brown     [EMAIL PROTECTED]                                    843-745-3154
Westvaco
Charleston SC.
-- 
Windows 98: n.
        useless extension to a minor patch release for 32-bit extensions and
        a graphical shell for a 16-bit patch to an 8-bit operating system
        originally coded for a 4-bit microprocessor, written by a 2-bit 
        company that can't stand for 1 bit of competition.
-
(c) 2000 Stan Brown.  Redistribution via the Microsoft Network is prohibited.

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.

Reply via email to