[EMAIL PROTECTED] wrote:
> Hi Anat,
>
> I was wondering if you found the answer to your question below. I am
> having the same problem that you have described. It doesn't fail to
> parse all the pdf files but it does with a good percentage of them.
> If you have any input I'd appreciate it.
>
> Thanks Anat,
>
> Greg Guerin
> Phone: 1(970)898-6139
> Hewlett-Packard Company
> Fort Collins, CO 80528
>
> ______________________________________________________________________
> Hi all,
>
> I have a Solaris 2.6 machine on which I've managed to install htdig 3.1.0b1.
> I set this in the conf file:
> pdf_parser: /tools/Acrobat/bin/acroread
>
>
> but I get these messages while digging:
> ... /tmp/htdig11117.pdf: Could not repair file.
> PDF::parse: cannot open acroread output
>
>
> Has anyone seen this, or does anyone know what's wrong?
> Thanks
>
>
>
> --
> Anat Rozenzon
> API/Intranet team Tel: +972-8-9134480
> Telrad Ltd. Fax: +972-8-9133487
> P.O.B. 50, Lod, Israel Email: [EMAIL PROTECTED]
>
Hi,
We have a solution that seems to work. First, you must set the environment
variable 'TMPDIR' (with something like "setenv TMPDIR /tmp").
Then, and I think this is the main thing, we enlarged the swap space to about
three times the size of physical memory.
We now have:
Memory 0.5G
Swap 1.5G
It now seems to be working OK: our PDF files are about 1-5M each and they are
all indexed.
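
If you want to check the same two things on your own machine, here is a rough
Python sketch. It is only an illustration: the function names
tmpdir_is_usable and swap_to_memory_ratio are made up for this message and are
not part of htdig or acroread, and the figures are simply the ones from our
setup above.

```python
import os
import tempfile


def tmpdir_is_usable(env=None):
    """Return True if TMPDIR is set and points to a writable directory.

    Hypothetical helper for illustration; not part of htdig.
    """
    env = os.environ if env is None else env
    path = env.get("TMPDIR")
    if not path or not os.path.isdir(path):
        return False
    try:
        # Creating (and automatically removing) a scratch file shows
        # that the directory is actually writable.
        with tempfile.NamedTemporaryFile(dir=path):
            pass
    except OSError:
        return False
    return True


def swap_to_memory_ratio(memory_gb, swap_gb):
    """Return swap size divided by physical memory size."""
    if memory_gb <= 0:
        raise ValueError("memory_gb must be positive")
    return swap_gb / memory_gb


if __name__ == "__main__":
    print("TMPDIR usable:", tmpdir_is_usable())
    # Figures from our machine: 0.5G memory, 1.5G swap.
    print("swap/memory ratio:", swap_to_memory_ratio(0.5, 1.5))
```

A ratio of about 3 is just what worked for us, not a documented requirement.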
bye
--
Anat Rozenzon
API/Intranet team Tel: +972-8-9134480
Telrad Ltd. Fax: +972-8-9133487
P.O.B. 50, Lod, Israel Email: [EMAIL PROTECTED]