On 16.01.2025 10:12, Keith Robinson via user wrote:

Thank you for responding.

Do you mean TMPDIR (possibly TEMP, TEMPDIR or TMP) environment variables? I’ll admit I haven’t, partly because I wasn’t sure when a temporary file would be created and therefore how to test it!

Yes that's what I mean. I can't tell you for sure when a temporary file would be created, but I looked into the source code and found several calls.

Try this: take a huge PDF file (with hundreds of pages of texts that do extract) and try a text extraction and monitor the temp directory while it happens.

https://opensource.adobe.com/dc-acrobat-sdk-docs/pdfstandards/PDF32000_2008.pdf

Tilman


Keith

*From:*Tilman Hausherr <[email protected]>
*Sent:* 16 January 2025 09:06
*To:* [email protected]
*Subject:* Re: Tika Server and temporary files


        

You don't often get email from [email protected]. Learn why this is important <https://aka.ms/LearnAboutSenderIdentification>

        

CAUTION: This email originated from outside of Nexor. Do not click links or open attachments unless you recognise the sender and know the content is safe.

Have you tried to change the environment variables before starting the server?

I looked into the code, there are createTempFile() calls, but this means the "official" temp directory is used.

Tilman

On 16.01.2025 09:56, Keith Robinson via user wrote:

    Hi,

    Just trying again to see if anyone can answer this question?

    Keith

    *From:*Keith Robinson via user <[email protected]>
    <mailto:[email protected]>
    *Sent:* 02 January 2025 17:08
    *To:* [email protected]
    *Subject:* Tika Server and temporary files


        

    [email protected] appears similar to someone who previously
    sent you email, but may not be that person. Learn why this could
    be a risk <https://aka.ms/LearnAboutSenderIdentification>

        

    CAUTION: This email originated from outside of Nexor. Do not click
    links or open attachments unless you recognise the sender and know
    the content is safe.

    Hi there,

    I have a general question about using Tika Server 2.9.2  (with the
    REST API) and its usage of /tmp. From what I understand, temporary
    files may be created while processing files, but is it possible to
    change the directory where Tika Server stores these temporary files?

    Many thanks,

    Keith.

Reply via email to