java.io.tmpdir system property can be set to specify temporary directory for only the Tika JVM. It will not affect other application running on the system.
java -Djava.io.tmpdir=/path/to/tmpdir From: Tilman Hausherr <[email protected]> Sent: Thursday, January 16, 2025 3:45 PM To: [email protected] Subject: Re: Tika Server and temporary files CAUTION: This email originated from outside the organization. Do not click links or open attachments unless you recognize the sender and know the content is safe. If you believe this is a phishing email, use the Report to Cybersecurity icon in Outlook. On 16.01.2025 10:12, Keith Robinson via user wrote: Thank you for responding. Do you mean TMPDIR (possibly TEMP, TEMPDIR or TMP) environment variables? I'll admit I haven't, partly because I wasn't sure when a temporary file would be created and therefore how to test it! Yes that's what I mean. I can't tell you for sure when a temporary file would be created, but I looked into the source code and found several calls. Try this: take a huge PDF file (with hundreds of pages of texts that do extract) and try a text extraction and monitor the temp directory while it happens. https://opensource.adobe.com/dc-acrobat-sdk-docs/pdfstandards/PDF32000_2008.pdf Tilman Keith From: Tilman Hausherr <[email protected]><mailto:[email protected]> Sent: 16 January 2025 09:06 To: [email protected]<mailto:[email protected]> Subject: Re: Tika Server and temporary files You don't often get email from [email protected]<mailto:[email protected]>. Learn why this is important<https://aka.ms/LearnAboutSenderIdentification> CAUTION: This email originated from outside of Nexor. Do not click links or open attachments unless you recognise the sender and know the content is safe. Have you tried to change the environment variables before starting the server? I looked into the code, there are createTempFile() calls, but this means the "official" temp directory is used. Tilman On 16.01.2025 09:56, Keith Robinson via user wrote: Hi, Just trying again to see if anyone can answer this question? Keith From: Keith Robinson via user <[email protected]><mailto:[email protected]> Sent: 02 January 2025 17:08 To: [email protected]<mailto:[email protected]> Subject: Tika Server and temporary files [email protected]<mailto:[email protected]> appears similar to someone who previously sent you email, but may not be that person. Learn why this could be a risk<https://aka.ms/LearnAboutSenderIdentification> CAUTION: This email originated from outside of Nexor. Do not click links or open attachments unless you recognise the sender and know the content is safe. Hi there, I have a general question about using Tika Server 2.9.2 (with the REST API) and its usage of /tmp. From what I understand, temporary files may be created while processing files, but is it possible to change the directory where Tika Server stores these temporary files? Many thanks, Keith.
