I'm trying to investigate a little bit and I found the following open issue
in in Tika describing the same problem:
https://issues.apache.org/jira/browse/TIKA-4186

Mario, are you running Tika as a single instance deployment or a cluster?
Probably using a cluster should solve the problem.

Cheers,
PG


Il giorno mer 15 gen 2025 alle ore 08:37 Bisonti Mario <
mario.biso...@vimar.com> ha scritto:

> I think the problem happens because these multple errors:
>
> 2025-01-15T01:50:52.418230657Z INFO  [qtp628487793-361] 02:50:52,417
> org.apache.tika.server.core.resource.MetadataResource /meta
> (application/pdf)
> 2025-01-15T01:55:52.439370905Z ERROR [Thread-11] 02:55:52,436
> org.apache.tika.server.core.ServerStatusWatcher Timeout task PARSE, millis
> elapsed 300014; consider increasing the allowable time with the
> <taskTimeoutMillis/> parameter or the X-Tika-Timeout-Millis header
> 2025-01-15T01:55:52.448124026Z WARN  [Thread-11] 02:55:52,444
> org.apache.tika.server.core.ServerStatusWatcher forked process observed
> TIMEOUT and is shutting down.
> 2025-01-15T01:55:52.448508991Z INFO  [Thread-11] 02:55:52,447
> org.apache.tika.server.core.ServerStatusWatcher Shutting down forked
> process with status: TIMEOUT
> .
> .
> 2025-01-15T02:04:06.654266316Z ERROR [Thread-11] 03:04:06,652
> org.apache.tika.server.core.ServerStatusWatcher Timeout task PARSE, millis
> elapsed 300030; consider increasing the allowable time with the
> <taskTimeoutMillis/> parameter or the X-Tika-Timeout-Millis header
> 2025-01-15T02:04:06.655956922Z WARN  [qtp628487793-67] 03:04:06,655
> org.apache.pdfbox.pdmodel.font.PDType1Font Using fallback font ArialMT for
> Symbol
> 2025-01-15T02:04:06.656048487Z WARN  [Thread-11] 03:04:06,655
> org.apache.tika.server.core.ServerStatusWatcher forked process observed
> TIMEOUT and is shutting down.
> 2025-01-15T02:04:06.659121149Z INFO  [Thread-11] 03:04:06,659
> org.apache.tika.server.core.ServerStatusWatcher Shutting down forked
> process with status: TIMEOUT
> .
> .
> 2025-01-15T02:09:56.098872207Z ERROR [Thread-13] 03:09:56,098
> org.apache.tika.server.core.ServerStatusWatcher Timeout task PARSE, millis
> elapsed 300036; consider increasing the allowable time with the
> <taskTimeoutMillis/> parameter or the X-Tika-Timeout-Millis header
> 2025-01-15T02:09:56.101104521Z WARN  [Thread-13] 03:09:56,101
> org.apache.tika.server.core.ServerStatusWatcher forked process observed
> TIMEOUT and is shutting down.
> 2025-01-15T02:09:56.103720121Z INFO  [Thread-13] 03:09:56,103
> org.apache.tika.server.core.ServerStatusWatcher Shutting down forked
> process with status: TIMEOUT
>
> So, I have to investigate about to avoid tika shutting down.
>
> Is there any parameter on ManifoldCF to do not consider these errors?
>
> Thanks a lot
>
> Mario
>
>

-- 
Piergiorgio

Reply via email to