I'm trying to investigate a little bit and I found the following open issue in in Tika describing the same problem: https://issues.apache.org/jira/browse/TIKA-4186
Mario, are you running Tika as a single instance deployment or a cluster? Probably using a cluster should solve the problem. Cheers, PG Il giorno mer 15 gen 2025 alle ore 08:37 Bisonti Mario < mario.biso...@vimar.com> ha scritto: > I think the problem happens because these multple errors: > > 2025-01-15T01:50:52.418230657Z INFO [qtp628487793-361] 02:50:52,417 > org.apache.tika.server.core.resource.MetadataResource /meta > (application/pdf) > 2025-01-15T01:55:52.439370905Z ERROR [Thread-11] 02:55:52,436 > org.apache.tika.server.core.ServerStatusWatcher Timeout task PARSE, millis > elapsed 300014; consider increasing the allowable time with the > <taskTimeoutMillis/> parameter or the X-Tika-Timeout-Millis header > 2025-01-15T01:55:52.448124026Z WARN [Thread-11] 02:55:52,444 > org.apache.tika.server.core.ServerStatusWatcher forked process observed > TIMEOUT and is shutting down. > 2025-01-15T01:55:52.448508991Z INFO [Thread-11] 02:55:52,447 > org.apache.tika.server.core.ServerStatusWatcher Shutting down forked > process with status: TIMEOUT > . > . > 2025-01-15T02:04:06.654266316Z ERROR [Thread-11] 03:04:06,652 > org.apache.tika.server.core.ServerStatusWatcher Timeout task PARSE, millis > elapsed 300030; consider increasing the allowable time with the > <taskTimeoutMillis/> parameter or the X-Tika-Timeout-Millis header > 2025-01-15T02:04:06.655956922Z WARN [qtp628487793-67] 03:04:06,655 > org.apache.pdfbox.pdmodel.font.PDType1Font Using fallback font ArialMT for > Symbol > 2025-01-15T02:04:06.656048487Z WARN [Thread-11] 03:04:06,655 > org.apache.tika.server.core.ServerStatusWatcher forked process observed > TIMEOUT and is shutting down. > 2025-01-15T02:04:06.659121149Z INFO [Thread-11] 03:04:06,659 > org.apache.tika.server.core.ServerStatusWatcher Shutting down forked > process with status: TIMEOUT > . > . > 2025-01-15T02:09:56.098872207Z ERROR [Thread-13] 03:09:56,098 > org.apache.tika.server.core.ServerStatusWatcher Timeout task PARSE, millis > elapsed 300036; consider increasing the allowable time with the > <taskTimeoutMillis/> parameter or the X-Tika-Timeout-Millis header > 2025-01-15T02:09:56.101104521Z WARN [Thread-13] 03:09:56,101 > org.apache.tika.server.core.ServerStatusWatcher forked process observed > TIMEOUT and is shutting down. > 2025-01-15T02:09:56.103720121Z INFO [Thread-13] 03:09:56,103 > org.apache.tika.server.core.ServerStatusWatcher Shutting down forked > process with status: TIMEOUT > > So, I have to investigate about to avoid tika shutting down. > > Is there any parameter on ManifoldCF to do not consider these errors? > > Thanks a lot > > Mario > > -- Piergiorgio