With debug:
[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 28034ms for sessionid 0x100000050ae0049 [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 28034ms for sessionid 0x100000050ae0049, closing socket connection and attempting reconnect [Thread-31532-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 27708ms for sessionid 0xff00000201970044 [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 27737ms for sessionid 0xff00000201970043 [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 27737ms for sessionid 0xff00000201970043, closing socket connection and attempting reconnect [Thread-31551-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 28316ms for sessionid 0x100000050ae004b [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 28394ms for sessionid 0x2000000b80d0047 [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 28394ms for sessionid 0x2000000b80d0047, closing socket connection and attempting reconnect [Thread-31532-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 27708ms for sessionid 0xff00000201970044, closing socket connection and attempting reconnect [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to authenticate using SASL (unknown error) agents process ran out of memory - shutting down [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session [Thread-7538-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 36805ms for sessionid 0x2000000b80d0046 [Thread-7538-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 36805ms for sessionid 0x2000000b80d0046, closing socket connection and attempting reconnect java.lang.OutOfMemoryError: GC overhead limit exceeded at java.lang.StringBuilder.toString(StringBuilder.java:407) at org.apache.manifoldcf.core.cachemanager.CacheManager.readSharedData(CacheManager.java:849) at org.apache.manifoldcf.core.cachemanager.CacheManager.hasExpired(CacheManager.java:483) at org.apache.manifoldcf.core.cachemanager.CacheManager.lookupObject(CacheManager.java:454) at org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:131) at org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:204) at org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.performQuery(DBInterfacePostgreSQL.java:862) at org.apache.manifoldcf.core.database.BaseTable.performQuery(BaseTable.java:236) at org.apache.manifoldcf.crawler.jobs.Jobs.deletingJobsPresent(Jobs.java:3133) at org.apache.manifoldcf.crawler.jobs.JobManager.getNextDeletableDocuments(JobManager.java:1862) at org.apache.manifoldcf.crawler.system.DocumentDeleteStufferThread.run(DocumentDeleteStufferThread.java:108) [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to authenticate using SASL (unknown error) agents process ran out of memory - shutting down [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 27763ms for sessionid 0x100000050ae004a [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 27763ms for sessionid 0x100000050ae004a, closing socket connection and attempting reconnect [zkCallback-3-thread-7] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@7a5c701e name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected type:None path:null path: null type: None [zkCallback-3-thread-7] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected [Thread-31551-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 28316ms for sessionid 0x100000050ae004b, closing socket connection and attempting reconnect java.lang.OutOfMemoryError: GC overhead limit exceeded [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session [zkCallback-11-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@53181a58 name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected type:None path:null path: null type: None [zkCallback-11-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service, session 0xff00000201970043 has expired [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service, session 0xff00000201970043 has expired, closing socket connection [Thread-7573-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0xff00000201970043 [zkCallback-11-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@53181a58 name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Expired type:None path:null path: null type: None [zkCallback-11-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - Our previous ZooKeeper session was expired. Attempting to reconnect to recover relationship with ZooKeeper... [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service, session 0x100000050ae0049 has expired [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service, session 0x100000050ae0049 has expired, closing socket connection [zkCallback-11-thread-2] WARN org.apache.solr.common.cloud.DefaultConnectionStrategy - Connection expired - starting a new one... [zkCallback-11-thread-2] INFO org.apache.zookeeper.ZooKeeper - Initiating client connection, connectString=kemp-formation-solr:2181 sessionTimeout=60000 watcher=org.apache.solr.common.cloud.ConnectionManager@53181a58 [Thread-5234-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0x100000050ae0049 [zkCallback-3-thread-4] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@7a5c701e name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Expired type:None path:null path: null type: None [zkCallback-3-thread-4] WARN org.apache.solr.common.cloud.ConnectionManager - Our previous ZooKeeper session was expired. Attempting to reconnect to recover relationship with ZooKeeper... [zkCallback-3-thread-4] WARN org.apache.solr.common.cloud.DefaultConnectionStrategy - Connection expired - starting a new one... [zkCallback-3-thread-4] INFO org.apache.zookeeper.ZooKeeper - Initiating client connection, connectString=kemp-formation-solr:2181 sessionTimeout=60000 watcher=org.apache.solr.common.cloud.ConnectionManager@7a5c701e [zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to authenticate using SASL (unknown error) [zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to authenticate using SASL (unknown error) [zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session [zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session [Thread-490] INFO org.eclipse.jetty.server.ServerConnector - Stopped ServerConnector@2a640157{HTTP/1.1}{0.0.0.0:8345} [zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on server kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid = 0x2000000b80d0049, negotiated timeout = 40000 [zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on server kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid = 0xff00000201970045, negotiated timeout = 40000 agents process ran out of memory - shutting down java.lang.OutOfMemoryError: GC overhead limit exceeded agents process ran out of memory - shutting down java.lang.OutOfMemoryError: GC overhead limit exceeded at java.util.HashMap.newNode(HashMap.java:1747) at java.util.HashMap.putVal(HashMap.java:631) at java.util.HashMap.put(HashMap.java:612) at jcifs.util.transport.Transport.sendrecv(Transport.java:66) at jcifs.smb.SmbTransport.send(SmbTransport.java:661) at jcifs.smb.SmbSession.send(SmbSession.java:238) at jcifs.smb.SmbTree.send(SmbTree.java:119) at jcifs.smb.SmbFile.send(SmbFile.java:776) at jcifs.smb.SmbFileInputStream.readDirect(SmbFileInputStream.java:181) at jcifs.smb.SmbFileInputStream.read(SmbFileInputStream.java:142) at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:903) at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) [zkCallback-11-thread-2] INFO org.apache.solr.common.cloud.ConnectionManager - Connection with ZooKeeper reestablished. [zkCallback-3-thread-4] INFO org.apache.solr.common.cloud.ConnectionManager - Connection with ZooKeeper reestablished. agents process ran out of memory - shutting down java.lang.OutOfMemoryError: GC overhead limit exceeded [zkCallback-11-thread-2] INFO org.apache.solr.common.cloud.DefaultConnectionStrategy - Reconnected to ZooKeeper [zkCallback-11-thread-2] INFO org.apache.solr.common.cloud.ConnectionManager - Connected:true [zkCallback-3-thread-4] INFO org.apache.solr.common.cloud.DefaultConnectionStrategy - Reconnected to ZooKeeper [zkCallback-3-thread-4] INFO org.apache.solr.common.cloud.ConnectionManager - Connected:true [Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0x2000000b80d0046 closed [zkCallback-21-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@381a7557 name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected type:None path:null path: null type: None [zkCallback-21-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected [Thread-7538-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0x2000000b80d0046 agents process ran out of memory - shutting down java.lang.OutOfMemoryError: GC overhead limit exceeded at java.util.regex.Matcher.<init>(Matcher.java:225) at java.util.regex.Pattern.matcher(Pattern.java:1093) at de.l3s.boilerpipe.util.UnicodeTokenizer.tokenize(UnicodeTokenizer.java:40) at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(BoilerpipeHTMLContentHandler.java:296) at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(BoilerpipeHTMLContentHandler.java:198) at org.apache.tika.parser.html.BoilerpipeContentHandler.characters(BoilerpipeContentHandler.java:155) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46) at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82) at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140) at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287) at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.xpath.MatchingContentHandler.characters(MatchingContentHandler.java:85) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46) at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82) at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140) at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287) at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279) at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:306) at org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator$SheetTextAsHTML.cell(XSSFExcelExtractorDecorator.java:431) [zkCallback-19-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@43f7378f name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected type:None path:null path: null type: None [zkCallback-19-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected [zkCallback-15-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@6432608f name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected type:None path:null path: null type: None [zkCallback-15-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected [zkCallback-13-thread-3] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@68bb3d74 name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected type:None path:null path: null type: None [zkCallback-13-thread-3] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected agents process ran out of memory - shutting down java.lang.OutOfMemoryError: GC overhead limit exceeded at sun.nio.cs.UTF_8.newEncoder(UTF_8.java:72) at java.lang.StringCoding.encode(StringCoding.java:348) at java.lang.String.getBytes(String.java:941) at org.postgresql.core.Utils.encodeUTF8(Utils.java:53) at org.postgresql.core.v3.QueryExecutorImpl.sendParse(QueryExecutorImpl.java:1448) at org.postgresql.core.v3.QueryExecutorImpl.sendOneQuery(QueryExecutorImpl.java:1777) at org.postgresql.core.v3.QueryExecutorImpl.sendQuery(QueryExecutorImpl.java:1354) at org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:292) at org.postgresql.jdbc.PgStatement.executeInternal(PgStatement.java:428) at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:354) at org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:301) at org.postgresql.jdbc.PgStatement.executeCachedSql(PgStatement.java:287) at org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:264) at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:260) at org.apache.manifoldcf.core.database.Database.execute(Database.java:876) at org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:696) [Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0xff00000201970044 closed [Thread-31532-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0xff00000201970044 [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to authenticate using SASL (unknown error) [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on server kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid = 0x100000050ae004a, negotiated timeout = 40000 [Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0x100000050ae004a closed [Thread-7574-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0x100000050ae004a [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to authenticate using SASL (unknown error) [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on server kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid = 0x2000000b80d0047, negotiated timeout = 40000 [Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0x2000000b80d0047 closed [Thread-7602-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0x2000000b80d0047 [Thread-490] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@44d52de2{/mcf-api-service,file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-5748290590258150821.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-api-service.war} [Thread-490] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@60410cd{/mcf-authority-service,file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-1380683823589504600.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-authority-service.war} <mailto:o.e.j.w.WebAppContext@60410cd%7b/mcf-authority-service,file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-1380683823589504600.dir/webapp/,UNAVAILABLE%7d%7b/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-authority-service.war%7d> Any idea? Thanks. De : Karl Wright [mailto:daddy...@gmail.com] Envoyé : mardi 24 juillet 2018 13:15 À : user@manifoldcf.apache.org Objet : Re: Out of memory, one file bug i think I've opened CONNECTORS-1516 to track the Class Not Found issue, and also created an Apache POI bugzilla ticket, which is referenced. Karl On Tue, Jul 24, 2018 at 6:15 AM Karl Wright <daddy...@gmail.com <mailto:daddy...@gmail.com> > wrote: The "class not found" error looks probably like a classloader issue with Tika -- the class is present in poi-ooxml-3.17.jar, although to be fair it might possibly be caused by an out-of-memory condition. You should be able to find the exception in the Simple History and figure out what document it came from from that. If not, then look at the log prior to the exception, and look at what Worker Thread 1 was doing. Karl On Tue, Jul 24, 2018 at 5:58 AM msaunier <msaun...@citya.com <mailto:msaun...@citya.com> > wrote: Re Karl, I have an Out of Memory Error today. I think I have an error with a document. I have this WARNING before crash: ------------------------------------------------------------------------ WARN 2018-07-24T11:46:22,098 (Worker thread '1') - Tika: Tika exception extracting: TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.OfficeParser@62980adb <mailto:org.apache.tika.parser.microsoft.OfficeParser@62980adb> org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.OfficeParser@62980adb <mailto:org.apache.tika.parser.microsoft.OfficeParser@62980adb> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286) ~[tika-core-1.17.jar:1.17] at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) ~[tika-core-1.17.jar:1.17] at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) ~[tika-core-1.17.jar:1.17] at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74) ~[mcf-tika-connector.jar:?] at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235) [mcf-tika-connector.jar:?] at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226) [mcf-agents.jar:?] at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077) [mcf-agents.jar:?] at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708) [mcf-agents.jar:?] at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756) [mcf-agents.jar:?] at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583) [mcf-pull-agent.jar:?] at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548) [mcf-pull-agent.jar:?] at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:939) [mcf-jcifs-connector.jar:?] at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) [mcf-pull-agent.jar:?] Caused by: java.io.IOException: java.lang.ClassNotFoundException: org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:150) ~[?:?] at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:102) ~[?:?] at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:203) ~[?:?] at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132) ~[?:?] at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) ~[?:?] ... 12 more Caused by: java.lang.ClassNotFoundException: org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder at java.net.URLClassLoader.findClass(URLClassLoader.java:381) ~[?:1.8.0_171] at java.lang.ClassLoader.loadClass(ClassLoader.java:424) ~[?:1.8.0_171] at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349) ~[?:1.8.0_171] at java.lang.ClassLoader.loadClass(ClassLoader.java:357) ~[?:1.8.0_171] at org.apache.poi.poifs.crypt.EncryptionInfo.getBuilder(EncryptionInfo.java:222) ~[?:?] at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:148) ~[?:?] at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:102) ~[?:?] at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:203) ~[?:?] at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132) ~[?:?] at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) ~[?:?] ... 12 more I think it’s a file, because RAM allocation have a weird behavior. In one second, ManifoldCF (or Tika) allocate +6Go RAM. How Can I find the file? Thanks, Maxence,