Hi Maxence, You would want to turn on connector debugging INSTEAD of the debugging you've turned on, which is very noisy and not helpful.
In global properties: org.apache.manifoldcf.connectors value DEBUG Karl On Tue, Jul 24, 2018 at 9:12 AM msaunier <msaun...@citya.com> wrote: > With debug: > > > > [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] WARN > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 28034ms for sessionid 0x100000050ae0049 > > [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 28034ms for sessionid 0x100000050ae0049, closing socket > connection and attempting reconnect > > [Thread-31532-SendThread(kemp-formation-solr.citya.local:2181)] WARN > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 27708ms for sessionid 0xff00000201970044 > > [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] WARN > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 27737ms for sessionid 0xff00000201970043 > > [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 27737ms for sessionid 0xff00000201970043, closing socket > connection and attempting reconnect > > [Thread-31551-SendThread(kemp-formation-solr.citya.local:2181)] WARN > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 28316ms for sessionid 0x100000050ae004b > > [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] WARN > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 28394ms for sessionid 0x2000000b80d0047 > > [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 28394ms for sessionid 0x2000000b80d0047, closing socket > connection and attempting reconnect > > [Thread-31532-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 27708ms for sessionid 0xff00000201970044, closing socket > connection and attempting reconnect > > [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Opening socket connection to server > kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to > authenticate using SASL (unknown error) > > agents process ran out of memory - shutting down > > [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Socket connection established to > kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session > > [Thread-7538-SendThread(kemp-formation-solr.citya.local:2181)] WARN > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 36805ms for sessionid 0x2000000b80d0046 > > [Thread-7538-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 36805ms for sessionid 0x2000000b80d0046, closing socket > connection and attempting reconnect > > java.lang.OutOfMemoryError: GC overhead limit exceeded > > at java.lang.StringBuilder.toString(StringBuilder.java:407) > > at > org.apache.manifoldcf.core.cachemanager.CacheManager.readSharedData(CacheManager.java:849) > > at > org.apache.manifoldcf.core.cachemanager.CacheManager.hasExpired(CacheManager.java:483) > > at > org.apache.manifoldcf.core.cachemanager.CacheManager.lookupObject(CacheManager.java:454) > > at > org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:131) > > at > org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:204) > > at > org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.performQuery(DBInterfacePostgreSQL.java:862) > > at > org.apache.manifoldcf.core.database.BaseTable.performQuery(BaseTable.java:236) > > at > org.apache.manifoldcf.crawler.jobs.Jobs.deletingJobsPresent(Jobs.java:3133) > > at > org.apache.manifoldcf.crawler.jobs.JobManager.getNextDeletableDocuments(JobManager.java:1862) > > at > org.apache.manifoldcf.crawler.system.DocumentDeleteStufferThread.run(DocumentDeleteStufferThread.java:108) > > [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Opening socket connection to server > kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to > authenticate using SASL (unknown error) > > agents process ran out of memory - shutting down > > [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] WARN > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 27763ms for sessionid 0x100000050ae004a > > [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 27763ms for sessionid 0x100000050ae004a, closing socket > connection and attempting reconnect > > [zkCallback-3-thread-7] WARN > org.apache.solr.common.cloud.ConnectionManager - Watcher > org.apache.solr.common.cloud.ConnectionManager@7a5c701e name: > ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent > state:Disconnected type:None path:null path: null type: None > > [zkCallback-3-thread-7] WARN > org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected > > [Thread-31551-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard > from server in 28316ms for sessionid 0x100000050ae004b, closing socket > connection and attempting reconnect > > java.lang.OutOfMemoryError: GC overhead limit exceeded > > [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Socket connection established to > kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session > > [zkCallback-11-thread-5] WARN > org.apache.solr.common.cloud.ConnectionManager - Watcher > org.apache.solr.common.cloud.ConnectionManager@53181a58 name: > ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent > state:Disconnected type:None path:null path: null type: None > > [zkCallback-11-thread-5] WARN > org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected > > [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] WARN > org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service, > session 0xff00000201970043 has expired > > [Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service, > session 0xff00000201970043 has expired, closing socket connection > > [Thread-7573-EventThread] INFO org.apache.zookeeper.ClientCnxn - > EventThread shut down for session: 0xff00000201970043 > > [zkCallback-11-thread-2] WARN > org.apache.solr.common.cloud.ConnectionManager - Watcher > org.apache.solr.common.cloud.ConnectionManager@53181a58 name: > ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent > state:Expired type:None path:null path: null type: None > > [zkCallback-11-thread-2] WARN > org.apache.solr.common.cloud.ConnectionManager - Our previous ZooKeeper > session was expired. Attempting to reconnect to recover relationship with > ZooKeeper... > > [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] WARN > org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service, > session 0x100000050ae0049 has expired > > [Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service, > session 0x100000050ae0049 has expired, closing socket connection > > [zkCallback-11-thread-2] WARN > org.apache.solr.common.cloud.DefaultConnectionStrategy - Connection expired > - starting a new one... > > [zkCallback-11-thread-2] INFO org.apache.zookeeper.ZooKeeper - Initiating > client connection, connectString=kemp-formation-solr:2181 > sessionTimeout=60000 > watcher=org.apache.solr.common.cloud.ConnectionManager@53181a58 > > [Thread-5234-EventThread] INFO org.apache.zookeeper.ClientCnxn - > EventThread shut down for session: 0x100000050ae0049 > > [zkCallback-3-thread-4] WARN > org.apache.solr.common.cloud.ConnectionManager - Watcher > org.apache.solr.common.cloud.ConnectionManager@7a5c701e name: > ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent > state:Expired type:None path:null path: null type: None > > [zkCallback-3-thread-4] WARN > org.apache.solr.common.cloud.ConnectionManager - Our previous ZooKeeper > session was expired. Attempting to reconnect to recover relationship with > ZooKeeper... > > [zkCallback-3-thread-4] WARN > org.apache.solr.common.cloud.DefaultConnectionStrategy - Connection expired > - starting a new one... > > [zkCallback-3-thread-4] INFO org.apache.zookeeper.ZooKeeper - Initiating > client connection, connectString=kemp-formation-solr:2181 > sessionTimeout=60000 > watcher=org.apache.solr.common.cloud.ConnectionManager@7a5c701e > > [zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] > INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server > kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to > authenticate using SASL (unknown error) > > [zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] > INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server > kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to > authenticate using SASL (unknown error) > > [zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] > INFO org.apache.zookeeper.ClientCnxn - Socket connection established to > kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session > > [zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] > INFO org.apache.zookeeper.ClientCnxn - Socket connection established to > kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session > > [Thread-490] INFO org.eclipse.jetty.server.ServerConnector - Stopped > ServerConnector@2a640157{HTTP/1.1}{0.0.0.0:8345} > > [zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] > INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on > server kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid = > 0x2000000b80d0049, negotiated timeout = 40000 > > [zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] > INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on > server kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid = > 0xff00000201970045, negotiated timeout = 40000 > > agents process ran out of memory - shutting down > > java.lang.OutOfMemoryError: GC overhead limit exceeded > > agents process ran out of memory - shutting down > > java.lang.OutOfMemoryError: GC overhead limit exceeded > > at java.util.HashMap.newNode(HashMap.java:1747) > > at java.util.HashMap.putVal(HashMap.java:631) > > at java.util.HashMap.put(HashMap.java:612) > > at jcifs.util.transport.Transport.sendrecv(Transport.java:66) > > at jcifs.smb.SmbTransport.send(SmbTransport.java:661) > > at jcifs.smb.SmbSession.send(SmbSession.java:238) > > at jcifs.smb.SmbTree.send(SmbTree.java:119) > > at jcifs.smb.SmbFile.send(SmbFile.java:776) > > at > jcifs.smb.SmbFileInputStream.readDirect(SmbFileInputStream.java:181) > > at jcifs.smb.SmbFileInputStream.read(SmbFileInputStream.java:142) > > at > org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:903) > > at > org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) > > [zkCallback-11-thread-2] INFO > org.apache.solr.common.cloud.ConnectionManager - Connection with ZooKeeper > reestablished. > > [zkCallback-3-thread-4] INFO > org.apache.solr.common.cloud.ConnectionManager - Connection with ZooKeeper > reestablished. > > agents process ran out of memory - shutting down > > java.lang.OutOfMemoryError: GC overhead limit exceeded > > [zkCallback-11-thread-2] INFO > org.apache.solr.common.cloud.DefaultConnectionStrategy - Reconnected to > ZooKeeper > > [zkCallback-11-thread-2] INFO > org.apache.solr.common.cloud.ConnectionManager - Connected:true > > [zkCallback-3-thread-4] INFO > org.apache.solr.common.cloud.DefaultConnectionStrategy - Reconnected to > ZooKeeper > > [zkCallback-3-thread-4] INFO > org.apache.solr.common.cloud.ConnectionManager - Connected:true > > [Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: > 0x2000000b80d0046 closed > > [zkCallback-21-thread-2] WARN > org.apache.solr.common.cloud.ConnectionManager - Watcher > org.apache.solr.common.cloud.ConnectionManager@381a7557 name: > ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent > state:Disconnected type:None path:null path: null type: None > > [zkCallback-21-thread-2] WARN > org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected > > [Thread-7538-EventThread] INFO org.apache.zookeeper.ClientCnxn - > EventThread shut down for session: 0x2000000b80d0046 > > agents process ran out of memory - shutting down > > java.lang.OutOfMemoryError: GC overhead limit exceeded > > at java.util.regex.Matcher.<init>(Matcher.java:225) > > at java.util.regex.Pattern.matcher(Pattern.java:1093) > > at > de.l3s.boilerpipe.util.UnicodeTokenizer.tokenize(UnicodeTokenizer.java:40) > > at > de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(BoilerpipeHTMLContentHandler.java:296) > > at > de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(BoilerpipeHTMLContentHandler.java:198) > > at > org.apache.tika.parser.html.BoilerpipeContentHandler.characters(BoilerpipeContentHandler.java:155) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46) > > at > org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82) > > at > org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140) > > at > org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287) > > at > org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.xpath.MatchingContentHandler.characters(MatchingContentHandler.java:85) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146) > > at > org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46) > > at > org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82) > > at > org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140) > > at > org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287) > > at > org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279) > > at > org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:306) > > at > org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator$SheetTextAsHTML.cell(XSSFExcelExtractorDecorator.java:431) > > [zkCallback-19-thread-5] WARN > org.apache.solr.common.cloud.ConnectionManager - Watcher > org.apache.solr.common.cloud.ConnectionManager@43f7378f name: > ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent > state:Disconnected type:None path:null path: null type: None > > [zkCallback-19-thread-5] WARN > org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected > > [zkCallback-15-thread-2] WARN > org.apache.solr.common.cloud.ConnectionManager - Watcher > org.apache.solr.common.cloud.ConnectionManager@6432608f name: > ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent > state:Disconnected type:None path:null path: null type: None > > [zkCallback-15-thread-2] WARN > org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected > > [zkCallback-13-thread-3] WARN > org.apache.solr.common.cloud.ConnectionManager - Watcher > org.apache.solr.common.cloud.ConnectionManager@68bb3d74 name: > ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent > state:Disconnected type:None path:null path: null type: None > > [zkCallback-13-thread-3] WARN > org.apache.solr.common.cloud.ConnectionManager - zkClient has disconnected > > agents process ran out of memory - shutting down > > java.lang.OutOfMemoryError: GC overhead limit exceeded > > at sun.nio.cs.UTF_8.newEncoder(UTF_8.java:72) > > at java.lang.StringCoding.encode(StringCoding.java:348) > > at java.lang.String.getBytes(String.java:941) > > at org.postgresql.core.Utils.encodeUTF8(Utils.java:53) > > at > org.postgresql.core.v3.QueryExecutorImpl.sendParse(QueryExecutorImpl.java:1448) > > at > org.postgresql.core.v3.QueryExecutorImpl.sendOneQuery(QueryExecutorImpl.java:1777) > > at > org.postgresql.core.v3.QueryExecutorImpl.sendQuery(QueryExecutorImpl.java:1354) > > at > org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:292) > > at > org.postgresql.jdbc.PgStatement.executeInternal(PgStatement.java:428) > > at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:354) > > at > org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:301) > > at > org.postgresql.jdbc.PgStatement.executeCachedSql(PgStatement.java:287) > > at > org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:264) > > at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:260) > > at > org.apache.manifoldcf.core.database.Database.execute(Database.java:876) > > at > org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:696) > > [Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: > 0xff00000201970044 closed > > [Thread-31532-EventThread] INFO org.apache.zookeeper.ClientCnxn - > EventThread shut down for session: 0xff00000201970044 > > [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Opening socket connection to server > kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to > authenticate using SASL (unknown error) > > [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Socket connection established to > kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session > > [Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Session establishment complete on server > kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid = > 0x100000050ae004a, negotiated timeout = 40000 > > [Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: > 0x100000050ae004a closed > > [Thread-7574-EventThread] INFO org.apache.zookeeper.ClientCnxn - > EventThread shut down for session: 0x100000050ae004a > > [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Opening socket connection to server > kemp-formation-solr.citya.local/192.168.37.107:2181. Will not attempt to > authenticate using SASL (unknown error) > > [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Socket connection established to > kemp-formation-solr.citya.local/192.168.37.107:2181, initiating session > > [Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO > org.apache.zookeeper.ClientCnxn - Session establishment complete on server > kemp-formation-solr.citya.local/192.168.37.107:2181, sessionid = > 0x2000000b80d0047, negotiated timeout = 40000 > > [Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: > 0x2000000b80d0047 closed > > [Thread-7602-EventThread] INFO org.apache.zookeeper.ClientCnxn - > EventThread shut down for session: 0x2000000b80d0047 > > [Thread-490] INFO org.eclipse.jetty.server.handler.ContextHandler - > Stopped o.e.j.w.WebAppContext@44d52de2 > {/mcf-api-service,file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-5748290590258150821.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-api-service.war} > > [Thread-490] INFO org.eclipse.jetty.server.handler.ContextHandler - > Stopped > o.e.j.w.WebAppContext@60410cd{/mcf-authority-service,file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-1380683823589504600.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-authority-service.war} > > > > > > Any idea? > > Thanks. > > > > > > > > *De :* Karl Wright [mailto:daddy...@gmail.com] > *Envoyé :* mardi 24 juillet 2018 13:15 > *À :* user@manifoldcf.apache.org > *Objet :* Re: Out of memory, one file bug i think > > > > I've opened CONNECTORS-1516 to track the Class Not Found issue, and also > created an Apache POI bugzilla ticket, which is referenced. > > > > Karl > > > > > > On Tue, Jul 24, 2018 at 6:15 AM Karl Wright <daddy...@gmail.com> wrote: > > The "class not found" error looks probably like a classloader issue with > Tika -- the class is present in poi-ooxml-3.17.jar, although to be fair it > might possibly be caused by an out-of-memory condition. > > You should be able to find the exception in the Simple History and figure > out what document it came from from that. If not, then look at the log > prior to the exception, and look at what Worker Thread 1 was doing. > > > > Karl > > > > > > On Tue, Jul 24, 2018 at 5:58 AM msaunier <msaun...@citya.com> wrote: > > Re Karl, > > > > I have an Out of Memory Error today. I think I have an error with a > document. I have this WARNING before crash: > > > > ------------------------------------------------------------------------ > > > > WARN 2018-07-24T11:46:22,098 (Worker thread '1') - Tika: Tika exception > extracting: TIKA-198: Illegal IOException from > org.apache.tika.parser.microsoft.OfficeParser@62980adb > > org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException > from org.apache.tika.parser.microsoft.OfficeParser@62980adb > > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286) > ~[tika-core-1.17.jar:1.17] > > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > ~[tika-core-1.17.jar:1.17] > > at > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) > ~[tika-core-1.17.jar:1.17] > > at > org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74) > ~[mcf-tika-connector.jar:?] > > at > org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235) > [mcf-tika-connector.jar:?] > > at > org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226) > [mcf-agents.jar:?] > > at > org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077) > [mcf-agents.jar:?] > > at > org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708) > [mcf-agents.jar:?] > > at > org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756) > [mcf-agents.jar:?] > > at > org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583) > [mcf-pull-agent.jar:?] > > at > org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548) > [mcf-pull-agent.jar:?] > > at > org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:939) > [mcf-jcifs-connector.jar:?] > > at > org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) > [mcf-pull-agent.jar:?] > > Caused by: java.io.IOException: java.lang.ClassNotFoundException: > org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder > > at > org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:150) > ~[?:?] > > at > org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:102) > ~[?:?] > > at > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:203) > ~[?:?] > > at > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132) > ~[?:?] > > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > ~[?:?] > > ... 12 more > > Caused by: java.lang.ClassNotFoundException: > org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder > > at java.net.URLClassLoader.findClass(URLClassLoader.java:381) > ~[?:1.8.0_171] > > at java.lang.ClassLoader.loadClass(ClassLoader.java:424) > ~[?:1.8.0_171] > > at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349) > ~[?:1.8.0_171] > > at java.lang.ClassLoader.loadClass(ClassLoader.java:357) > ~[?:1.8.0_171] > > at > org.apache.poi.poifs.crypt.EncryptionInfo.getBuilder(EncryptionInfo.java:222) > ~[?:?] > > at > org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:148) > ~[?:?] > > at > org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:102) > ~[?:?] > > at > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:203) > ~[?:?] > > at > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132) > ~[?:?] > > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > ~[?:?] > > ... 12 more > > > > I think it’s a file, because RAM allocation have a weird behavior. In one > second, ManifoldCF (or Tika) allocate +6Go RAM. > > > > > > How Can I find the file? > > > > Thanks, > > Maxence, > >