[ https://issues.apache.org/jira/browse/CONNECTORS-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
David Morana updated CONNECTORS-608:
------------------------------------

    Attachment: mcf-jetty-error.txt

I built the latest from trunk this morning.
Shortly after ManifoldCF (or the job) starts, I get a Jetty error in the cmd window (see attached). I had already started the job, so I don't know exactly when this happened, but I'm pretty sure it's near the beginning.
The log shows the connection aborting, but the UI still says the job is running and only later reports the error. It's still aborting after about 7000 records have been ingested.
ManifoldCF is running very fast whereas Solr is running very slow, and I guess it just gets overwhelmed.

First I get some warnings:

WARN 2013-01-16 12:16:54,938 (Worker thread '35') - Service interruption reported for job 1355935371571 connection 'RSS IBM Profiles ': Socket timeout exception during indexing: Software caused connection abort: recv failed
WARN 2013-01-16 12:16:54,948 (Worker thread '29') - Service interruption reported for job 1355935371571 connection 'RSS IBM Profiles ': Socket timeout exception during indexing: Software caused connection abort: socket write error

After a few of these I finally get the error:

ERROR 2013-01-16 12:17:47,234 (Worker thread '42') - Exception tossed: Repeated service interruptions - failure processing document: The target server failed to respond
org.apache.manifoldcf.core.interfaces.ManifoldCFException: Repeated service interruptions - failure processing document: The target server failed to respond
	at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:585)
Caused by: org.apache.http.NoHttpResponseException: The target server failed to respond
	at org.apache.http.impl.conn.DefaultHttpResponseParser.parseHead(DefaultHttpResponseParser.java:95)
	at org.apache.http.impl.conn.DefaultHttpResponseParser.parseHead(DefaultHttpResponseParser.java:62)
	at org.apache.http.impl.io.AbstractMessageParser.parse(AbstractMessageParser.java:254)
	at org.apache.http.impl.AbstractHttpClientConnection.receiveResponseHeader(AbstractHttpClientConnection.java:289)
	at org.apache.http.impl.conn.DefaultClientConnection.receiveResponseHeader(DefaultClientConnection.java:252)
	at org.apache.http.impl.conn.ManagedClientConnectionImpl.receiveResponseHeader(ManagedClientConnectionImpl.java:191)
	at org.apache.http.protocol.HttpRequestExecutor.doReceiveResponse(HttpRequestExecutor.java:300)
	at org.apache.http.protocol.HttpRequestExecutor.execute(HttpRequestExecutor.java:127)
	at org.apache.http.impl.client.DefaultRequestDirector.tryExecute(DefaultRequestDirector.java:717)
	at org.apache.http.impl.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:522)
	at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:906)
	at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:805)
	at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:784)
	at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:352)
	at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:181)
	at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:117)
	at org.apache.manifoldcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:797)

> Solr connector gets socket timeouts on slow documents
> -----------------------------------------------------
>
>                 Key: CONNECTORS-608
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-608
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Lucene/SOLR connector
>    Affects Versions: ManifoldCF 1.1
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 1.1
>
>         Attachments: mcf-jetty-error.txt
>
>
> The Solr connector fails on some documents with the following exception.
> {code}
> ERROR 2013-01-11 11:13:59,372 (Worker thread '36') - Exception tossed: Repeated service interruptions - failure processing document: Software caused connection abort: recv failed
> org.apache.manifoldcf.core.interfaces.ManifoldCFException: Repeated service interruptions - failure processing document: Software caused connection abort: recv failed
> 	at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:585)
> Caused by: java.net.SocketException: Software caused connection abort: recv failed
> 	at java.net.SocketInputStream.socketRead0(Native Method)
> 	at java.net.SocketInputStream.read(Unknown Source)
> 	at java.net.SocketInputStream.read(Unknown Source)
> 	at org.apache.http.impl.io.AbstractSessionInputBuffer.fillBuffer(AbstractSessionInputBuffer.java:166)
> 	at org.apache.http.impl.io.SocketInputBuffer.fillBuffer(SocketInputBuffer.java:90)
> 	at org.apache.http.impl.io.AbstractSessionInputBuffer.readLine(AbstractSessionInputBuffer.java:281)
> 	at org.apache.http.impl.conn.DefaultHttpResponseParser.parseHead(DefaultHttpResponseParser.java:92)
> 	at org.apache.http.impl.conn.DefaultHttpResponseParser.parseHead(DefaultHttpResponseParser.java:61)
> 	at org.apache.http.impl.io.AbstractMessageParser.parse(AbstractMessageParser.java:254)
> 	at org.apache.http.impl.AbstractHttpClientConnection.receiveResponseHeader(AbstractHttpClientConnection.java:289)
> 	at org.apache.http.impl.conn.DefaultClientConnection.receiveResponseHeader(DefaultClientConnection.java:252)
> 	at org.apache.http.impl.conn.ManagedClientConnectionImpl.receiveResponseHeader(ManagedClientConnectionImpl.java:191)
> 	at org.apache.http.protocol.HttpRequestExecutor.doReceiveResponse(HttpRequestExecutor.java:300)
> 	at org.apache.http.protocol.HttpRequestExecutor.execute(HttpRequestExecutor.java:127)
> 	at org.apache.http.impl.client.DefaultRequestDirector.tryExecute(DefaultRequestDirector.java:716)
> 	at org.apache.http.impl.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:521)
> 	at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:906)
> 	at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:805)
> 	at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:784)
> 	at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:352)
> 	at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:181)
> 	at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:117)
> 	at org.apache.manifoldcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:742)
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira