[
https://issues.apache.org/jira/browse/CONNECTORS-576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509908#comment-13509908
]
David Morana commented on CONNECTORS-576:
-----------------------------------------
Here's the log; the errors just say it gave up...
WARN 2012-12-04 11:54:20,556 (Worker thread '1') - Service interruption
reported for job 1343845636068 connection 'LISA-DEV': Job no longer active
WARN 2012-12-04 12:55:40,689 (Worker thread '41') - Service interruption
reported for job 1343845636068 connection 'LISA-DEV': Error 500 from ingestion
request; ingestion will be retried again later
ERROR 2012-12-04 12:55:40,709 (Worker thread '41') - Exception tossed: Repeated
service interruptions - failure processing document: Ingestion HTTP error code
500
org.apache.manifoldcf.core.interfaces.ManifoldCFException: Repeated service
interruptions - failure processing document: Ingestion HTTP error code 500
at
org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:585)
Caused by: org.apache.manifoldcf.core.interfaces.ManifoldCFException: Ingestion
HTTP error code 500
at
org.apache.manifoldcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:1386)
WARN 2012-12-04 12:55:41,899 (Worker thread '18') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:42,299 (Worker thread '30') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:43,456 (Worker thread '27') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:43,877 (Worker thread '19') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:44,876 (Worker thread '0') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:45,266 (Worker thread '31') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:46,420 (Worker thread '7') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:46,467 (Worker thread '43') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:46,826 (Worker thread '2') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:47,840 (Worker thread '16') - Service interruption
reported for job 1343845636068 connection 'LISA-DEV': Job no longer active
WARN 2012-12-04 12:55:47,871 (Worker thread '17') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:47,902 (Worker thread '28') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:48,261 (Worker thread '9') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:48,682 (Worker thread '49') - Service interruption
reported for job 1343845636068 connection 'LISA-DEV': Job no longer active
WARN 2012-12-04 12:55:49,118 (Worker thread '12') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:49,148 (Worker thread '20') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:49,178 (Worker thread '48') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:49,478 (Worker thread '33') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:49,878 (Worker thread '46') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:50,398 (Worker thread '35') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:51,760 (Worker thread '11') - Pre-ingest service
interruption reported for job 1343845636068 connection 'LISA-DEV': Job no
longer active
WARN 2012-12-04 12:55:54,812 (Worker thread '47') - Service interruption
reported for job 1343845636068 connection 'LISA-DEV': Job no longer active
WARN 2012-12-04 12:55:55,402 (Worker thread '24') - Service interruption
reported for job 1343845636068 connection 'LISA-DEV': Job no longer active
WARN 2012-12-04 12:55:58,548 (Worker thread '29') - Service interruption
reported for job 1343845636068 connection 'LISA-DEV': Error 500 from ingestion
request; ingestion will be retried again later
WARN 2012-12-04 12:55:59,328 (Worker thread '8') - Service interruption
reported for job 1343845636068 connection 'LISA-DEV': Job no longer active
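
Most of the entries above are "Job no longer active" warnings, with only a few carrying the HTTP 500 from ingestion. To make that easier to see in a longer log, here is a minimal sketch of a tally script. It assumes each record starts with a level, a timestamp, and a thread name in parentheses, as in the excerpt above, and that any other line is a wrapped continuation or stack trace line. The names `parse_records`, `classify`, and `summarize` are made up for this example and are not part of ManifoldCF.

```python
import re
from collections import Counter

# Assumed record header: "<LEVEL> <date> <time>,<ms> (<thread>) - <text>".
# Lines that do not match are treated as continuations of the previous record.
RECORD_START = re.compile(
    r"^(WARN|ERROR|INFO|DEBUG)\s+"
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) "
    r"\((.+?)\) - (.*)$"
)


def parse_records(lines):
    """Rejoin hard-wrapped lines into (level, timestamp, thread, message) tuples."""
    records = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        match = RECORD_START.match(line)
        if match:
            records.append(list(match.groups()))
        elif records:
            # Continuation or stack trace line: append to the current message.
            records[-1][3] += " " + line
    return [tuple(record) for record in records]


def classify(message):
    """Bucket a message using phrases that appear in the log excerpt above."""
    if "Job no longer active" in message:
        return "job no longer active"
    if "Error 500" in message or "error code 500" in message:
        return "HTTP 500 from ingestion"
    return "other"


def summarize(records):
    """Count records per (level, category) pair."""
    return Counter((level, classify(message)) for level, _, _, message in records)


if __name__ == "__main__":
    import sys

    # Usage: python tally_log.py manifoldcf.log   (file name is an example)
    with open(sys.argv[1], encoding="utf-8", errors="replace") as handle:
        for (level, category), count in summarize(parse_records(handle)).most_common():
            print(f"{count:6d}  {level:5s}  {category}")
```

This only groups the messages that are already in the log. It does not recover the name or URL of the failing document, which is not present in these entries.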
> Manifold gets repeated service interruptions and stops
> ------------------------------------------------------
>
> Key: CONNECTORS-576
> URL: https://issues.apache.org/jira/browse/CONNECTORS-576
> Project: ManifoldCF
> Issue Type: Bug
> Affects Versions: ManifoldCF next
> Environment: Solr 4.0, ManifoldCF v1.1-dev on Windows 7
> Reporter: David Morana
> Fix For: ManifoldCF 1.1
>
>
> Manifold gets repeated service interruptions and stops.
> Is there a way to get more detailed error information, such as the
> name, URL, or location of the document it's having a problem with?
> In v.5.1 these errors would appear at the very end (the last 130 to 184
> documents) and then it would stop.
> The Solr logs always reported vague Tika errors.
> I'm unsure where the problem lies.
> Here's the ManifoldCF log:
> WARN 2012-12-04 10:27:40,722 (Worker thread '0') - Service interruption
> reported for job 1343845636068 connection 'LISA-DEV': Error 500 from
> ingestion request; ingestion will be retried again later
> ERROR 2012-12-04 10:27:40,754 (Worker thread '0') - Exception tossed:
> Repeated service interruptions - failure processing document: Ingestion HTTP
> error code 500
> org.apache.manifoldcf.core.interfaces.ManifoldCFException: Repeated service
> interruptions - failure processing document: Ingestion HTTP error code 500
> at
> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:585)
> Caused by: org.apache.manifoldcf.core.interfaces.ManifoldCFException:
> Ingestion HTTP error code 500
> at
> org.apache.manifoldcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:1386)
> WARN 2012-12-04 10:27:40,847 (Worker thread '24') - Service interruption
> reported for job 1343845636068 connection 'LISA-DEV': Job no longer active
> And here's the Solr log, in case it helps:
> org.apache.solr.common.SolrException:
> org.apache.tika.exception.TikaException: XML parse error at
> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:215)
> at
> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)
> at
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
> at org.apache.solr.core.SolrCore.execute(SolrCore.java:1561) at
> org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:442)
> at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:263)
> at
> org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1337)
> at
> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:484) at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:119)
> at
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:524)
> at