Hello list,
I'm getting the following error when the web cralwer is trying to post
documents to Solr 4: IO exception during indexing: null. This happens
for all indexing attempts and just ends in the following:
--8<--
WARN 2013-03-01 19:59:51,360 (Worker thread '0') - Service
interruption reported for job 1362070726596 connection 'Web crawler': IO
exception during indexing: null
ERROR 2013-03-01 19:59:51,378 (Worker thread '0') - Exception tossed:
Repeated service interruptions - failure processing document: null
org.apache.manifoldcf.core.interfaces.ManifoldCFException: Repeated
service interruptions - failure processing document: null
at
org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:604)
Caused by: org.apache.http.client.ClientProtocolException
at
org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:909)
at
org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:805)
at
org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:784)
at
org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:353)
at
org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:181)
at
org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:117)
at
org.apache.manifoldcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:833)
Caused by: org.apache.http.client.NonRepeatableRequestException: Cannot
retry request with a non-repeatable request entity.
at
org.apache.http.impl.client.DefaultRequestDirector.tryExecute(DefaultRequestDirector.java:695)
at
org.apache.http.impl.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:522)
--8<--
I'm running version 1.1.1 of MCF deployed on Resin. This does not happen
on our test server which is equally configured as our prod server,
except for some security restrictions. Basic auth is configured for both
reading and writing on the Solr server.
I *did* got the same error the first time I deployed version 1.1.1 of
MCF on our test server, but it went away after I added the Solr core
name in the core/collection name field. On our production server I *do*
have the core named configured, so now I need help in order to figure
out what's going on.
The NonRepeatableRequestException is perhaps caused by a
misconfiguration of HttpClient 4, but I'm not sure this is the root of
the problem I'm facing here. It might be due to the basic auth
restriction which is configured. Anyway, this was not a problem for
previous versions of MCF.
Erlend
--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050