Thanks Karl. Can you let me know how to get the wire log from MCF in the form you need? I will share the logs with you once I have them.
Regards.

On Sun, Aug 24, 2014 at 2:38 AM, Karl Wright <[email protected]> wrote:

> Hi Lalit,
>
> This looks like a solr bug, but until there's a wire log I can look at
> from the manifoldcf side I can't prove it.
>
> Thanks,
>
> Karl
>
> Sent from my Windows Phone
> ------------------------------
> From: lalit jangra
> Sent: 8/23/2014 2:37 PM
> To: [email protected]
> Subject: Re: Getting java.io.IOException: missing CR Error
>
> Thanks Karl,
>
> I did some more investigation and found an "Invalid chunk header" error
> in the solr logs. I would say that a firewall or proxy setting is not the issue, as
> I am indexing SharePoint with the same solr as well and it's going well. Also I
> have set solr's multipartUploadLimitInKB to a high value of 204800000 KB as
> well.
>
> 974159 [http-bio-8080-exec-82] ERROR
> org.apache.solr.servlet.SolrDispatchFilter –
> null:org.apache.commons.fileupload.FileUploadBase$IOFileUploadException:
> Processing of multipart/form-data request failed. Invalid chunk header
>     at org.apache.commons.fileupload.FileUploadBase.parseRequest(FileUploadBase.java:367)
>     at org.apache.commons.fileupload.servlet.ServletFileUpload.parseRequest(ServletFileUpload.java:126)
>     at org.apache.solr.servlet.SolrRequestParsers$MultipartRequestParser.parseParamsAndFillStreams(SolrRequestParsers.java:547)
>     at org.apache.solr.servlet.SolrRequestParsers$StandardRequestParser.parseParamsAndFillStreams(SolrRequestParsers.java:681)
>     at org.apache.solr.servlet.SolrRequestParsers.parse(SolrRequestParsers.java:150)
>     at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:393)
>     at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:197)
>     at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:241)
>     at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
>     at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:220)
>     at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:122)
>     at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:170)
>     at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:98)
>     at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:950)
>     at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:116)
>     at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:408)
>     at org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Processor.java:1040)
>     at org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:607)
>     at org.apache.tomcat.util.net.JIoEndpoint$SocketProcessor.run(JIoEndpoint.java:315)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>     at java.lang.Thread.run(Thread.java:724)
> Caused by: java.io.IOException: Invalid chunk header
>     at org.apache.coyote.http11.filters.ChunkedInputFilter.doRead(ChunkedInputFilter.java:172)
>     at org.apache.coyote.http11.AbstractInputBuffer.doRead(AbstractInputBuffer.java:346)
>     at org.apache.coyote.Request.doRead(Request.java:422)
>     at org.apache.catalina.connector.InputBuffer.realReadBytes(InputBuffer.java:290)
>     at org.apache.tomcat.util.buf.ByteChunk.substract(ByteChunk.java:449)
>     at org.apache.catalina.connector.InputBuffer.read(InputBuffer.java:315)
>     at org.apache.catalina.connector.CoyoteInputStream.read(CoyoteInputStream.java:200)
>     at java.io.FilterInputStream.read(FilterInputStream.java:133)
>     at org.apache.commons.fileupload.util.LimitedInputStream.read(LimitedInputStream.java:125)
>     at org.apache.commons.fileupload.MultipartStream$ItemInputStream.makeAvailable(MultipartStream.java:977)
>     at org.apache.commons.fileupload.MultipartStream$ItemInputStream.read(MultipartStream.java:887)
>     at java.io.InputStream.read(InputStream.java:101)
>
> Please suggest.
>
>
> On Fri, Aug 22, 2014 at 5:25 PM, Karl Wright <[email protected]> wrote:
>
>> Hi,
>>
>> The error you are getting is because "Solr"'s response is not valid
>> HTTP. It is therefore likely that it is not solr itself that is the
>> problem but rather some firewall or proxy that is failing to allow stuff to
>> be posted through. Either that, or you have triggered some Solr error
>> condition (maybe due to files being too large?) and Solr is erroneously
>> responding with a non-HTTP response.
>>
>> The way to debug this is to turn on httpclient wire logging, and then you
>> will see the back-and-forth with solr that is the problem. You do this in
>> the ManifoldCF logging.ini file. Here is a description of httpcomponents
>> wire logging:
>>
>> https://hc.apache.org/httpcomponents-client-4.3.x/logging.html
>>
>> Karl
>>
>>
>> On Fri, Aug 22, 2014 at 6:49 AM, lalit jangra <[email protected]>
>> wrote:
>>
>>> Thanks,
>>>
>>> I checked everything, including replacing with a new solr instance, but
>>> this error still appears. Also, with the same solr, I am indexing SharePoint
>>> sites as well and it's working fine (solr seems to be fine).
>>>
>>> I am using CMIS 1.0 to connect to alfresco in MCF on a Linux box. For the
>>> same set of configurations on Windows, I am not getting any error for
>>> alfresco.
>>>
>>> Regards.
>>>
>>>
>>> On Thu, Aug 21, 2014 at 5:24 PM, Karl Wright <[email protected]> wrote:
>>>
>>>> Hi Lalit,
>>>>
>>>> Check your Solr instance. Something is going wrong talking to it.
>>>>
>>>> Karl
>>>>
>>>>
>>>> On Thu, Aug 21, 2014 at 7:52 AM, lalit jangra <[email protected]> wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> I am using MCF 1.5.1 & indexing Alfresco 4.2 using CMIS. It was
>>>>> working fine till now but suddenly the Alfresco job broke & I could see the
>>>>> error below in manifoldcf.log.
>>>>>
>>>>> WARN 2014-08-21 12:28:27,030 (Worker thread '184') - Service
>>>>> interruption reported for job 1408620030828 connection 'Alfresco': IO
>>>>> exception during indexing
>>>>> http://iwdc2devbld02:8080/alfresco/api/-default-/public/cmis/versions/1.0/atom/content/EPA-EPA%20Mission.pdf?id=37d1cc7e-e284-4466-ac2f-3d81dcaeb8a3%3B1.0:
>>>>> missing CR
>>>>>
>>>>> WARN 2014-08-21 12:28:28,202 (Worker thread '76') - IO exception
>>>>> during indexing
>>>>> http://iwdc2devbld02:8080/alfresco/api/-default-/public/cmis/versions/1.0/atom/content/TCurran-Strategies%20for%20Domestic%20Wastewater%20Treatment.pdf?id=64e0e3c1-7b9e-451a-b4d1-876fbe5a0b8e%3B1.0:
>>>>> missing CR
>>>>>
>>>>> java.io.IOException: missing CR
>>>>>     at sun.net.www.http.ChunkedInputStream.processRaw(ChunkedInputStream.java:405)
>>>>>     at sun.net.www.http.ChunkedInputStream.readAheadBlocking(ChunkedInputStream.java:572)
>>>>>     at sun.net.www.http.ChunkedInputStream.readAhead(ChunkedInputStream.java:609)
>>>>>     at sun.net.www.http.ChunkedInputStream.read(ChunkedInputStream.java:696)
>>>>>     at java.io.FilterInputStream.read(FilterInputStream.java:133)
>>>>>     at sun.net.www.protocol.http.HttpURLConnection$HttpInputStream.read(HttpURLConnection.java:3052)
>>>>>     at sun.net.www.protocol.http.HttpURLConnection$HttpInputStream.read(HttpURLConnection.java:3046)
>>>>>     at org.apache.http.entity.mime.content.InputStreamBody.writeTo(InputStreamBody.java:69)
>>>>>     at org.apache.manifoldcf.agents.output.solr.ModifiedHttpMultipart.doWriteTo(ModifiedHttpMultipart.java:211)
>>>>>     at org.apache.manifoldcf.agents.output.solr.ModifiedHttpMultipart.writeTo(ModifiedHttpMultipart.java:229)
>>>>>     at org.apache.manifoldcf.agents.output.solr.ModifiedMultipartEntity.writeTo(ModifiedMultipartEntity.java:186)
>>>>>     at org.apache.http.entity.HttpEntityWrapper.writeTo(HttpEntityWrapper.java:98)
>>>>>     at org.apache.http.impl.client.EntityEnclosingRequestWrapper$EntityWrapper.writeTo(EntityEnclosingRequestWrapper.java:108)
>>>>>     at org.apache.http.impl.entity.EntitySerializer.serialize(EntitySerializer.java:122)
>>>>>     at org.apache.http.impl.AbstractHttpClientConnection.sendRequestEntity(AbstractHttpClientConnection.java:271)
>>>>>     at org.apache.http.impl.conn.ManagedClientConnectionImpl.sendRequestEntity(ManagedClientConnectionImpl.java:197)
>>>>>     at org.apache.http.protocol.HttpRequestExecutor.doSendRequest(HttpRequestExecutor.java:257)
>>>>>     at org.apache.http.protocol.HttpRequestExecutor.execute(HttpRequestExecutor.java:125)
>>>>>     at org.apache.http.impl.client.DefaultRequestDirector.tryExecute(DefaultRequestDirector.java:715)
>>>>>     at org.apache.http.impl.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:520)
>>>>>     at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:906)
>>>>>     at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:805)
>>>>>     at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:784)
>>>>>     at org.apache.manifoldcf.agents.output.solr.ModifiedHttpSolrServer.request(ModifiedHttpSolrServer.java:291)
>>>>>     at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:197)
>>>>>     at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:117)
>>>>>     at org.apache.manifoldcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:923)
>>>>>
>>>>> WARN 2014-08-21 12:28:28,227 (Worker thread '76') - Service
>>>>> interruption reported for job 1408620030828 connection 'Alfresco': IO
>>>>> exception during indexing
>>>>> http://iwdc2devbld02:8080/alfresco/api/-default-/public/cmis/versions/1.0/atom/content/TCurran-Strategies%20for%20Domestic%20Wastewater%20Treatment.pdf?id=64e0e3c1-7b9e-451a-b4d1-876fbe5a0b8e%3B1.0:
>>>>> missing CR
>>>>>
>>>>> Please help.
>>>>>
>>>>> Regards,
>>>>> Lalit.
>>>
>>> --
>>> Regards,
>>> Lalit.
>
> --
> Regards,
> Lalit.

--
Regards,
Lalit.
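
For readers following the thread: both messages seen here, "missing CR" in manifoldcf.log and "Invalid chunk header" in the Solr log, are framing errors in HTTP chunked transfer encoding, where each chunk is a hexadecimal size line ending in CRLF, then that many bytes of data, then another CRLF. The Python sketch below only illustrates that framing. `decode_chunked` is a hypothetical helper written for this note, not the JDK's ChunkedInputStream or Tomcat's ChunkedInputFilter, and its error strings merely imitate the ones in the logs.

```python
import string


def decode_chunked(raw: bytes) -> bytes:
    """Decode an HTTP/1.1 chunked body held fully in memory.

    Hypothetical illustration only: raises ValueError on the two kinds of
    framing problem discussed in the thread (bad size line, missing CRLF).
    """
    body = bytearray()
    pos = 0
    while True:
        # A chunk starts with a hex size line terminated by CRLF.
        eol = raw.find(b"\r\n", pos)
        if eol == -1:
            raise ValueError("invalid chunk header: no CRLF after size")
        # Drop optional chunk extensions (";name=value") before parsing.
        size_text = raw[pos:eol].split(b";", 1)[0].strip().decode("ascii", "replace")
        if not size_text or any(c not in string.hexdigits for c in size_text):
            raise ValueError(f"invalid chunk header: {size_text!r}")
        size = int(size_text, 16)
        pos = eol + 2
        if size == 0:
            # Zero-size chunk ends the body; trailers are ignored here.
            return bytes(body)
        data = raw[pos:pos + size]
        if len(data) < size:
            raise ValueError("truncated chunk data")
        body += data
        pos += size
        # The chunk data itself must be followed by CRLF.
        if raw[pos:pos + 2] != b"\r\n":
            raise ValueError("missing CR after chunk data")
        pos += 2
```

A stream that has been cut short or altered in transit fails one of these checks, which is why a wire log of the raw bytes is the useful next piece of evidence.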
