[ 
https://issues.apache.org/jira/browse/CONNECTORS-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13556174#comment-13556174
 ] 

David Morana commented on CONNECTORS-608:
-----------------------------------------

Thanks Karl,
        I'll forward this question to the solr community, but I would like your 
opinion as well.
        So, should I separate all the cores on separate servers (or separate 
websites)? Or should I have separate instances of Manifold?
        Currently, I'm going to try to move everything off of my laptop onto a 
true 
DEV server environment and see if there's any improvement.
        Is there anyone out there running Manifold and Solr that I can talk to?

        And the link that was stuck for a while wasn't a profile it was just a 
long 
xml file that was unlike any of the other records in the feed.
        Here's an excerpt; upon closer inspection: It appears to be a link in 
each 
record that refers to itself but it only works in the context of the IBM 
portal. Whatever it is, Manifold didn't like it.
        
<pt:type><pt:parentId>snx:person</pt:parentId><pt:id>default</pt:id><pt:property><pt:ref>profileType</pt:ref><pt:updatability>read</pt:updatability><pt:hidden>false</pt:hidden></pt:property><pt:property><pt:ref>managerUid</pt:ref><pt:updatability>read</pt:updatability><pt:hidden>false</pt:hidden></pt:property><pt:property><pt:ref>isManager</pt:ref><pt:updatability>read</pt:updatability><pt:hidden>false</pt:hidden></pt:property><pt:property><pt:ref>loginId</pt:ref><pt:updatability>read</pt:updatability><pt:hidden>false</pt:hidden></pt:property><pt:property><pt:ref>userState</pt:ref><pt:updatability>read</pt:updatability><pt:hidden>false</pt:hidden></pt:property><pt:property><pt:ref>userid</pt:ref><pt:updatability>read</pt:updatability><pt:hidden>false</pt:hidden></pt:property>...
 
etc...

Thanks,
David


                
> Solr connector gets socket timeouts on slow documents
> -----------------------------------------------------
>
>                 Key: CONNECTORS-608
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-608
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Lucene/SOLR connector
>    Affects Versions: ManifoldCF 1.1
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 1.1
>
>         Attachments: mcf-jetty-error.txt
>
>
> The Solr connector fails on some documents with the following exception.
> {code}
>                 ERROR 2013-01-11 11:13:59,372 (Worker thread '36') - 
> Exception tossed: Repeated service interruptions - failure processing 
> document: Software caused connection abort: recv failed
> org.apache.manifoldcf.core.interfaces.ManifoldCFException: Repeated service 
> interruptions - failure processing document: Software caused connection 
> abort: recv failed
>                 at 
> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:585)
> Caused by: java.net.SocketException: Software caused connection abort: recv 
> failed
>                 at java.net.SocketInputStream.socketRead0(Native Method)
>                 at java.net.SocketInputStream.read(Unknown Source)
>                 at java.net.SocketInputStream.read(Unknown Source)
>                 at 
> org.apache.http.impl.io.AbstractSessionInputBuffer.fillBuffer(AbstractSessionInputBuffer.java:166)
>                 at 
> org.apache.http.impl.io.SocketInputBuffer.fillBuffer(SocketInputBuffer.java:90)
>                 at 
> org.apache.http.impl.io.AbstractSessionInputBuffer.readLine(AbstractSessionInputBuffer.java:281)
>                 at 
> org.apache.http.impl.conn.DefaultHttpResponseParser.parseHead(DefaultHttpResponseParser.java:92)
>                 at 
> org.apache.http.impl.conn.DefaultHttpResponseParser.parseHead(DefaultHttpResponseParser.java:61)
>                 at 
> org.apache.http.impl.io.AbstractMessageParser.parse(AbstractMessageParser.java:254)
>                 at 
> org.apache.http.impl.AbstractHttpClientConnection.receiveResponseHeader(AbstractHttpClientConnection.java:289)
>                 at 
> org.apache.http.impl.conn.DefaultClientConnection.receiveResponseHeader(DefaultClientConnection.java:252)
>                 at 
> org.apache.http.impl.conn.ManagedClientConnectionImpl.receiveResponseHeader(ManagedClientConnectionImpl.java:191)
>                 at 
> org.apache.http.protocol.HttpRequestExecutor.doReceiveResponse(HttpRequestExecutor.java:300)
>                 at 
> org.apache.http.protocol.HttpRequestExecutor.execute(HttpRequestExecutor.java:127)
>                 at 
> org.apache.http.impl.client.DefaultRequestDirector.tryExecute(DefaultRequestDirector.java:716)
>                 at 
> org.apache.http.impl.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:521)
>                 at 
> org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:906)
>                 at 
> org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:805)
>                 at 
> org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:784)
>                 at 
> org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:352)
>                 at 
> org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:181)
>                 at 
> org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:117)
>                 at 
> org.apache.manifoldcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:742)
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to