Re: Debugging update request

2013-10-18 Thread Erick Erickson
@Michael:

Yep, that's the bit that's addressed by the two patches I referenced. If
you can try this with 4.5 (or the soon to be done 4.5.1), the problem
should go away.

@Chris:

I think you have a different issue. A very quick glance at your stack trace
doesn't really show anything outstanding. There are always a bunch of
threads waiting around for something to do that show up as blocked. So
I'm pretty puzzled. Are your Solr logs showing anything when you try to
update after this occurs?


On Wed, Oct 16, 2013 at 11:32 AM, Chris Geeringh geeri...@gmail.com wrote:

 Here is my jstack output... Lots of blocked threads.

 http://pastebin.com/1ktjBYbf


 On 16 October 2013 10:28, michael.boom my_sky...@yahoo.com wrote:

  I got the trace from jstack.
  I found references to semaphore but not sure if this is what you meant.
  Here's the trace:
  http://pastebin.com/15QKAz7U
 
 
 
  --
  View this message in context:
 
 http://lucene.472066.n3.nabble.com/Debugging-update-request-tp4095619p4095847.html
  Sent from the Solr - User mailing list archive at Nabble.com.
 



Re: Debugging update request

2013-10-16 Thread michael.boom
Thanks Erick!

The version is 4.4.0.

I'm posting 100k docs batches every 30-40 sec from each indexing client and
sometimes two or more clients post in a very small timeframe. That's when i
think the deadlock happens.

I'll try to replicate the problem and check the thread dump.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Debugging-update-request-tp4095619p4095821.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Debugging update request

2013-10-16 Thread Chris Geeringh
I ran an import last night, and this morning my cloud wouldn't accept
updates. I'm running the latest 4.6 snapshot. I was importing with latest
solrj snapshot, and using java bin transport with CloudSolrServer.

The cluster had indexed ~1.3 million docs before no further updates were
accepted, querying still working.

I'll run jstack shortly and provide the results.

On Wednesday, October 16, 2013, michael.boom wrote:

 Thanks Erick!

 The version is 4.4.0.

 I'm posting 100k docs batches every 30-40 sec from each indexing client and
 sometimes two or more clients post in a very small timeframe. That's when i
 think the deadlock happens.

 I'll try to replicate the problem and check the thread dump.



 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Debugging-update-request-tp4095619p4095821.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Re: Debugging update request

2013-10-16 Thread michael.boom
I got the trace from jstack.
I found references to semaphore but not sure if this is what you meant.
Here's the trace:
http://pastebin.com/15QKAz7U



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Debugging-update-request-tp4095619p4095847.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Debugging update request

2013-10-16 Thread Chris Geeringh
Here is my jstack output... Lots of blocked threads.

http://pastebin.com/1ktjBYbf


On 16 October 2013 10:28, michael.boom my_sky...@yahoo.com wrote:

 I got the trace from jstack.
 I found references to semaphore but not sure if this is what you meant.
 Here's the trace:
 http://pastebin.com/15QKAz7U



 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Debugging-update-request-tp4095619p4095847.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Debugging update request

2013-10-15 Thread maephisto
As a followup to another thread, where I described how my SolrCloud sometimes
just stops accepting updates 
http://lucene.472066.n3.nabble.com/Concurent-indexing-td4095409.html

I have a question, is there a way to debug or analyze the update request?
Verbose output or anything else?
It happens for me that when in the above situation, I'm using the post.sh
tool to post 1 doc and i get no feedback, it just hangs and waits.
10x!



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Debugging-update-request-tp4095619.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Debugging update request

2013-10-15 Thread Erick Erickson
What version of Solr? There's a known issue
with distributed deadlock (since fixed) where
under heavy load from multiple clients to
multiple Solr nodes, you can get into a situation where
it will stop accepting updates, but _will_ continue
to serve queries.

If this describes your situation, dump the stack (jstack works)
and look for the word semaphore in the stack traces. If
Solr is waiting on that, that's your problem.

SOLR-4816 (4.5) should alleviate this if you're indexing
from SolrJ, and SOLR-5216 should fix it on the server.

Best,
Erick


On Tue, Oct 15, 2013 at 10:54 AM, maephisto my_sky...@yahoo.com wrote:

 As a followup to another thread, where I described how my SolrCloud
 sometimes
 just stops accepting updates
 http://lucene.472066.n3.nabble.com/Concurent-indexing-td4095409.html

 I have a question, is there a way to debug or analyze the update request?
 Verbose output or anything else?
 It happens for me that when in the above situation, I'm using the post.sh
 tool to post 1 doc and i get no feedback, it just hangs and waits.
 10x!



 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Debugging-update-request-tp4095619.html
 Sent from the Solr - User mailing list archive at Nabble.com.