[jira] Commented: (SOLR-1144) replication hang

2010-07-02 Thread Toby Cole (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12884644#action_12884644
 ] 

Toby Cole commented on SOLR-1144:
-

Just over a year since it was first spotted, I'm consistently getting the same 
symptoms as this bug.
We've got a single master, with two slaves polling it, both slaves have stalled 
at exactly the same point in the replication.

Here's the relevent section of the replication handler's 'details' response:
Node A
{code:xml}
  str name=numFilesDownloaded18/str
  str name=replicationStartTimeFri Jul 02 10:40:00 BST 2010/str
  str name=timeElapsed6683s/str
  str name=currentFile_9du.prx/str
  str name=currentFileSize8.17 MB/str
  str name=currentFileSizeDownloaded8.17 MB/str
  str name=currentFileSizePercent100.0/str
  str name=bytesDownloaded40.55 MB/str
  str name=totalPercent0.0/str
  str name=timeRemaining8290722s/str
  str name=downloadSpeed6.21 KB/str
{code}

Node B
{code:xml}
  str name=numFilesDownloaded18/str
  str name=replicationStartTimeFri Jul 02 10:40:00 BST 2010/str
  str name=timeElapsed6752s/str
  str name=currentFile_9du.prx/str
  str name=currentFileSize8.17 MB/str
  str name=currentFileSizeDownloaded8.17 MB/str
  str name=currentFileSizePercent100.0/str
  str name=bytesDownloaded40.55 MB/str
  str name=totalPercent0.0/str
  str name=timeRemaining8376322s/str
  str name=downloadSpeed6.15 KB/str
{code}

 replication hang
 

 Key: SOLR-1144
 URL: https://issues.apache.org/jira/browse/SOLR-1144
 Project: Solr
  Issue Type: Bug
Reporter: Yonik Seeley
Assignee: Noble Paul
 Fix For: 1.4


 It seems that replication can sometimes hang.
 http://www.lucidimagination.com/search/document/403305a3fda18599

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1144) replication hang

2010-07-02 Thread Toby Cole (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Toby Cole updated SOLR-1144:


Attachment: stacktrace-master.txt
stacktrace-slave-1.txt
stacktrace-slave-2.txt

Adding stacktraces for both slave instances and the master instance.

 replication hang
 

 Key: SOLR-1144
 URL: https://issues.apache.org/jira/browse/SOLR-1144
 Project: Solr
  Issue Type: Bug
Reporter: Yonik Seeley
Assignee: Noble Paul
 Fix For: 1.4

 Attachments: stacktrace-master.txt, stacktrace-slave-1.txt, 
 stacktrace-slave-2.txt


 It seems that replication can sometimes hang.
 http://www.lucidimagination.com/search/document/403305a3fda18599

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Issue Comment Edited: (SOLR-1144) replication hang

2010-07-02 Thread Toby Cole (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12884672#action_12884672
 ] 

Toby Cole edited comment on SOLR-1144 at 7/2/10 9:37 AM:
-

Adding stacktraces for both slave instances and the master instance.
These stack traces are from a reproduction of the original problem, so the 
timestamps will not matchup with the XML from the replication-handler 
previously posted.

  was (Author: tub):
Adding stacktraces for both slave instances and the master instance.
  
 replication hang
 

 Key: SOLR-1144
 URL: https://issues.apache.org/jira/browse/SOLR-1144
 Project: Solr
  Issue Type: Bug
Reporter: Yonik Seeley
Assignee: Noble Paul
 Fix For: 1.4

 Attachments: stacktrace-master.txt, stacktrace-slave-1.txt, 
 stacktrace-slave-2.txt


 It seems that replication can sometimes hang.
 http://www.lucidimagination.com/search/document/403305a3fda18599

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Commented: (SOLR-1144) replication hang

2010-07-02 Thread Toby Cole (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12884693#action_12884693
 ] 

Toby Cole commented on SOLR-1144:
-

Oh yes, should have mentioned... we're already on Solr 1.4.1 in production as 
of yesterday (we don't hang about y'know ;) ).

 replication hang
 

 Key: SOLR-1144
 URL: https://issues.apache.org/jira/browse/SOLR-1144
 Project: Solr
  Issue Type: Bug
Reporter: Yonik Seeley
Assignee: Noble Paul
 Fix For: 1.4

 Attachments: stacktrace-master.txt, stacktrace-slave-1.txt, 
 stacktrace-slave-2.txt


 It seems that replication can sometimes hang.
 http://www.lucidimagination.com/search/document/403305a3fda18599

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Commented: (SOLR-1144) replication hang

2010-07-02 Thread Toby Cole (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12884719#action_12884719
 ] 

Toby Cole commented on SOLR-1144:
-

I know exactly why the line numbers would be off. I just remembered we're using 
a custom war package so we can add our own plugins in (yes, I know we can use 
solr.home/lib, but we've not got round to that yet).

The only classes we're overriding from solr are ConcurrentLRUCache and 
FastLRUCache. This was from pre solr 1.4, when the cache implementations were 
slowing faceting right down.
I have a feeling if I remove those overridden classes and use the new 
(bug-free) ones, the hang may stop.

I'll give it a go now, sorry in advance if it was my oversight that is causing 
this bug to re-appear.
T

 replication hang
 

 Key: SOLR-1144
 URL: https://issues.apache.org/jira/browse/SOLR-1144
 Project: Solr
  Issue Type: Bug
Reporter: Yonik Seeley
Assignee: Noble Paul
 Fix For: 1.4

 Attachments: stacktrace-master.txt, stacktrace-slave-1.txt, 
 stacktrace-slave-2.txt


 It seems that replication can sometimes hang.
 http://www.lucidimagination.com/search/document/403305a3fda18599

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Commented: (SOLR-711) SimpleFacets: Performance Boost for Tokenized Fields for smaller DocSet using Term Vectors

2008-08-20 Thread Toby Cole (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12623950#action_12623950
 ] 

Toby Cole commented on SOLR-711:


We've seen this problem with our dataset, we have around 10m small records and 
were trying to facet on several multi-valued strings. Two of which had over 40k 
unique values (around 10 values per record).
If we can come up with a plan I don't mind volunteering to implement it.

 SimpleFacets: Performance Boost for Tokenized Fields for smaller DocSet using 
 Term Vectors
 --

 Key: SOLR-711
 URL: https://issues.apache.org/jira/browse/SOLR-711
 Project: Solr
  Issue Type: Improvement
  Components: search
Affects Versions: 1.3
Reporter: Fuad Efendi
 Fix For: 1.4

   Original Estimate: 1680h
  Remaining Estimate: 1680h

 From 
 [http://www.nabble.com/SimpleFacets%3A-Performance-Boost-for-Tokenized-Fields-td19033760.html]:
 Scenario:
 - 10,000,000 documents in the index; 
 - 5-10 terms per document; 
 - 200,000 unique terms for a tokenized field. 
 _Obviously calculating sizes of 200,000 intersections with FilterCache is 100 
 times slower than traversing 10 - 20,000 documents for smaller DocSets and 
 counting frequencies of Terms._
 Not applicable if size of DocSet is close to total number of unique tokens 
 (200,000 in our scenario).
 See   SimpleFacets.java:
 {code}
 public NamedList getFacetTermEnumCounts(
   SolrIndexSearcher searcher, 
   DocSet docs, ...
 {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.