Segments count increased to around 7200, index remains unoptimized

2013-07-13 Thread A Geek
Hi All, I'm running SOLR 4.0 on a Linux machine with around 30GB RAM. I've 2 cores running under solr as belowCore AA: around 30 GB data , segments count = 30Core BB: around 216 GB data, segments count=300 Solr is running through jetty and I've allocated max of 12GB heap memory through java

SolrCloud leader

2013-07-13 Thread kowish.adamosh
Hi guys, Is it possible to determine (by the server node itself) in SolrCloud if the node is the leader? One possible solution is to use CloudSolrServer and fetch leader address but how can I check address of the current node (I can see it in configuration but how to get configuration

How to query multiple words correctly

2013-07-13 Thread Stavros Delisavas
Hello Solr-Community, I am having some strange behavior that I don't understand. I hope you can help. I try to query/search for two words. For example (*foo* AND *bar*) What I want is to get all entries that contain the string foo AND contain the word bar. What I get is all entries that

Re: Segments count increased to around 7200, index remains unoptimized

2013-07-13 Thread Otis Gospodnetic
Hi, Index size growing during optimize and then shrinking is expected, so if you are not seeing growth with BB, maybe you are not really optimizing it. Start by looking at the logs. Any errors/exceptions there? Otis -- Solr ElasticSearch Support -- http://sematext.com/ Performance Monitoring

Re: How to query multiple words correctly

2013-07-13 Thread Otis Gospodnetic
Hi, Does the same happen if you use +*foo* +*bar* syntax? If such queries turn out to be too slow, consider indexing ngrams. Otis -- Solr ElasticSearch Support -- http://sematext.com/ Performance Monitoring -- http://sematext.com/spm On Sat, Jul 13, 2013 at 5:40 AM, Stavros Delisavas

Re: Custom processing in Solr Request Handler plugin and its debugging ?

2013-07-13 Thread Tony Mullins
Please any help on how to pass the search request to different RequestHandler from within the custom RequestHandler and how to debug the custom RequestHandler plugin ? Thanks, Tony On Fri, Jul 12, 2013 at 4:41 PM, Tony Mullins tonymullins...@gmail.comwrote: Hi, I have defined my new Solr

Re: java.lang.OutOfMemoryError: Requested array size exceeds VM limit

2013-07-13 Thread Stephane Gamard
Hi Ali,You are getting an error because of the number of rows you are trying to fetch. Solr will keep all results in its queue before submitting the results. Solution is to page thru your results (but be careful about deep paging)._Stephane On July 13, 2013 at 2:31:57 AM, Ali, Saqib

Re: SolrCloud leader

2013-07-13 Thread Jack Krupansky
Why do you think you need to know which node is the leader? Answer: You don't - SolrClound is NOT a master/slave architecture; there is no master node. Each shard has its own leader, which can vary over time. It's called leader election. But, there is no cosmic significance to being a leader

Re: How to query multiple words correctly

2013-07-13 Thread Stavros Delisavas
Thank you, problem solved! On 13.07.2013 12:16, Otis Gospodnetic wrote: Hi, Does the same happen if you use +*foo* +*bar* syntax? If such queries turn out to be too slow, consider indexing ngrams. Otis -- Solr ElasticSearch Support -- http://sematext.com/ Performance Monitoring --

Re: SolrCloud leader

2013-07-13 Thread Radim Kolar
Since SolrCloud is a master-free architecture, you can send both queries and updates to ANY node and SolrCloud will assure that the data gets to where it belongs its way faster to send them to right node

Re: SolrCloud leader

2013-07-13 Thread kowish.adamosh
Ya :-) I need it for different reason. I have 8 server nodes and one of them has to invoke scheduled data import (by invoking CloudSolrServer)... And that's why I need to identify exclusive server (it doesn't have to be leader but I think it is the simplest solution). I don't want to invoke

Re: SolrCloud leader

2013-07-13 Thread Jack Krupansky
The point is that there is no right node. The concept of a right node does not exist in SolrCloud. All of the nodes of a shard are equally valid for both query and update requests. Be careful not to fall into the trap of premature optimization, which is usually a false optimization and that

Re: SolrCloud leader

2013-07-13 Thread Jack Krupansky
Thank you for highlighting the actual issue that needs to be addressed. At this point, I don't know if there is any best answer as to where in the cluster aggregate operations like Data Import Handler and Solr Cell should be run. My own inclination is that both of them need to be pulled out of

Re: About Suggestions

2013-07-13 Thread Erick Erickson
Not quite sure what you mean here, a couple of examples would help. But since the term is using keyword tokenizer, then each thing you get back is a complete term, by definition. So I'm not quite sure what you're asking here. Best Erick On Fri, Jul 12, 2013 at 4:48 AM, Lochschmied,

Re: Search with punctuations

2013-07-13 Thread Erick Erickson
At a first glance, you're replacing the apostrophe with a space so INT'L becomes INT L, two separate tokens. Why not replace with ? I.e. remove the apostrophe? I also suspect you actually want WhitespaceTokensizerFactory, KeywordTokenizerFactory will cause my dog has fleas to be indexed exactly

Re: SolrCloud group.query error shard X did not set sort field values or how i can set fillFields=true on IndexSearcher.search

2013-07-13 Thread Erick Erickson
Wait, you can't do this. You're changing the number of shards? Your original startup specified a single shard, bringing up another node and calling it shard2 isn't consistent. If you'd brought up a single shard while telling SolrCloud that there were 2 shards, you shouldn't have been able to

Re: preferred container for running SolrCloud

2013-07-13 Thread Tim Vaillancourt
We run Jetty 8 and 9 with Solr. No issues I can think of. We use Jetty interally anyways, and it seemed to be the most common container out there for Solr (from reading this mailinglist, articles, etc), so that made me feel a bit better if I needed advice or help from the community - not to

Re: preferred container for running SolrCloud

2013-07-13 Thread Furkan KAMACI
Of course you may have some reasons to use Tomcat or anything else (i.e. your stuff may have more experience at Tomcat etc.) However developers generally runs Jetty because it is default for Solr and I should point that Solr unit tests run against jetty (in fact, a specific version of Jetty) and

Re: preferred container for running SolrCloud

2013-07-13 Thread Tim Vaillancourt
Very good point, Furkan. The unit tests being ran against Jetty is another very good reason to feel safer on Jetty, IMHO. I'm assuming the SolrCloud ChaosMonkey tests are ran against Jetty as well? Tim On 13/07/13 02:46 PM, Furkan KAMACI wrote: Of course you may have some reasons to use

Re: preferred container for running SolrCloud

2013-07-13 Thread Shawn Heisey
On 7/13/2013 3:55 PM, Tim Vaillancourt wrote: Very good point, Furkan. The unit tests being ran against Jetty is another very good reason to feel safer on Jetty, IMHO. I'm assuming the SolrCloud ChaosMonkey tests are ran against Jetty as well? All Solr unit tests either access the