Re: 5.5.3: fieldValueCache auto-warming error

2016-11-11 Thread Bram Van Dam
On 10/11/16 17:10, Erick Erickson wrote: > Just facet on the text field yourself ;) Wish I could, this is on premise over at a client, access is difficult and their response time is pretty bad on public holidays and weekends. So I'm basically twiddling my thumbs while waiting to get more log f

Re: Keeping faster and slower solr slaves alined with the same index version

2016-11-11 Thread Will Martin
Csongor: If session locking is new to you, here is a comprehensive explanation of the "Active-Active multi-region" scenario you're encountering and how Netflix resolves the matter. Although I remain confused by a 15 minute network transfer of non-optimized segments; or even if you are replic

Re: Wildcard searches with space in TextField/StrField

2016-11-11 Thread Erick Erickson
You have to query text and string fields differently, that's just the way it works. The problem is getting the query string through the parser as a _single_ token or as multiple tokens. Let's say you have a string field with the "a b" example. You have a single token a b that starts at offset 0.
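A minimal sketch of the string-field case, assuming the usual approach of backslash-escaping the space so the query parser sees "a b" as one token before the trailing wildcard. The helper names (`escape_term`, `prefix_query`) and the field name `name_s` are hypothetical, and the escaped character set is an assumption based on the standard Lucene query syntax, not something taken from this message:

```python
import re

# Characters treated as special by the classic Lucene query syntax, plus
# whitespace. This set is an assumption for illustration purposes.
_SPECIAL = re.compile(r'([+\-!(){}\[\]^"~*?:\\/&|\s])')


def escape_term(text):
    """Backslash-escape special characters and whitespace in a raw term."""
    return _SPECIAL.sub(r'\\\1', text)


def prefix_query(field, prefix):
    """Build a prefix (trailing wildcard) query for a string field.

    The space is escaped so the whole prefix reaches the field as one token.
    """
    return f"{field}:{escape_term(prefix)}*"


if __name__ == "__main__":
    # Hypothetical string field holding the single token "a b".
    print(prefix_query("name_s", "a b"))
```

For a tokenized text field the same escaping would not help, because the indexed tokens are "a" and "b" separately; that case needs a different query shape.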

Re: Sorl shards: very sensitive to swap space usage !?

2016-11-11 Thread Shawn Heisey
On 11/11/2016 6:46 AM, Toke Eskildsen wrote: > but on two occasions I have > experienced heavy swapping with multiple gigabytes free for disk > cache. In both cases, the cache-to-index size was fairly low (let's > say < 10%). My guess (I don't know the intrinsics of memory mapping > vs. swapping) i

Re: Sorl shards: very sensitive to swap space usage !?

2016-11-11 Thread Toke Eskildsen
On Thu, 2016-11-10 at 16:42 -0700, Shawn Heisey wrote: > If the machine that Solr is installed on is using swap, that means > you're having serious problems, and your performance will be > TERRIBLE.  Agreed so far. > This kind of problem cannot be caused by Solr if it is properly > configured for

Re: Is there a way to tell if multivalued field actually contains multiple values?

2016-11-11 Thread Michael McCandless
I think you can use the term stats that Lucene tracks for each field. Compare Terms.getSumTotalTermFreq and Terms.getDocCount. If they are equal, it means every document that had this field had only one token. Mike McCandless http://blog.mikemccandless.com On Fri, Nov 11, 2016 at 5:50 AM, Mik
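The comparison can be illustrated with a toy model. This is a pure-Python sketch, not the Lucene API: `field_stats` and `looks_single_valued` are hypothetical helpers that mimic what the two statistics count (total tokens in the field across all documents, and number of documents with at least one token in the field), assuming an untokenized field where each value produces exactly one token:

```python
def field_stats(docs, field):
    """Return (sum_total_term_freq, doc_count) for a field in a toy corpus.

    docs is a list of dicts mapping field names to lists of tokens.
    """
    sum_total_term_freq = 0
    doc_count = 0
    for doc in docs:
        tokens = doc.get(field, [])
        if tokens:
            doc_count += 1                      # document has the field
            sum_total_term_freq += len(tokens)  # every token occurrence counts
    return sum_total_term_freq, doc_count


def looks_single_valued(docs, field):
    """True when every document that has the field has exactly one token."""
    total, count = field_stats(docs, field)
    return total == count


if __name__ == "__main__":
    corpus = [
        {"color": ["red"], "tags": ["a", "b"]},
        {"color": ["blue"]},
        {"tags": ["c"]},
    ]
    print(looks_single_valued(corpus, "color"))
    print(looks_single_valued(corpus, "tags"))
```

Note that the check counts tokens, not values, so on an analyzed text field a single multi-word value would also make the two numbers differ.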

Re: Is there a way to tell if multivalued field actually contains multiple values?

2016-11-11 Thread Mikhail Khludnev
It probably goes without saying, but norm(field) is proportional (though not exactly, by default) to the number of tokens in a doc's field (not to the number of actual text values). On Fri, Nov 11, 2016 at 5:08 AM, Alexandre Rafalovitch wrote: > Hello, > > Say I indexed a large dataset against a schemaless

Re: Filter nested index - remove empty parents

2016-11-11 Thread Dragos Bogdan
This seems to be a good approach. I will try! Thank you! Dragos From: Erick Erickson To: solr-user; Dragos Bogdan Sent: Thursday, November 10, 2016 6:02 PM Subject: Re: Filter nested index - remove empty parents It looks like you're trying to just index tables from some DB and th