ValueSource with long BinaryField

2019-02-07 Thread Jason
Hi, I'm suffering from very long search response times, specifically when sorting by a ValueSource. I stored a double array of length 4,096 per document using BinaryField. When searching, I sort with a function that calculates the distance between two double arrays in a custom ValueSource. But response time

Re: Solr collapse result repeat in 6.6.5 cloud example techproducts.

2019-02-07 Thread Joel Bernstein
Do you have more than one shard? Collapse requires that all docs in the same collapse group be co-located on the same shard. Grouping, I believe, does not require this in some scenarios. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Feb 7, 2019 at 4:07 PM 刘正 wrote: > i try request this

Solr collapse result repeat in 6.6.5 cloud example techproducts.

2019-02-07 Thread 刘正
I tried sending this request to the techproducts collection {code} select?fl=id,genre_s={!collapse%20field=genre_s}=on=genre_s:*=json {code} and I get this response {code:json} { "responseHeader":{ "zkConnected":true, "status":0, "QTime":6, "params":{ "q":"genre_s:*", "indent":"on",

Re: Why solr sends a request for a metrics every minute?

2019-02-07 Thread levtannen
Jan, After I suppressed the metrics messages, I found that there are other messages. They also come once a minute, but only on one of the 3 computers. Could you please explain to me what these messages mean and why they are produced by only one computer? Best wishes. 2019-02-07 20:18:37.089

Re: Solr relevancy score different on replicated nodes

2019-02-07 Thread Erick Erickson
Optimization is safe. The large segment is irrelevant, you'll lose a little parallelization, but on an index with this few documents I doubt you'll notice. As of Solr 5, optimize will respect the max segment size which defaults to 5G, but you're well under that limit. Best, Erick On Sun, Feb 3,

Re: Relevancy Score Calculation

2019-02-07 Thread Erick Erickson
Why do you think that would help? This sounds like an XY problem, you are asking how to do X because you think it'll help with problem Y but haven't told us what Y is. At any rate, it would require a code change, in many places. It's unlikely to be worth the effort anyway, because the term

Re: [CDCR]Unable to locate core

2019-02-07 Thread Tim
So it looks like I'm having an issue with this fix: https://issues.apache.org/jira/browse/SOLR-11724 So I've messed around with this for a while and every time the leader to leader replica portion works fine. But the Recovery portion (implemented as part of the fix above) fails. I've run a few

Re: AIX platform: Solr goes down with java.lang.OutOfMemoryError with Open JDK 11

2019-02-07 Thread Erick Erickson
Check your ulimit for max processes and max open file handles, those typically are places where things go weird, and the error message isn't always that helpful. Usually we want 65K of each... On Mon, Feb 4, 2019 at 8:25 AM Shawn Heisey wrote: > > On 2/4/2019 5:53 AM, balu...@gmail.com wrote: >
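The ulimit check suggested above can be sketched in Python on a Unix-like host. This is only an illustration: the helper names `limit_ok` and `check_limits` are hypothetical, the threshold of 65,000 is one reading of "65K of each", and the script reports the limits of the process that runs it, so it would have to run as the same user and in the same environment as Solr to be meaningful.

```python
import resource  # Unix-only standard library module

# Assumed threshold, taken from "65K of each" in the message above.
RECOMMENDED = 65_000


def limit_ok(soft: int, minimum: int = RECOMMENDED) -> bool:
    """Return True when a soft limit is unlimited or at least `minimum`."""
    return soft == resource.RLIM_INFINITY or soft >= minimum


def check_limits() -> dict:
    """Check the current process's soft limits for open files and processes."""
    names = {
        "open files": resource.RLIMIT_NOFILE,
        "max processes": resource.RLIMIT_NPROC,
    }
    # getrlimit returns a (soft, hard) tuple; only the soft limit is checked here.
    return {label: limit_ok(resource.getrlimit(res)[0]) for label, res in names.items()}


if __name__ == "__main__":
    for label, ok in check_limits().items():
        print(f"{label}: {'ok' if ok else 'below recommended'}")
```

A False entry only says that the soft limit is below the assumed threshold; whether that explains an OutOfMemoryError still has to be confirmed from the Solr logs.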

Re: Ignore accent in a request

2019-02-07 Thread Erick Erickson
exactly _how_ is it "not working"? Try building your parameters _up_ rather than starting with a lot, e.g. select?defType=dismax=je suis avarié=title ^^ assumes you expect a match on title. Then: select?defType=dismax=je suis avarié=title subject etc. Because mm=757 looks really wrong. From the

Re: Help needed with Solrcloud error messages

2019-02-07 Thread Erick Erickson
Your solr logs on the server should have more details than just the bare error in the full stack trace. Those would help figure out what's happening. Best, Erick On Mon, Feb 4, 2019 at 3:14 PM Webster Homer wrote: > > We have a number of collections in a Solrcloud. > > The cloud has 2 shards

Re: Full index replication upon service restart

2019-02-07 Thread Erick Erickson
bq. We have a heavy indexing load of about 10,000 documents every 150 seconds. Not so heavy query load. It's unlikely that changing numRecordsToKeep will help all that much if your maintenance window is very large. Rather, that number would have to be _very_ high. 7 hours is huge. How big are

Re: Allow Join over two sharded collection

2019-02-07 Thread Erick Erickson
This doesn't appear to be being actively pursued, so it's anybody's guess. Depending on your use-case, the streaming capabilities may be an OOB solution. Best, Erick On Wed, Feb 6, 2019 at 1:22 AM mganeshs wrote: > > All, > > Any idea, whether this will be taken care or addressed in near

Re: How to stop a new slave from serving request until it has replicated index the first time.

2019-02-07 Thread Erick Erickson
Unless you have a specific reason to use master/slave, SolrCloud (in this case only one shard probably) will make this kind of thing easier. This is the equivalent of ADDREPLICA. Also, the TLOG and PULL replica types (as of Solr 7) are something of a hybrid of master/slave and SolrCloud. FWIW,

Re: Accessing multiValued field from within custom function

2019-02-07 Thread Dariusz Wojtas
Hi, Any hints on this topic? How to access String / Text values from a multiValued field inside custom function? Best regards, Dariusz Wojtas On Thu, Jan 3, 2019 at 6:18 PM Dariusz Wojtas wrote: > Hi, > > I am using SOLR 7.5 in the cloud mode. > I want to create a custom function similar to

CloudSolrClient getDocCollection

2019-02-07 Thread Hendrik Haddorp
Hi, when I perform a query using the CloudSolrClient the code first retrieves the DocCollection to determine to which instance the query should be sent [1]. getDocCollection [2] does a lookup in a cache, which has a 60s expiration time [3]. When a DocCollection has to be reloaded this is

Re: RegexReplaceProcessorFactory pattern to detect multiple \n

2019-02-07 Thread Zheng Lin Edwin Yeo
Hi Paul, We have tried it with the space preceding the \n i.e. (\s*\n){2,}, with the following regex pattern: content (\s*\n){2,} brbr However, we are also getting the exact same results as the earlier Example 1, 2 and 3. As for your point 2 on perhaps in the data you have other

AW: RegexReplaceProcessorFactory pattern to detect multiple \n

2019-02-07 Thread paul.dodd
Hi Edwin 1. Sorry, the pattern was wrong, the space should precede the \n i.e. (\s*\n){2,} 2. Perhaps in the data you have other (non printing) characters than \n? Sent from Mail for Windows 10 From: Zheng Lin Edwin

Re: RegexReplaceProcessorFactory pattern to detect multiple \n

2019-02-07 Thread Zheng Lin Edwin Yeo
Hi Paul, We have tried this suggested regex pattern as follows: content (\n\s*){2,} brbr But we still have exactly the same problem with Examples 1, 2 and 3 below. Example 1: A sentence for which the above regex pattern works correctly *Original content:*Dear Sir, \n\n \n \n\n I am

AW: RegexReplaceProcessorFactory pattern to detect multiple \n

2019-02-07 Thread paul.dodd
To avoid the «\n+\s*» matching too many \n and then failing on the {2,} part you could try (\n\s*){2,} If you also want to match CRLF then (\r?\n\s*){2,} Sent from Mail for Windows 10 From: Zheng Lin Edwin
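The CRLF-tolerant pattern from this message can be tried outside Solr before it goes into the update processor configuration. The sketch below uses Python's `re` module, which behaves like Java regex for this particular pattern, though that equivalence is an assumption and not something the thread verifies. The function name `collapse_breaks` and the `[BREAK]` marker are hypothetical stand-ins, since the thread's actual replacement value is not fully visible in these snippets.

```python
import re

# Pattern quoted in the message above: two or more line breaks (LF or CRLF),
# each optionally followed by whitespace.
BLANK_RUN = re.compile(r"(\r?\n\s*){2,}")


def collapse_breaks(text: str, replacement: str = "[BREAK]") -> str:
    """Replace each run of two or more line breaks with `replacement`.

    `replacement` is a placeholder marker chosen for this sketch only.
    """
    return BLANK_RUN.sub(replacement, text)


if __name__ == "__main__":
    sample = "Dear Sir, \n\n \n \n\n I am"
    print(repr(collapse_breaks(sample)))
```

Because `\s` also matches `\n`, the whole whitespace run after the first line break is consumed as one match, while a single line break followed by spaces is left alone. If the same input behaves differently inside Solr, the stored text probably contains something other than real newline characters, as suggested elsewhere in this thread.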

Re: RegexReplaceProcessorFactory pattern to detect multiple \n

2019-02-07 Thread Zheng Lin Edwin Yeo
Hi Paul, Thanks for your reply. When I use this pattern: content (\n+\s*){2,} brbr It works for some sentences within the same content but not for others. Please see below for one that is working and another that is not working (partially working): Example 1:

AW: RegexReplaceProcessorFactory pattern to detect multiple \n

2019-02-07 Thread paul.dodd
You don’t say what happens, just that it is not working. I assume nothing is replaced? Perhaps the pattern should be "(\n\s*){2,}" ?? Sent from Mail for Windows 10 From: Zheng Lin Edwin Yeo Sent:

RegexReplaceProcessorFactory pattern to detect multiple \n

2019-02-07 Thread Zheng Lin Edwin Yeo
Hi, I am trying to use the RegexReplaceProcessorFactory to match two or more \n with any number of spaces between them (e.g. \n\n, \n \n, \n \n \n \n), and replace them with two . I use the following regex pattern and it works when I test it in regex101.com. But it is not working when I

Re: RESTORE does not create replica as defined

2019-02-07 Thread Ganesh Sethuraman
Any help on this is much appreciated. On Wed, Feb 6, 2019 at 11:10 AM Ganesh Sethuraman wrote: > Hi > > We are using Solr Cloud 7.2.1. We are using the backup and restore > features, and we are finding that the restore is not working as expected. The > restore is successful, but does not create the

Re: What is the benefit of stored="true" in *PointFields

2019-02-07 Thread Toke Eskildsen
On Thu, 2019-02-07 at 11:24 +0900, Yasufumi Mizoguchi wrote: > Actually, stored is compressed but I believed that docValues was > compressed > in some strategies depending on > field's values/density as following java doc says. >

Re: change in White Space when upgrading 6.6 to 7.4

2019-02-07 Thread Matt Pearce
sow defaulting to false changed between 6.x and 7.x, which is why the problem has appeared for you, and is solved by setting sow=true in your defaults. With sow=true, I would expect your query to be broken into three parts, and then tokenised: ABC4856.21 AND -field1:ABC4856.21 With

Re: Remove my mail from subscriptions

2019-02-07 Thread Gora Mohanty
On Thu, 7 Feb 2019 at 12:29, manohar c wrote: > Hi, > Please Remove my mail from the subscription list. > Please see http://lucene.apache.org/solr/community.html#mailing-lists-irc In a manner similar to how you subscribed, you need to send an email from your subscribed account to