Error when configuring reclaimDeletesWeight in TieredMergedPolicyFactory

2019-01-24 Thread Zheng Lin Edwin Yeo
Hi, I am using Solr 7.5.0, and I came across this presentation ( https://www.slideshare.net/sematext/solr-search-engine-optimize-is-not-bad-for-you) on Solr Search Engine: Optimize Is (Not) Bad for You. >From Slide 59, it touch on the settings on reclaimDeletesWeight. I have tried to follow

Question about IndexSearcher.search()

2019-01-24 Thread NDelt
Hello. I'm trying to make sample search application using Lucene. search() method of IndexSearch class searches documents with query, and returns TopDocs instance. TopDocs instance includes array of ScoreDoc instances. My Question is: 1. Will query be tokenized during a search? 2. If so, does

Re: API to convert a SolrInputDocument to JSON

2019-01-24 Thread Shawn Heisey
On 1/24/2019 5:06 PM, Pushkar Raste wrote: May be my questions wasn’t clear. By issues I meant will Solrj client for 7.x work to index documents in Solr 4.10 or vice versa. With HttpSolrClient, I would generally expect very good compatibility from 7.x to 4.x. I have done it with no problems.

Re: API to convert a SolrInputDocument to JSON

2019-01-24 Thread Pushkar Raste
May be my questions wasn’t clear. By issues I meant will Solrj client for 7.x work to index documents in Solr 4.10 or vice versa. I am OK to use HttpSolrClient On Wed, Jan 23, 2019 at 9:33 PM Erick Erickson wrote: > Walter: > > Don't know if it helps, but have you looked at: >

Re: CDCR "all" collections

2019-01-24 Thread Erick Erickson
Bram: Hmmm You can't do that OOB right now, but it might not be a hard thing to add. The current configuration allows the source collection to have a different name than the target collection so if you could make the assumption that the two collections always had the same name, it might be

Re: [SPAM] Re: Per-field slop param in eDisMax

2019-01-24 Thread David Hastings
Also the order matters, it would be a different result set than "a tnf"~2 On Thu, Jan 24, 2019 at 10:53 AM David Hastings < hastings.recurs...@gmail.com> wrote: > it allows two words or less to be matched in a phrase in-between "tnf" and > "a" > so it will match > "tnf a" > "tnf aword1 a" > "tnf

Re: [SPAM] Re: Per-field slop param in eDisMax

2019-01-24 Thread David Hastings
it allows two words or less to be matched in a phrase in-between "tnf" and "a" so it will match "tnf a" "tnf aword1 a" "tnf aword1 aword2 a" On Thu, Jan 24, 2019 at 10:45 AM Danilo Tomasoni wrote: > And what does > > q: f2:"tnf α"~2 > > f.f2.qf: titles study_brief_title > > > means

Re: [SPAM] Re: Per-field slop param in eDisMax

2019-01-24 Thread Danilo Tomasoni
And what does q: f2:"tnf α"~2 f.f2.qf:  titles study_brief_title means with edismax? it raises different results from q: f2:"tnf α" On 24/01/19 14:51, Elizabeth Haubert wrote: To do this you specify the slop on each field when you specify the pf/pf2/pf3 parameters:

Re: Indexing in one collection affect index in another collection

2019-01-24 Thread Zheng Lin Edwin Yeo
Hi Jan, Thanks for your reply. However, we are still getting a slow QTime of 517ms even after we set hl=false=null. Below is the debug query: "debug":{ "rawquerystring":"cherry", "querystring":"cherry", "parsedquery":"searchFields_tcs:cherry",

Re: _version_ field missing in schema?

2019-01-24 Thread Aleksandar Dimitrov
Finally, since you are trying to really tweak the schema and general configuration right from the start, you may find some of my presentations useful, as they show the minimal configuration. Not perfect for your needs, as I do skip _version, but as an additional data point. The recent one is:

Re: _version_ field missing in schema?

2019-01-24 Thread Aleksandar Dimitrov
Shawn Heisey writes: On 1/23/2019 3:49 AM, Aleksandar Dimitrov wrote: Hi Alex, thanks for you answer. I took the lines directly from the managed-schema, deleted the managed-schema, and pasted those lines into my schema.xml. Unless you have changed the solrconfig.xml to refer to the

Re: Indexing in one collection affect index in another collection

2019-01-24 Thread Jan Høydahl
Looks like highlighting takes most of the time on the first query (680ms). You config seems to ask for a lot of highlighting here, like 100 snippets of max 10 characters etc. Sounds to me that this might be a highlighting configuration problem. Try to disable highlighting (hl=false) and see

Re: Per-field slop param in eDisMax

2019-01-24 Thread Elizabeth Haubert
To do this you specify the slop on each field when you specify the pf/pf2/pf3 parameters: pf:fieldA~2 fieldB~5 I'll try to add an example to the documentation here: https://lucene.apache.org/solr/guide/7_6/the-extended-dismax-query-parser.html#using-slop Elizabeth On Wed, Jan 23, 2019 at 10:30

Re: Indexing in one collection affect index in another collection

2019-01-24 Thread Zheng Lin Edwin Yeo
Thanks for your reply. Below are what you have requested about our Solr setup, configurations files ,schema and results of debug queries: Looking forward to your advice and support on our problem. 1. System configurations OS: Windows 10 Pro 64 bit System Memory: 32GB CPU: Intel(R) Core(TM)

Re: Indexing in one collection affect index in another collection

2019-01-24 Thread Jan Høydahl
It would be useful if you can disclose the machine configuration, OS, memory, settings etc, as well as solr config including solr.in .sh, solrconfig.xml etc, so we can see the whole picture of memory, GC, etc. You could also specify debugQuery=true on a slow search and check the

Re: Indexing in one collection affect index in another collection

2019-01-24 Thread Zheng Lin Edwin Yeo
Hi Shawn, Unfortunately your reply of memory may not be valid. Please refer to my explanation below of the strange behaviors (is it much more like a BUG than anything else that is explainable): Note that we still have 18GB of free unused memory on the server. 1. We indexed the first collection

Re: Solr dependencies with security issues (CVEs)

2019-01-24 Thread Jan Høydahl
Please see https://wiki.apache.org/solr/SolrSecurity#Solr_and_Vulnerability_Scanning_Tools for a list of CVEs that do NOT affect Solr. As that page states, if you believe that one of the CVEs are really

Solr dependencies with security issues (CVEs)

2019-01-24 Thread Andreas Hubold
Hi, in our project, we're checking JAR dependencies with the OWASP dependency check [1] for security issues for which CVEs have been reported. There are CVEs for some of Solr's third-party dependencies in version 7.6.0, and I wonder if you have plans to update these to unaffected versions.

CDCR "all" collections

2019-01-24 Thread Bram Van Dam
Hey folks, Is there any way to set up CDCR for *all* collections, including any newly created ones? Having to modify the solrconfig in ZK every time a collection is added is a bit of a pain, especially because I'm assuming it requires a restart to activate the config? Basically if I have DC Src

How to estimate Java Heap Requirement for solr.

2019-01-24 Thread Satya Nand kanodia
Hi, I have a Solr instance having 6 cores. I have given -Xms1024m -Xmx16g heap memory to it. *Cores have the following number of documents in it.* 1. 86,31,043 2. 6,59,61,263 3. 4,55,31,492 4. 21,10,087 5. 1,14,477 6. 33,397 *I have following cache configuration.* My question is if

Re: Indexing in one collection affect index in another collection

2019-01-24 Thread Zheng Lin Edwin Yeo
Hi Shawn, > If the two collections have data on the same server(s), I can see this > happening. More memory is consumed when there is additional data, and > when Solr needs more memory, performance might be affected. The > solution is generally to install more memory in the server. I have