Re: Re: Re: Query Autocomplete Evaluation

2020-02-26 Thread Paras Lehana
Hi Audrey, For MRR, we assume that if a suggestion is selected, it's relevant. It's also assumed that the user will always click the highest relevant suggestion. Thus, we record the position of the selected suggestion for each selection. If I'm still not understanding your question correctly, feel free to contact
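A minimal sketch of the MRR calculation described above, assuming each logged selection is stored as the 1-based position of the suggestion the user clicked. The function name and input format are hypothetical, not taken from the thread.

```python
from typing import Iterable


def mean_reciprocal_rank(selected_positions: Iterable[int]) -> float:
    """Average of 1/position over all logged selections.

    Assumption: positions are 1-based, so a click on the top
    suggestion contributes 1.0 and a click on the fourth contributes 0.25.
    """
    positions = list(selected_positions)
    if not positions:
        # No selections logged: define MRR as 0.0 rather than dividing by zero.
        return 0.0
    if any(p < 1 for p in positions):
        raise ValueError("positions must be 1-based (>= 1)")
    return sum(1.0 / p for p in positions) / len(positions)


if __name__ == "__main__":
    # Three selections: top suggestion, second suggestion, fourth suggestion.
    print(mean_reciprocal_rank([1, 2, 4]))
```

Under the stated assumption that every selection is relevant, a value close to 1.0 means users mostly pick the first suggestion shown.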

Re: Best Practises around relevance tuning per query

2020-02-26 Thread Paras Lehana
Hi Ashwin, If I'm understanding your requirement correctly, I think you should read about Payloads. On Thu, 27 Feb 2020 at 09:41, Ashwin Ramesh wrote: > Hi everybody, > > Thank you for all the amazing feedback. I apologize for the formatting of > my

Re: Best Practises around relevance tuning per query

2020-02-26 Thread Ashwin Ramesh
Hi everybody, Thank you for all the amazing feedback. I apologize for the formatting of my question. I guess if I were to generalize my question, 'What are the most common approaches to storing query level features in Solr documents?' For example, a normalized_click_score is a document level featu

Re: Rule of thumb for determining maxTime of AutoCommit

2020-02-26 Thread Kayak28
Hello, Emir: Thank you for your reply. I do understand that the frequency of creating a searcher depends on how much real-time search is required. As you advised, I have checked the soft-commit configuration. It is configured as: ${solr.autoSoftCommit.maxTime:-1} If I am correct, I have not set au

Re: Time out problems with the Solr server 8.4.1

2020-02-26 Thread Massimiliano Randazzo
On Wed, 26 Feb 2020 at 23:42, Vincenzo D'Amore < v.dam...@gmail.com> wrote: > Hi Massimiliano, > > it’s not clear how much memory you have configured for your Solr instance. > SOLR_HEAP="20480m" SOLR_JAVA_MEM="-Xms20480m -Xmx20480m" GC_LOG_OPTS="-verbose:gc -XX:+PrintHeapAtGC -XX:

Re: Time out problems with the Solr server 8.4.1

2020-02-26 Thread Walter Underwood
Many years ago, I accidentally ran Solr with the data dir on an NFS volume. It was 100X slower. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Feb 26, 2020, at 2:42 PM, Vincenzo D'Amore wrote: > > Hi Massimiliano, > > it’s not clear how much memor

Re: Time out problems with the Solr server 8.4.1

2020-02-26 Thread Vincenzo D'Amore
Hi Massimiliano, it’s not clear how much memory you have configured for your Solr instance. And I would avoid an nfs mount for the datadir. Ciao, Vincenzo -- mobile: 3498513251 skype: free.dev > On 26 Feb 2020, at 19:44, Massimiliano Randazzo > wrote: > > Il giorno mer 26 feb 2020 alle o

Re: Time out problems with the Solr server 8.4.1

2020-02-26 Thread Massimiliano Randazzo
On Wed, 26 Feb 2020 at 19:30, Dario Rigolin < dario.rigo...@comperio.it> wrote: > You can avoid commit and leave solr do autocommit at certain times. > Or use softcommit if you have search queries at the same time to answer. > 55 pages of 3500 words isn't a big deal for a solr

Re: Time out problems with the Solr server 8.4.1

2020-02-26 Thread Dario Rigolin
You can avoid explicit commits and let Solr autocommit at set intervals, or use soft commits if you have to answer search queries at the same time. 55 pages of 3500 words isn't a big deal for a Solr server; what's the hardware configuration? What is your single Solr document: a single newspaper? a single

Time out problems with the Solr server 8.4.1

2020-02-26 Thread Massimiliano Randazzo
Good morning, I have the following situation: I have to index the OCR of about 550,000 newspaper pages averaging 3,500 words per page and, since I create one document per word, there are a great many records. At the moment I have 1 instance of Solr and 8 servers that all read and write on the same insta

Re: How to check for uncommitted changes

2020-02-26 Thread Walter Underwood
There is a “docsPending” value in Solr metrics. It is probably available through JMX. You can get to it through the admin UI, too. Choose a replica, then look at Plugins/Stats, then Update, then updateHandler. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blo

Re: Optimize solr 8.4.1

2020-02-26 Thread Erick Erickson
As long as you have an http connection, you can use the replication API fetchindex command to, well, fetch an index. But that copies the index but does not shard it. I guess you could fetch into a single shard collection and then use splitshard. All that said, you'll have to reindex sometime if yo

Re: Solr 8.2.0 - Schema issue

2020-02-26 Thread Jörn Franke
Not sure I understood the whole scenario. However, did you try to reload (not reindex) the collection? > On 26.02.2020 at 15:02, Joe Obernberger wrote: > > Hi All - I have several solr collections all with the same schema. If I add > a field to the schema and index it into the collection on

Solr 8.2.0 - Schema issue

2020-02-26 Thread Joe Obernberger
Hi All - I have several solr collections all with the same schema.  If I add a field to the schema and index it into the collection on which I added the field, it works fine.  However, if I try to add a document to a different solr collection that contains the new field (and is using the same s

Re: Optimize solr 8.4.1

2020-02-26 Thread Dario Rigolin
Hi Massimiliano, the only way to reindex is to resend all documents to the indexer of the Cloud instance. At the moment Solr doesn't have the ability to do it when the schema is changed, or to "send" indexed data to a SolrCloud from a non-cloud instance. For example we have in Solr a field with an only sto

Re: Optimize solr 8.4.1

2020-02-26 Thread Massimiliano Randazzo
Hi Paras, thank you for your answer. If you don't mind, I have a couple of questions. I am experiencing very long indexing times. I have 8 servers currently working on 1 instance of Solr; I thought of moving to a cloud of 4 Solr servers with 3 ZooKeeper servers to distribute the load but I

RE: Solr 6.3 with Open JDK

2020-02-26 Thread Kommu, Vinodh K.
Hi Lehana, Thanks for your quick response. We have a vulnerability issue with JDK 1.8.0_60, so we plan to stay with Java 8 but want to update to OpenJDK 1.8.0_242 or a higher version within JDK 1.8. Regards, Vinodh -Original Message- From: Paras Lehana Sent: Wednesday, February 2

Re: Solr 6.3 with Open JDK

2020-02-26 Thread Paras Lehana
Hi Vinodh, You can safely use OpenJDK 1.8. Although we have upgraded to Solr 8.4 and Java 11 now, we were using Solr 6.5 and OpenJDK 1.8 for a long time. Someone on the forums wrote that the System Requirements page for Solr mentions that 'You should avoid Java 9 or later for Lucene/Solr 6.x or earl

Re: Optimize solr 8.4.1

2020-02-26 Thread Paras Lehana
Hi Massimiliano, > Is it still necessary to run the Optimize command from my application when > I have finished indexing? I guess you can stop worrying about optimizations and let Solr handle that implicitly. There's nothing especially bad about having more segments. On Wed, 26 Feb 2020 at 16:02, Massimi

Re: How to check for uncommitted changes

2020-02-26 Thread Paras Lehana
Hey Connor, You can use the Metrics API, which exposes a docsPending attribute. API: host:port/solr/admin/metrics?group=core&prefix=UPDATE.updateHandler.docsPending Read more here: https://lucene.apache.org/solr/guide/8_4/performance-statistics-reference.html#update-handler On Wed, 26 Feb 2020 at 02
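A minimal sketch of how the docsPending check above might be scripted. The request path and query parameters come from the message; the http scheme, the function names, and the JSON layout (a "metrics" object keyed by core registry name, each holding the docsPending value as a plain number) are assumptions that should be checked against the actual response of your Solr version.

```python
from urllib.parse import urlencode

# Metric name taken from the message above.
DOCS_PENDING = "UPDATE.updateHandler.docsPending"


def docs_pending_url(host: str, port: int) -> str:
    """Build the Metrics API URL quoted in the message (http scheme is assumed)."""
    query = urlencode({"group": "core", "prefix": DOCS_PENDING})
    return f"http://{host}:{port}/solr/admin/metrics?{query}"


def total_docs_pending(response: dict) -> int:
    """Sum docsPending over all core registries in a parsed JSON response.

    Assumed layout (hypothetical, verify against a real response):
    {"metrics": {"solr.core.<name>": {"UPDATE.updateHandler.docsPending": <int>}}}
    """
    total = 0
    for registry in response.get("metrics", {}).values():
        total += int(registry.get(DOCS_PENDING, 0))
    return total


if __name__ == "__main__":
    print(docs_pending_url("localhost", 8983))
    # Hand-written sample in the assumed layout, not real Solr output.
    sample = {"metrics": {"solr.core.example": {DOCS_PENDING: 3}}}
    print(total_docs_pending(sample))
```

A non-zero total would indicate documents that have been added but not yet committed; fetching the URL (for example with urllib.request) and passing the parsed JSON to total_docs_pending is left out so the sketch runs without a Solr server.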

Re: Solr datePointField facet

2020-02-26 Thread Paras Lehana
Hi Srinivas, That's not the same error - the previous one was about docValues. Commit and restart your Solr and try again. On Tue, 25 Feb 2020 at 16:58, Srinivas Kashyap wrote: > Hi Paras, > > PFB details: > > org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: > Error from se

Solr 6.3 with Open JDK

2020-02-26 Thread Kommu, Vinodh K.
Hi Team, Is anyone using Solr 6.3 with OpenJDK? If so, which version of OpenJDK are you using? And can we use OpenJDK 1.8.0_242 or a later version with Solr 6.3? Regards, Vinodh

Optimize solr 8.4.1

2020-02-26 Thread Massimiliano Randazzo
> Good morning, > > recently I went from version 6.4 to version 8.4.1. I access Solr > through Java applications written by me, in which I have updated the > solr-solrj-8.4.1.jar libraries. > > I am performing the OCR indexing of a newspaper of about 550,000 pages in > production for which I have

Re: Rule of thumb for determining maxTime of AutoCommit

2020-02-26 Thread Emir Arnautović
Hi Kaya, The answer is simple: as long as your requirements allow for the delay between data being indexed and changes becoming visible. Sometimes that is seconds, and sometimes hours or even a day is tolerable. On each commit your caches are invalidated and warmed (if it is configured like that) so in order