Re: Need help on LTR

2019-03-18 Thread Mohomed Rimash
Hi Amjad, After adding the libraries into the path, Did you restart the SOLR ? On Tue, 19 Mar 2019 at 08:45, Amjad Khan wrote: > I followed the Solr LTR Documentation > > https://lucene.apache.org/solr/guide/7_4/learning-to-rank.html < >

Need help on LTR

2019-03-18 Thread Amjad Khan
I followed the Solr LTR Documentation https://lucene.apache.org/solr/guide/7_4/learning-to-rank.html 1. Added library into the solr-config 2. Successfully added feature 3. Get schema to see feature is available 4. When I try

Fwd: Boosting Query Time Complexity

2019-03-18 Thread Krishna Raj
Hello Experts, My goal is to understand the time complexity of the Boosting Query as part of a search in Solr: sort=score+desc defType=edismax boost= I followed the stacktrace for the the search call and I am believe the time complexity is as follows: - Main Query time + multiple filters

Re: Fetching All Terms and Corresponding Documents

2019-03-18 Thread Zheng Lin Edwin Yeo
Hi, You can take a look at MoreLikeThis from the Solr Reference Guide at https://lucene.apache.org/solr/guide/7_7/morelikethis.html, and see if it fits what you are looking for. Regards, Edwin On Mon, 18 Mar 2019 at 22:24, Furkan KAMACI wrote: > Hi, > > I need to iterate on all terms at Solr

Re: Behavior of Function Query

2019-03-18 Thread Zheng Lin Edwin Yeo
Hi, Which version of Solr are you running? How many shards/replica is this running on, and are you running on SolrCloud? Regards, Edwin On Mon, 18 Mar 2019 at 22:19, Ashish Bisht wrote: > Please see the below requests and response > > http://Sol:8983/solr/SCSpell/select?q="*internet of > >

Re: Re: obfuscated password error

2019-03-18 Thread Zheng Lin Edwin Yeo
Hi, Did you generate your keystore with the obfuscated password or the plain text password? Regards, Edwin On Tue, 19 Mar 2019 at 02:32, Branham, Jeremy (Experis) wrote: > I’m not sure if you are sharing the trust/keystores, so I may be off-base > here… > > Some thoughts – > - Verify your VM

Boosting Query Time Complexity

2019-03-18 Thread Krishna Raj
Hello Experts, My goal is to understand the time complexity of the Boosting Query as part of a search in Solr: sort=score+desc defType=edismax boost= I followed the stacktrace for the the search call and I am believe the time complexity is as follows: - Main Query time + multiple filters

Re: SolrJ - CoreAdminRequest / PingRequest and HttpSolrClient baseUrl

2019-03-18 Thread Jörn Franke
Depends what you want to achieve. You can also have two different beans with two different clients and one accesses a core and another accesses without. You may also want depending on your use case get all cores and ping each core individually to check if they are reachable. > Am 18.03.2019 um

Re: SolrJ - CoreAdminRequest / PingRequest and HttpSolrClient baseUrl

2019-03-18 Thread Shawn Heisey
On 3/18/2019 2:04 PM, Markus Schuch wrote: * CoreAdminRequest - is only working when no particular core is given * PingRequest - is only working when a particular core is given That sounds like what I would expect to happen. Using HTTP (not SolrJ), the CoreAdmin API is accessed

SolrJ - CoreAdminRequest / PingRequest and HttpSolrClient baseUrl

2019-03-18 Thread Markus Schuch
Hi folks, i have a question about how to perform a connection check with SolrJ in a generic way. I found PingRequest and CoreAdminRequest in SolrJ. Neither of both work in both scenarios when a SolrClient is created by HttpSolrClient.Builder with AND without having the baseUrl containing the

Re: Java Heap Error

2019-03-18 Thread wclarke
I was mistaken xms and xmx are equal. I was looking in the wrong spot. I will look at the rest. Thanks! -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Garbage Collection Metrics

2019-03-18 Thread Jeff Courtade
yes i needed to see behavior ovr time from 14 servers and correlate it. Putting it into zabbix gave me behavior type graphs... like this unusual gc activity... [image: image.png] On Mon, Mar 18, 2019 at 1:32 PM Erick Erickson wrote: > GCViewer will work on the GC logging file created by Solr.

Re: Re: obfuscated password error

2019-03-18 Thread Branham, Jeremy (Experis)
I’m not sure if you are sharing the trust/keystores, so I may be off-base here… Some thoughts – - Verify your VM arguments, to be sure there aren’t conflicting SSL properties. - Verify the environment is targeting the correct version of Java - Verify the trust/key stores exist where they are

Re: Re: Garbage Collection Metrics

2019-03-18 Thread Branham, Jeremy (Experis)
I get these metrics by pushing the JMX data into Graphite, then use the non-negative derivative function on the GC ‘time’ metric. It essentially shows the amount of change on a counter, at the specific time it occurred. Jeremy Branham jb...@allstate.com On 3/18/19, 12:06 PM, "Jeff Courtade"

Re: obfuscated password error

2019-03-18 Thread Satya Marivada
Any suggestions please. Thanks, Satya On Mon, Mar 18, 2019 at 11:12 AM Satya Marivada wrote: > Hi All, > > Using solr-6.3.0, to obfuscate the password, have used jetty util to > generate obfuscated password > > > java -cp jetty-util-9.3.8.v20160314.jar >

Re: Unsubscribing

2019-03-18 Thread Erick Erickson
Please follow the instructions here: http://lucene.apache.org/solr/community.html#mailing-lists-irc. You must use the _exact_ same e-mail as you used to subscribe. If the initial try doesn't work and following the suggestions at the "problems" link doesn't work for you, let us know. But note

Re: Garbage Collection Metrics

2019-03-18 Thread Erick Erickson
GCViewer will work on the GC logging file created by Solr. It has some nice summaries, particularly of stop-the-world GC events. From there, you can pinpoint the exact times from manual inspection of the GC log. It usually looks like this: Your replica went into recovery when again? sometime

Unsubscribing

2019-03-18 Thread Monique Monteiro
How can I unsubscribe from this list? Thanks in advance, -- Monique Monteiro Twitter: http://twitter.com/monilouise

Re: Garbage Collection Metrics

2019-03-18 Thread Jeff Courtade
So, I had a problem when at a customer site. They use zabbix for data collection and alerting. The solr server had been setup to use only jmx metrics. the jvm was unstable and would lock up for a period of time and the metrics and counters would be all screwed up. Because it was using jmx to

Re: Garbage Collection Metrics

2019-03-18 Thread Jeff Courtade
The only way I found to track GC times was by truning on GC logging and the writing cronjob data collection script and graphing it in zabbix On Mon, Mar 18, 2019 at 12:34 PM Erick Erickson wrote: > Attachments are pretty aggressively stripped by the apache mail server, so > it didn’t come

Re: Garbage Collection Metrics

2019-03-18 Thread Erick Erickson
Attachments are pretty aggressively stripped by the apache mail server, so it didn’t come through. That said, I’m not sure how much use just the last GC time is. What do you want it for? This sounds a bit like an XY problem. Best, Erick > On Mar 17, 2019, at 2:43 PM, Karthik K G wrote: > >

Re: Spellchecker -File based vs Index based

2019-03-18 Thread Erick Erickson
Two guesses: 1> you have something different in your spellcheck config .vs. index config. 2> you don’t have the word in your file for the file-based spellcheck, thus Solr has no way of knowing the word is correctly spelled. > On Mar 17, 2019, at 11:56 PM, Ashish Bisht wrote: > > Hi, > > I

Re: Java Heap Error

2019-03-18 Thread Erick Erickson
1> set Xmx and Xms equal. That’s not the root of our problem, but it doesn’t help either. 2> Crack the documents outside of Solr. It’s quite easy to write a SolJ program to do this, here’s an example with the reasons why it’s advisable to do this outside of Solr.:

Re: Solr index slow response

2019-03-18 Thread Walter Underwood
Solr is not designed to have consistent response times for updates. You are expecting Solr to do something that it does not do. About Xms and Xmx, the JVM will continue to allocate memory until it hits the max. After it hits the max, it will start to collect garbage. A smaller Xms just wastes

Java Heap Error

2019-03-18 Thread wclarke
Hi, I am doing a recursive search on a local hard drive through DIH with Tika. It starts slow and then it dies at Thread-14 every time with a Java Heap error. My Xmx is at 10g, my Xms is at 512m (which might be the problem). It takes hours to fail after it has fetched over 300,000 items and

Re: Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
Hello, Chris Thanks for the tips. So I tried to set it as you suggested, not see too much improvement. Since I don't need it visible immediately, softCommit is disabled totally. The slow response is happening every few seconds, if it happens hourly I would suspect the hourly auto-commit.

obfuscated password error

2019-03-18 Thread Satya Marivada
Hi All, Using solr-6.3.0, to obfuscate the password, have used jetty util to generate obfuscated password java -cp jetty-util-9.3.8.v20160314.jar org.eclipse.jetty.util.security.Password mypassword The output has been used in solr.in.sh as below

Fetching All Terms and Corresponding Documents

2019-03-18 Thread Furkan KAMACI
Hi, I need to iterate on all terms at Solr index, and then find related documents for some terms that match my criteria. I know that I can send a query to *LukeRequestHandler*: */admin/luke?fl=content={distinct term count}=json* and then check my criteria. If matches, I can send a *fq* to

Behavior of Function Query

2019-03-18 Thread Ashish Bisht
Please see the below requests and response http://Sol:8983/solr/SCSpell/select?q="*internet of things*"=edismax=spellcontent=json=1=score,internet_of_things:query({!edismax v='"*internet of things*"'}),instant_of_things:query({!edismax v='"instant of things"'}) Response contains score from

Re: Solr index slow response

2019-03-18 Thread Chris Ulicny
One other thing to look at besides the heap is your commit settings. We've experienced something similar, and changing commit settings alleviated the issue. Are you opening a search on every hardcommit? If so, you might want to reconsider and use the softcommit for the hourly creation of a new

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
4GB swap on 400GB machine does not make much sense, so disable it. Even 4GB, some pages might be swapped, and if those are some Solr pages, it’ll affect Solr. Setting Xms and Xmx to the same value will not solve your issue but you will avoid heap resize when your heap reaches Xms. Emir --

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
Hi Aaron, You are right - large heap means that there will be no major GC all the time, but eventually it will happen and then the larger the heap the longer it will take. So with 300GB heap it takes observed 300s. If you used to run on 32GB heap and it was slow, it probably means that heap is

Re: Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
Hi, Emir, My system used to run with max 32GB, the response time is bad as well. swap is set to 4GB, there 3.2 free, I doubt swap would affect it since there is such huge free memory. I could try to with set Xms and Xmx to the same value, but I doubt how much would that change the response

Re: Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
I'm a bit confused, why large heap size would make it slower? Isn't that give it enough room to make it not busy doing GC all the time? My http/json request contains 100 documents, the total size of the 100 documents is around 5M, there are ~100 client sending those requests continuously.

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
One more thing - it is considered a good practice to use the same value for Xmx and Xms. Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 18 Mar 2019, at 14:19, Emir Arnautović > wrote: > > Hi

Re: Different behavior when using function queries

2019-03-18 Thread Erik Hatcher
If you have no documents in the results, there’s nothing to attach the function result to.`fl` is field list of fields to show in matched documents. You have no matches documents. Erik > On Mar 18, 2019, at 07:55, Ashish Bisht wrote: > > Can someone please explain the below

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
Hi Aaron, Without looking too much into numbers, my bet would be that it is large heap that is causing issues. I would decrease is significantly (<30GB) and see if it is enough for your max load. Also, disable swap or reduce swappiness to min. In any case, you should install some monitoring

Re: Solr index slow response

2019-03-18 Thread Toke Eskildsen
On Mon, 2019-03-18 at 10:47 +, Aaron Yingcai Sun wrote: > Solr server is running on a quit powerful server, 32 cpus, 400GB RAM, > while 300 GB is reserved for solr, [...] 300GB for Solr sounds excessive. > Our application send 100 documents to solr per request, json encoded. > the size is

Re: Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
Hello, Emir, Thanks for the reply, this is the solr version and heap info, standalone single solr server. I don't have monitor tool connected. only look at 'top', has not seen cpu spike so far, when the slow response happens, cpu usage is not high at all, around 30%. # curl

Different behavior when using function queries

2019-03-18 Thread Ashish Bisht
Can someone please explain the below behavior.For different q parameter function query response differs although function queries are same http://:8983/solr/SCSpell/select?q="*market place*"=edismax=spellcontent=json=1=internet_of_things:if(exists(query({!edismax v='"internet of

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
Hi Aaron, Which version of Solr? How did you configure your heap? Is it standalone Solr or SolrCloud? A single server? Do you use some monitoring tool? Do you see some spikes, pauses or CPU usage is constant? Thanks, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr &

Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
Hello, Solr! We are having some performance issue when try to send documents for solr to index. The repose time is very slow and unpredictable some time. Solr server is running on a quit powerful server, 32 cpus, 400GB RAM, while 300 GB is reserved for solr, while this happening, cpu usage

Re: Solr or SolrJ Atomic Update

2019-03-18 Thread THIERRY BOUCHENY
Hi all, Ok, I found my problem, it was a silly one as I expected !!! In my schema the “uniqueKey” was spelled with a small cap k !! Thierry > On 15 Mar 2019, at 19:03, THIERRY BOUCHENY wrote: > > Hello, > > I have spent a few hours trying to understand why I get this error. > >

Re: Solr 7.6 Shard name - possible issue?

2019-03-18 Thread Jan Høydahl
It's just a name, so as long as Solr agrees with itself and ZK that this is the core name you're fine. I have no idea why the numbering seems to jump in that way though. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 18. mar. 2019 kl. 02:15 skrev Dave Durbin : >

Spellchecker -File based vs Index based

2019-03-18 Thread Ashish Bisht
Hi, I am seeing difference in file based spellcheck and index based spellcheck implementations. Using index based http://:8983/solr/SCSpell/spell?q=*intnet of things*=edismax=spellcontent=json=0=true=*default*=AND "suggestions":[ "intnet",{ "numFound":10,