Re: LTR - OriginalScore query issue

2018-03-15 Thread ilayaraja
I do have the features defined as below for field specific (title..) matching etc: features: [ { name: "productNewness", class: "org.apache.solr.ltr.feature.SolrFeature", params: { q: "{!func}recip( ms(NOW,launchdate_pl), 3.16e-11, 1, 1)" }, store: "myFeatureStoreDemo", }, { name:

Re: statistics in hitlist

2018-03-15 Thread Joel Bernstein
I've been working on the user guide for the math expressions. Here is the page on regression: https://github.com/joel-bernstein/lucene-solr/blob/math_expressions_documentation/solr/solr-ref-guide/src/regression.adoc This page is part of the larger math expression documentation. The TOC is here:

Re: statistics in hitlist

2018-03-15 Thread Joel Bernstein
If you want to get everything in query you can do this: let(echo="d,e", a=search(tx_prod_production, q="oil_first_90_days_production:[1 TO *]", fq="isParent:true", rows="150", fl="id,oil_first_90_days_production,oil_last_30_days_production", sort="id asc"), b=col(a,

Re: statistics in hitlist

2018-03-15 Thread Erick Erickson
What does the fq clause look like? On Thu, Mar 15, 2018 at 11:51 AM, John Smith wrote: > Hi Joel, I did some more work on this statistics stuff today. Yes, we do > have nulls in our data; the document contains many fields, we don't always > have values for each field, but

Re: Apache commons fileupload migration

2018-03-15 Thread Christopher Schultz
To whom it may concern, On 3/15/18 8:40 AM, padmanabhan1616 wrote: > Hi Team,We are using Apache SOLR-5.2.1 as index engine for our data analytics > application. As part of this SOLR uses commons-fileupload-1.2.1.jar for file > manipulation.There is security Vulnerability identified in >

Re: statistics in hitlist

2018-03-15 Thread John Smith
Hi Joel, I did some more work on this statistics stuff today. Yes, we do have nulls in our data; the document contains many fields, we don't always have values for each field, but we can't set the nulls to 0 either (or any other value, really) as that will mess up other calculations (such as when

Re: DocValuesField fails if bytes > 32k in solr 7.2.1

2018-03-15 Thread Erick Erickson
No, it wasn't fixed "a long time back" in the sense that you could do this on any docValues field. Note that JIRA says "(individual codecs still have their limits, including the default codec)". Before anyone fixes it, the question is "what is the use-case for storing such large DocValues fields

DocValuesField fails if bytes > 32k in solr 7.2.1

2018-03-15 Thread Minu Theresa Thomas
Hello Team, I am using solr 7.2.1. I am getting an exception while indexing saying that "DocValuesField is too large, must be <= 32766, retry?" This is my field in my managed schema. When I checked this lucene ticket - https://issues.apache.org/jira/browse/LUCENE-4583, it says its fixed

RE: SpellCheck Reload

2018-03-15 Thread Alessandro Benedetti
Hi Sadiki, the kind of spellchecker you are using built an auxiliary Lucene index as a support data structure. That is going to be used to provide the spellcheck suggestions. "My question is, does "reloading the dictionary" mean completely erasing the current dictionary and starting from scratch

Re: Solr Developer needed urgently

2018-03-15 Thread Wenjie Zhang
Response a recruiting email and CC to the entire public solr-user@lucene.apache.org? Seriously? On Thu, Mar 15, 2018 at 9:53 AM, John Bickerstaff wrote: > Hi - thanks for thinking of me! > > I'm currently lead on the Solr team for Ancestry - and having a good time. > I

Re: FW: Question about Overseer calling SPLITSHARD collection API command during autoscaling

2018-03-15 Thread Cassandra Targett
Hi Matthew - It's cool to hear you're using the new autoscaling features. To answer your first question, SPLITSHARD as an action for autoscaling is not yet supported. As for when it might be, it's the next big gap to fill in the autoscaling functionality, but there is some work to do first to

Re: Solr Developer needed urgently

2018-03-15 Thread John Bickerstaff
Hi - thanks for thinking of me! I'm currently lead on the Solr team for Ancestry - and having a good time. I might be interested, but moving to New York isn't going to work for me. If there is a good chance of working from home, then I might be interested... Let me know... On Wed, Mar 14, 2018

Question about Overseer calling SPLITSHARD collection API command during autoscaling

2018-03-15 Thread Matthew Faw
Hi, So I’ve been trying out the new autoscaling features in solr 7.2.1. I run the following commands when creating my solr cluster: Set up overseer role: curl -s "solr-service-core:8983/solr/admin/collections?action=ADDROLE=overseer=$thenode" Create cluster prefs:

FW: Question about Overseer calling SPLITSHARD collection API command during autoscaling

2018-03-15 Thread Matthew Faw
I sent this a few mins ago, but wasn’t yet subscribed. Forwarding the message along to make sure it’s received! From: Matthew Faw Date: Thursday, March 15, 2018 at 12:28 PM To: "solr-user@lucene.apache.org" Cc: Matthew Faw

Re: Some performance questions....

2018-03-15 Thread Alessandro Benedetti
*Single Solr Instance VS Multiple Solr instances on Single Server * I think there is no benefit in having multiple Solr instances on a single server, unless the heap memory required by the JVM is too big. And remember that this has relatively to do with the index size ( inverted index is memory

Re: [nesting] Any way to return the whole hierarchical structure when doing Block Join queries?

2018-03-15 Thread Jan Høydahl
> 14. mar. 2018 kl. 15:45 skrev Anshum Gupta : > > Hi Jan, > > The way I remember it was done (or at least we did it) is by storing the > depth information as a field in the document using an update request > processor and using a custom transformer to reconstruct the

Re: Problem encountered upon starting Solr after improper exit

2018-03-15 Thread Erick Erickson
I've never heard fo killing a Java doing this. These lines: dyld: Library not loaded: /usr/local/opt/mpfr/lib/libmpfr.4.dylib Referenced from: /usr/local/bin/awk indicate what I expect is the root of your problem, _somehow_ files were deleted. I'd be _very_ surprised if killing the Java process

Re: LTR - OriginalScore query issue

2018-03-15 Thread Alessandro Benedetti
>From the snippet you posted this is the query you run : q=id:"13245336" So the original score ( for each document in the result set) can only be the score associated to that query. You then pass an EFI with a different text. You can now use that information to calculate another feature if you

Re: Some performance questions....

2018-03-15 Thread Deepak Goel
Please see inline... Deepak "Please stop cruelty to Animals, help by becoming a Vegan" +91 73500 12833 deic...@gmail.com Facebook: https://www.facebook.com/deicool LinkedIn: www.linkedin.com/in/deicool "Plant a Tree, Go Green" On Thu, Mar 15, 2018 at 6:04 PM, BlackIce

Re: Copying a SolrCloud collection to other hosts

2018-03-15 Thread Erick Erickson
yeah, it's on a core-by-core basis. Which also makes getting it propagated to all replicas something you have to be sure happens... Glad it's working for you! Erick On Thu, Mar 15, 2018 at 1:54 AM, Patrick Schemitz wrote: > Hi Erick, > > thanks a lot, that solved our problem

Re: Remove Replacement character "�" from the search results

2018-03-15 Thread Erick Erickson
This is more likely a problem with your browser's character set, try setting it to UTF-8. Best, Erick On Thu, Mar 15, 2018 at 5:59 AM, uttam Dhakal wrote: > Hello, > > I want to remove certain characters from the search result. Image of what I > now get (and want to

Re: solr query

2018-03-15 Thread Walter Underwood
We have an index with thousands of fields. Only a few are accessed on each query. These fields break out three different kinds of weights for a thousand or so different school subjects. Each query is just for one subject, so the scoring uses those three fields. Like: * weight_a_1234 *

SOLR subscription

2018-03-15 Thread SAMMAR UL HASSAN
Hi, I hope all is well. We are using SOLR for searches in our products. We want to share some sort of feedback & also discuss various issues. As per your website, we need to subscribe on solr-user@lucene.apache.org to discuss the different queries so please

Re: Expose a metric for percentage-recovered during full recoveries

2018-03-15 Thread Andrzej Białecki
Hi S G, This looks useful, and it should be easy to add to the existing metrics in ReplicationHandler, probably somewhere around ReplicationHandler:856 . > On 14 Mar 2018, at 20:16, S G wrote: > > Hi, > > Solr does full recoveries very frequently - sometimes even

Remove Replacement character "�" from the search results

2018-03-15 Thread uttam Dhakal
Hello, I want to remove certain characters from the search result. Image of what I now get (and want to avoid) is attached in this email. My impression is I need to write an "updaterequestprocessor", is there any built-in class specific for this need? Closest class I found which matches the

Re: Matching Queries with Wildcards and Numbers

2018-03-15 Thread tapan1707
I think it should have worked. Could you share the results for both queries with =true? Also, what's the result for ec1? -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Apache commons fileupload migration

2018-03-15 Thread padmanabhan1616
Hi Team,We are using Apache SOLR-5.2.1 as index engine for our data analytics application. As part of this SOLR uses commons-fileupload-1.2.1.jar for file manipulation.There is security Vulnerability identified in commons-fileupload library: *CVE-2016-131 Apache Commons FileUpload:

Execution of query in subentity dependent on a field from the main entity

2018-03-15 Thread PeterKerk
How can I make the execution of a query in a subentity dependent on a field value from the main entity? So as you can see in the (simplified) data config below, there's an entity `categories_lvl_0` which holds an expensive query. I ONLY want to execute this query if: searchobject.objecttype=115

Re: Some performance questions....

2018-03-15 Thread BlackIce
Shawn: well the idea was to utilize system resources more efficiently.. this is not due so much to Solr, as I sayd I don't know that much about Solr, except Shema.xml and Solarconfig.xml - However the main app that will be running is more or less a single threated app which takes advantage when

Re: Expose a metric for percentage-recovered during full recoveries

2018-03-15 Thread Rick Leir
S Were there errors in the logs just before recoveries? Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

LTR - OriginalScore query issue

2018-03-15 Thread ilayaraja
solr/collection/select?fl=id,score,[features+store=myFeatureStore+efi.user_query='black shoes']=json=id:"13245336"=on When we fire this query during feature extraction, the originalScore feature gets the score of the "id" match but not the actual user query which is in this case 'black shoes'.

Re: In Place Updates not work as expected

2018-03-15 Thread Emir Arnautović
Hi, Can you share part of code where you prepare update. Thanks, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 14 Mar 2018, at 15:27, mganeshs wrote: > > Hi Emir, > > I am

RE: solr query

2018-03-15 Thread Stefan Matheis
> Is this practical adding so much additional fields? Well, as always "it depends" .. the way I see it: what are 20 fields? They just sit around and make your life way easier. You have two choices: stay with one field and have a hard time ... or add another 20 or so which do exactly what you

Re: Copying a SolrCloud collection to other hosts

2018-03-15 Thread Patrick Schemitz
Hi Erick, thanks a lot, that solved our problem nicely. (It took us a try or two to notice that this will not copy the entire collection but only the shard on the source instance, and we need to do this for all instances explicitly. But hey, we had to do the same for the old approch of scp'ing

Re: Solr on DC/OS ?

2018-03-15 Thread Hendrik Haddorp
Hi, we are running Solr on Marathon/Mesos, which should basically be the same as DC/OS. Solr and ZooKeeper are running in docker containers. I wrote my own Mesos framework that handles the assignment to the agents. There is a public sample that does the same for ElasticSearch. I'm not aware

Re: Solr document routing using composite key

2018-03-15 Thread Zheng Lin Edwin Yeo
Hi, What version of Solr are you running? How did you configure your shards in Solr? Regards, Edwin On 7 March 2018 at 02:53, Nawab Zada Asad Iqbal wrote: > Hi solr community: > > > I have been thinking to use composite key for my next project iteration and > tried it today

RE: solr query

2018-03-15 Thread Albert Lee
Cause I got about 20 date fields or more. If add a separate field for it, then I have to add additional 3 field for each of them. For example, for the field birthdate, I need to add birthdate_year, birthdate_month, birthdate_day. Is this practical adding so much additional fields? Albert

RE: solr query

2018-03-15 Thread Stefan Matheis
> You have any other idea? Yes, we go back to start and discuss again why you're not adding a separate field for that. It's the simplest thing possible and avoids all those workarounds that got mentioned. -Stefan On Mar 15, 2018 4:08 AM, "Albert Lee" wrote: > Hi Emir,