RE: Solr 5.5 | Field boosting not working as per expectation

2016-06-20 Thread Megha Bhandari
After further investigation we have found that latest inserted documents are getting higher priority and coming on top of the search results and ignoring the field boosting in case the time difference of document insertion is a day. Is there a configuration to switch off insertion time factor. A

Re: cursorMark and CSVResponseWriter for mass reindex

2016-06-20 Thread xavi jmlucjav
Hi Erick, Ah, yes I guess you are correct in that could just avoid using cursorMark this way...the only (smallish I think) issue is that I would need to extract the last id from the csv output. Oh and that I am using Datastaxx DSE, so uniqueKey is a combination of two fields...but I think I can ma

Re: Updating solr schema for a collection in place

2016-06-20 Thread Stephen Lewis
Oh, also I see when I first replied, I missed addressing this > For instance, > ​ ​ > having a field defined with docValues set to false, indexing some data > then changing that field to docValues="true" and indexing some more data will give you "interesting" results. The way we update our dat

Re: Updating solr schema for a collection in place

2016-06-20 Thread Stephen Lewis
​Thanks for the advice! I haven't encountered those nuances yet so it's great to be aware of them now. I manage our solr clouds through an OO python package which models our search stack. We use this package deploy to stacks which are isolated and configurable, but otherwise identical. We push our

Re: Updating solr schema for a collection in place

2016-06-20 Thread Erick Erickson
Glad you found the issue. The switch to managed has tripped up more people than just you! Do be a little cautious about changing the schema however. There are some "benign" changes you can do when you already have data indexed and a series of others that are not benign. For instance, having a fiel

Re: Updating solr schema for a collection in place

2016-06-20 Thread Stephen Lewis
I'm happy to say I figured out the issue. Looking through previous questions in this forum, I was able to find someone hitting the same issue which I was. After upgrading versions, we switched to the managed instead of the ClassicIndexSchemaFactory unintentionally. Sorry for the bother! On Mon, Ju

Updating solr schema for a collection in place

2016-06-20 Thread Stephen Lewis
Hello, I've recently set up a solr cloud using solr 6.0, and I've been having some trouble getting our collections to pick up schema updates. Following the docs on zkcli.sh and the collections API

RE: Solr Highlights

2016-06-20 Thread Kalpana
Thanks for your reply. Although I would like to say health_content_t is text_general and does not come back… From: Erick Erickson [via Lucene] [mailto:ml-node+s472066n428351...@n3.nabble.com] Sent: Monday, June 20, 2016 8:03 PM To: Kalpana Sivanandan Subject: Re: Solr Highlights My first quest

Re: cursorMark and CSVResponseWriter for mass reindex

2016-06-20 Thread Erick Erickson
The CursorMark stuff has to deal with shards, what happens when more than one document on different shards has the same sort value, what if all the docs in the response packet have the same sort value, what happens when you want to return docs by score and the like. For your case you can use a sor

Re: Solr Highlights

2016-06-20 Thread Erick Erickson
My first question is if you're absolutely sure that the term diabetes occurs in the title_field for the docs that are returned? Oh, looking again your field type for title_t is "string", which is totally un-analyzed so I think that's your problem. What happens if you define it as text_general and

Re: How many cores is too many cores?

2016-06-20 Thread Erick Erickson
Sebastian: It Depends (tm). Solr can handle this, but there are caveats. Is this SolrCloud or not? Each core will consume some resources and there are some JIRAs out there about specifically that many cores in SolrCloud. If your problem space works with the LotsOfCores, start here: https://cwiki.a

Solr Highlights

2016-06-20 Thread Kalpana
I have a question about Highlights – would greatly appreciate you taking the time  I have pasted snippets of SolrCongig, schema, url and results. I am not sure why I don’t see the highlights for title_t, _fullpath, health_content_t etc. I only see it for the copied field search as it is a default

Solr Highlights

2016-06-20 Thread Kalpana
I have a question about Highlights – would greatly appreciate you taking the time I have pasted snippets of SolrCongig, schema, url and results. I am not sure why I don’t see the highlights for title_t, _fullpath, health_content_t etc. I only see it for the copied field search as it is a default f

Re: Encryption to Solr indexes – Using Custom Codec

2016-06-20 Thread Alexandre Rafalovitch
On 21 June 2016 at 00:42, Sidana, Mohit wrote: > As Part of my studies I am exploring the solutions which can be used for > Lucene/Solr Index encryption. Can't help with the actual question, but just remembered that Hitachi was showing some Solr Encryption at the Lucene/Solr Revolution last year

Re: Solr 6.1.x Release Date ??

2016-06-20 Thread Joe Lawson
FYI it's released On Jun 16, 2016 11:06 AM, "Steve Rowe" wrote: > Tomorrow-ish. > > -- > Steve > www.lucidworks.com > > > On Jun 16, 2016, at 4:14 AM, Ramesh shankar wrote: > > > > Hi, > > > > Yes, i used the solr-6.1.0-79 nightly builds and [subquery] transformer > is > > working fine in, any i

Re: SpanQuery - How to wrap a NOT subquery

2016-06-20 Thread Brandon Miller
Thank you, Timothy. I have support for and am using SpanNotQuery elsewhere. Maybe there is another use for it that I'm not considering. I'm wondering if there's a clever way of reusing it in order to satisfy the requirements of proximity NOTs, too. dtSearch allows a user to have NOTs embedded i

Re: tlogs not deleting as usual in Solr 5.5.1?

2016-06-20 Thread Shawn Heisey
On 6/16/2016 1:18 PM, Chris Morley wrote: > In Solr 4.10.3 tlogs ARE deleted after issuing update?commit=true. > (And deleted immediately.) That seems like a bug to me. A hard commit is supposed to close the current transaction log and prune old logs such that what's left will meet the "ke

All Datanodes are Bad

2016-06-20 Thread Joseph Obernberger
Anyone ever seen an error like this? We are running using HDFS for the index. At the time of the error, we are doing a lot of indexing. Two errors: java.io.IOException: All datanodes DatanodeInfoWithStorage[ 172.16.100.220:50010,DS-4b806395-0661-4a70-a32b-deef82a85359,DISK] are bad. Aborting...

Re: Can someone explain about Sweetspot Similarity ?

2016-06-20 Thread Chris Hostetter
: Fwiw, here's example of per field similarity : https://cwiki.apache.org/confluence/display/solr/Other+Schema+Elements Similarities can be defined per field *TYPE* ... not per field (as shown in the examples on that page) -Hoss http://www.lucidworks.com/

Re: Encryption to Solr indexes – Using Custom Codec

2016-06-20 Thread Don Bosco Durai
Mohit, just curious. Have you considered just encrypting the volume where there index is stored? It will be done at the OS level and performance wise it will be much better and easier to manage. Would it address your requirement? Or is it purely academic exercise for you. Bosco From: "Sidana

How many cores is too many cores?

2016-06-20 Thread Sebastian Riemer
Hi, Currently I have a single solr server handling 5 cores which differ in the content they provide. However, each of them might hold data for many different clients/customers. Let's say for example one day there might be 300 different clients each storing their data in those 5 cores. Every c

cursorMark and CSVResponseWriter for mass reindex

2016-06-20 Thread xavi jmlucjav
Hi, I need to index into a new schema 800M docs, that exist in an older solr. As all fields are stored, I thought I was very lucky as I could: - use wt=csv - combined with cursorMark to easily script out something that would export/index in chunks of 1M docs or something. CVS output being very e

Encryption to Solr indexes – Using Custom Codec

2016-06-20 Thread Sidana, Mohit
Hello, As Part of my studies I am exploring the solutions which can be used for Lucene/Solr Index encryption. I found the patch open on Apache JIRA - Codec for index-level encryption (LUCENE-6966). https://issues.apache.org/jira/browse/LUCENE-69

Re: About reasons of "enablePositionIncrements" deprecation

2016-06-20 Thread Erick Erickson
I can at least answer the second question. Lucene guarantees backwards read compatibility for _one_ major revision. 6x can read 5x indexes, but not 4x. The back-compat logic is difficult enough for one major revision, so when Lucene moves to a new major version the classes supporting version-2 are

Re: Solr 5.5 | Field boosting not working as per expectation

2016-06-20 Thread Erick Erickson
bq: metatag.description field has highest boost and documents with a match in this field should come first This is something of a misunderstanding. The boost you've added to the metatag.description field is a _factor_ in the score calculations, not an ordering. In your case you're using edismax ap

Re: Core Admin API: Create Solr core if it does not exist

2016-06-20 Thread Erick Erickson
I can pretty much promise that this won't be addressed in the 4x code line. As far as whether returning an error when trying to create a core that already exists is an error or not, that's entirely a matter of interpretation. I argue that if my application is creating something that already exists

Re: How do we get terms suggestion from SuggestComponent?

2016-06-20 Thread Ahmet Arslan
Hi, I think : FreeTextLookupFactory DocumentDictionaryFactory 3 content Ahmet On Monday, June 20, 2016 3:51 PM, solr2020 wrote: Hi, I am using solr.SuggestComponent for auto suggestion, it works fine. But the problem is, it returns the whole field value as suggestion instead of terms. But m

Re: Core Admin API: Create Solr core if it does not exist

2016-06-20 Thread Andreas Hubold
Hi, can somebody help, please? Another time I even got a "500 Internal Server Error" response if I try to CREATE a core that already exists. This sounds like a bug in Solr, there's really nothing broken in Solr itself and it should not log an error and return a 500 if a client tries to create

How do we get terms suggestion from SuggestComponent?

2016-06-20 Thread solr2020
Hi, I am using solr.SuggestComponent for auto suggestion, it works fine. But the problem is, it returns the whole field value as suggestion instead of terms. But my requirement is term needs to be returned as suggestion. How do we achieve this with solr.SuggestComponent? Thanks. -- View this

Re: Phrase query proximity parameter doe not show up in parsed query string

2016-06-20 Thread Ahmet Arslan
Hi, I think synonym_edismax is not part of solr. Can you re-produce with the stock edismax? On Monday, June 20, 2016 12:34 PM, preeti kumari wrote: Hi All, My query looks like below : q=((_query_:"{!synonym_edismax qf='partnum' v='597871' bq='' mm=100 synonyms=true synonyms.constructPhra

Solr 5.5 | Field boosting not working as per expectation

2016-06-20 Thread Megha Bhandari
Hi Problem statement : metatag.description field has highest boost and documents with a match in this field should come first. However Silver-Choice-5000-E.pdf comes before /contact-us/florida even though the search term matches more fields in the /contact-us/florida page. In Silver-Choice-5000

BasicAuth and blockUnknown cause "Exception trying to get public key from..." error

2016-06-20 Thread Aleš Gregor
Hi all, I am facing issue with BasicAuth and blockUnknown. The blockUnknown in Solr authentication settings also blocks internal solr nodes communication and it is failing on fetching public keys of node in a cluster. I am getting following errors in the log: 2016-06-20 08:43:53.968 ERROR (qtp

Phrase query proximity parameter doe not show up in parsed query string

2016-06-20 Thread preeti kumari
Hi All, My query looks like below : q=((_query_:"{!synonym_edismax qf='partnum' v='597871' bq='' mm=100 synonyms=true synonyms.constructPhrases=true synonyms.ignoreQueryOperators=true}") OR (partnumcomp:597871* OR partnum:"597871"~4 OR ngramc:"597 978 787 871"~4 OR partnumngramc:"597 978 787 871