Re: newSearcher autowarming queries in solrconfig.xml run but does not appear to warm cache

2016-10-19 Thread Dalton Gooding
Erick, Thanks very much for your help so far with this one. I have captured the logs from a commit which shows a commit and new searcher starting. It appears a few ERROR's are amongst the logs and a few uninverting lines. The query is a very basic query as shown below:     DataType_s:Product

Re: Result Grouping vs. Collapsing Query Parser -- Can one be deprecated?

2016-10-19 Thread Joel Bernstein
Also as you consider using collapse you'll want to keep in mind the feature compromises that were made to achieve the higher performance: 1) Collapse does not directly support faceting. It simply collapses the results and the faceting components compute facets on the collapsed result set. Grouping

Re: Result Grouping vs. Collapsing Query Parser -- Can one be deprecated?

2016-10-19 Thread Joel Bernstein
Originally collapsing was designed with a very small feature set and one goal in mind: High performance collapsing on high cardinality fields. To avoid having to compromise on that goal, it was developed as a separate feature. The trick in combining grouping and collapsing into one feature, is to

Re: Result Grouping vs. Collapsing Query Parser -- Can one be deprecated?

2016-10-19 Thread John Bickerstaff
Thank you for posting that. I'll be saving it in my "important painful lessons learned by others" mail folder. On Oct 19, 2016 4:51 PM, "Mike Lissner" wrote: > Hi all, > > I've had a rotten day today because of Solr. I want to share my experience > and perhaps see if we can do something to fix

Result Grouping vs. Collapsing Query Parser -- Can one be deprecated?

2016-10-19 Thread Mike Lissner
Hi all, I've had a rotten day today because of Solr. I want to share my experience and perhaps see if we can do something to fix this particular situation in the future. Solr currently has two ways to get grouped results (so far!). You can either use Result Grouping or you can use the Collapsing

Re: Public/Private data in Solr :: Metadata or ?

2016-10-19 Thread Hrishikesh Gadre
As part of Cloudera Search, we have integrated with Apache Sentry for document level authorization. Currently we are using custom search component to implement filtering. Please refer to this blog post for details, http://blog.cloudera.com/blog/2014/07/new-in-cdh-5-1-document-level-security-for-clo

Re: Public/Private data in Solr :: Metadata or ?

2016-10-19 Thread John Bickerstaff
Thanks Erick - also very helpful. On Wed, Oct 19, 2016 at 1:24 PM, Erick Erickson wrote: > And for hairy ACL processing, consider a post-filter. It's custom code > that only evaluates a document _after_ it has made it through the > primary query and any "lower cost" filters. See: > http://yonik.

Re: Public/Private data in Solr :: Metadata or ?

2016-10-19 Thread Erick Erickson
And for hairy ACL processing, consider a post-filter. It's custom code that only evaluates a document _after_ it has made it through the primary query and any "lower cost" filters. See: http://yonik.com/advanced-filter-caching-in-solr/. NOTE: this isn't the thing I would do first, it's much more e

Zero value fails to match Positive, Negative, or Zero interval facet

2016-10-19 Thread Andy C
I have a field called "SCALE_double" that is defined as multivalued with the fieldType "tdouble". "tdouble" is defined as: I have a document with the value "0" indexed for this field. I am able to successfully retrieve the document with the range query "SCALE_double:[0 TO 0]". However it doesn'

ApacheCon is now less than a month away!

2016-10-19 Thread Rich Bowen
Dear Apache Enthusiast, ApacheCon Sevilla is now less than a month out, and we need your help getting the word out. Please tell your colleagues, your friends, and members of related technical communities, about this event. Rates go up November 3rd, so register today! ApacheCon, and Apache Big Dat

Re: Problem with spellchecker component

2016-10-19 Thread la...@2locos.com
we are using these spellcheckers in our collesction configs: default solr.DirectSolrSpellChecker . . . wordbreak solr.WordBreakSolrSpellChecker . . . jarowinkler org.apache.lucene.search.spell.JaroW

Re: Public/Private data in Solr :: Metadata or ?

2016-10-19 Thread John Bickerstaff
Thank you both! Very helpful. On Wed, Oct 19, 2016 at 8:48 AM, Shawn Heisey wrote: > On 10/18/2016 3:00 PM, John Bickerstaff wrote: > > How (or is it even wise) to "segregate data" in Solr so that some data > > can be seen by some users and some data not be seen? > > IMHO, security like this is

Re: PDF writer

2016-10-19 Thread Shawn Heisey
On 10/17/2016 8:01 AM, Matthew Roth wrote: > Is there a documented or preferred path to have a PDF response writer? > I am using solr 5.3.x for an internal project. I have an XSL-FO > transformation that I am able to return via the XSLT response writer. > Is there a documented way to produce a PDF

Re: solr-6.2.0 cannot be launched by systemd service

2016-10-19 Thread Shawn Heisey
On 10/17/2016 9:20 AM, yunjiez wrote: > solr_systemd.log > > > There is no problem when launching the solr-6.2.0 with the script bin/solr. > But when I launching it with systemd service, the solr instance will soon be > stopped b

Re: Public/Private data in Solr :: Metadata or ?

2016-10-19 Thread Shawn Heisey
On 10/18/2016 3:00 PM, John Bickerstaff wrote: > How (or is it even wise) to "segregate data" in Solr so that some data > can be seen by some users and some data not be seen? IMHO, security like this isn't really Solr's job ... but with the right data in the index, the system that DOES handle the

Re: Public/Private data in Solr :: Metadata or ?

2016-10-19 Thread Jan Høydahl
In practice there shoud not be much of a delay, but if you change the ACL permission on a top-level folder with 10 million docs beneath, it will take some time before all those docs are reindexed. But if you instead give your friend read access to a new “group” which already have access to the d

Re: How to substract numeric value stored in 2 documents related by correlation id one-to-one

2016-10-19 Thread Kevin Risden
The Parallel SQL support for what you are asking for doesn't exist quite yet. The use case you described is close to what I was envisioning for the Solr SQL support. This would allow full text searches and then some analytics on top of it (like call duration). I'm not sure if subtracting fields (c

Re: Facet behavior

2016-10-19 Thread Yonik Seeley
On Wed, Oct 19, 2016 at 6:23 AM, Bastien Latard | MDPI AG wrote: > Hi everybody, > > I just had a question about facets. > *==> Is the facet run on all documents (to pre-process/cache the data) or > only on returned documents?* Yes ;-) There are sometimes per-field data structures that are cache

How to substract numeric value stored in 2 documents related by correlation id one-to-one

2016-10-19 Thread kahle
Hello, I have 2 documents recorded at request or response of a service call : Entity Request { "type":"REQ", "reqid":"MES0", "service":"service0", "time":1, } Entity response { "type":"RES", "reqid":"MES0", "time":10, } I need to create following statistics: Total service ca

Facet behavior

2016-10-19 Thread Bastien Latard | MDPI AG
Hi everybody, I just had a question about facets. *==> Is the facet run on all documents (to pre-process/cache the data) or only on returned documents?* Because I have exactly the same index locally and on the prod server.. (except that my dev. contains much less docs) When I make a query,

Re: Query by distance

2016-10-19 Thread Sergio García Maroto
Thanks a lot. I will try it and let you know. Thanks again Sergio On 18 October 2016 at 17:02, John Bickerstaff wrote: > Just in case it helps, I had good success on multi-word synonyms using this > plugin... > > https://github.com/healthonnet/hon-lucene-synonyms > > IIRC, the instructions are