book on solr

2017-10-12 Thread Jay Potharaju
Hi, I am looking for a book that covers some basic principles on how to scale solr. Are there any suggestions. Example how to scale , by adding shards or replicas in the case of high rps and high index rates. Any blog or documentation also that would provide some basic rules or guidelines for

Re: Getting user-level KeeperException

2017-10-12 Thread Gunalan V
Thanks Shawn and Amrit! On Thu, Oct 12, 2017 at 4:05 PM, Shawn Heisey wrote: > On 10/11/2017 8:22 PM, Gunalan V wrote: > >> Could someone please let me know what this user-level keeper exception in >> zookeeper mean? and How to fix the same. >> > > Those are not errors.

Re: is there a way to remove deleted documents from index without optimize

2017-10-12 Thread Erick Erickson
You can use the IndexUpgradeTool that ships with each version of Solr (well, actually Lucene) to, well, upgrade your index. So you can use the IndexUpgradeTool that ships with 5x to upgrade from 4x. And the one that ships with 6x to upgrade from 5x. etc. That said, none of that is necessary _if_

Re: is there a way to remove deleted documents from index without optimize

2017-10-12 Thread Harry Yoo
I should have read this. My project has been running from apache solr 4.x, and moved to 5.x and recently migrated to 6.6.1. Do you think solr will take care of old version indexes as well? I wanted to make sure my indexes are updated with 6.x lucence version so that it will be supported when i

Re: Disabling XmlQParserPlugin through solrconfig

2017-10-12 Thread Manikandan Sivanesan
Thanks a lot. This is the suggestion we are proceeding forward with. On Thu, Oct 12, 2017 at 7:59 PM, Trey Grainger wrote: > You can also just "replace" the registered xml query parser with another > parser. I imagine you're doing this for security reasons, which means you >

Re: Disabling XmlQParserPlugin through solrconfig

2017-10-12 Thread Trey Grainger
You can also just "replace" the registered xml query parser with another parser. I imagine you're doing this for security reasons, which means you just want the actual xml query parser to not be executable through a query. Try adding the following line to your solrconfig.xml: This way, the

Re: Disabling XmlQParserPlugin through solrconfig

2017-10-12 Thread Shawn Heisey
On 10/12/2017 3:18 PM, Manikandan Sivanesan wrote: I'm looking for a way to disable the query parser XmlQParserPlugin (org.apache.solr.search.XmlQParserPlugin) through solrconfig.xml . Following the instructions mentioned here

Re: Disabling XmlQParserPlugin through solrconfig

2017-10-12 Thread Manikandan Sivanesan
Sorry noticed the typo. Am providing the corrected version On Thu, Oct 12, 2017 at 5:18 PM, Manikandan Sivanesan wrote: > I'm looking for a way to disable the query parser XmlQParserPlugin > (org.apache.solr.search.XmlQParserPlugin) through solrconfig.xml . > Following

Disabling XmlQParserPlugin through solrconfig

2017-10-12 Thread Manikandan Sivanesan
I'm looking for a way to disable the query parser XmlQParserPlugin (org.apache.solr.search.XmlQParserPlugin) through solrconfig.xml . Following the instructions mentioned here to disable a query parser. This is the part that

Re: Getting user-level KeeperException

2017-10-12 Thread Shawn Heisey
On 10/11/2017 8:22 PM, Gunalan V wrote: Could someone please let me know what this user-level keeper exception in zookeeper mean? and How to fix the same. Those are not errors.  They are INFO logs.  They are not an indication of a problem.  If they were a problem, they would most likely be at

Re: Several critical vulnerabilities discovered in Apache Solr (XXE & RCE)

2017-10-12 Thread Cassandra Targett
Michael, On behalf of the Lucene PMC, thank you for reporting these issues. Please be assured we are actively looking into them and are working to provide resolutions as soon as possible. Somehow no one in the Lucene/Solr community saw your earlier mail so we have an unfortunate delay in reacting

Re: Getting user-level KeeperException

2017-10-12 Thread Amrit Sarkar
Gunalan, Zookeeper throws KeeperException at /overseer for most of the solr issues, namely indexing. Sync the timestamp of zookeeper error with solr log; the problem lies there most probably. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter

Re: Solrcloud replication not working

2017-10-12 Thread Shawn Heisey
On 10/10/2017 2:51 AM, solr2020 wrote: i could see different version of the below entries in Leader and replica. While doing index , in replica instance logs we could see it is keep receiving update request from leader but it says no changes, skipping commit. Master (Searching) Master

Re: Indexing files from HDFS

2017-10-12 Thread Shawn Heisey
On 10/12/2017 2:04 AM, István wrote: The question is not about Hue but about why file_path is in the schema for HDFS files when using search-mr. I am wondering what is the standard way of indexing files on HDFS. The error in your original post indicates that at least one document in the

Re: Inconsistent results for facet queries

2017-10-12 Thread Chris Ulicny
I'm not sure if that method is viable for reindexing and fetching the whole collection at once for us, but unless there is something inherent in that process which happens at the collection level, we could do it a few shards at a time since it is a multi-tenant setup. I'll see if we can setup a

Re: Inconsistent results for facet queries

2017-10-12 Thread Erick Erickson
(1) It doesn't matter whether it "affect only segments being merged". You can't get accurate information if different segments have different expectations. (2) I strongly doubt it. The problem is that the "tainted" segments' meta-data is still read when merging. If the segment consisted of _only_

Re: Unsubscribe my email

2017-10-12 Thread Erick Erickson
Please follow the instructions here: http://lucene.apache.org/solr/community.html#mailing-lists-irc. You must use the _exact_ same e-mail as you used to subscribe. If the initial try doesn't work and following the suggestions at the "problems" link doesn't work for you, let us know. But note you

Unsubscribe my email

2017-10-12 Thread Shashi Roushan
Please unsubscribe my email . Regards, Shashi Roushan

Re: Inconsistent results for facet queries

2017-10-12 Thread Chris Ulicny
We tested the query on all replicas for the given shard, and they all have the same issue. So deleting and adding another replica won't fix the problem since the leader is exhibiting the behavior as well. I believe the second replica was moved (new one added, old one deleted) between nodes and so

Re: Solrcloud replication not working

2017-10-12 Thread solr2020
The problem was replicationFactor was set to 1.Now replication works fine while setting replicationFactor as 2. -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: SOLR cores are getting locked

2017-10-12 Thread Erick Erickson
You might be hitting SOLR-11297, which is fixed in Solr 7.0.1. The patch should back-port cleanly to 6x versions though. Best, Erick On Thu, Oct 12, 2017 at 12:14 AM, Gunalan V wrote: > Hello, > > I'm using SOLR 6.5.1 and I have 2 SOLR nodes in SOLRCloud and created >

Re: Inconsistent results for facet queries

2017-10-12 Thread Erick Erickson
Never mind. Anything that didn't merge old segments, just threw them away when empty (which was my idea) would possibly require as much disk space as the index currently occupied, so doesn't help your disk-constrained situation. Best, Erick On Thu, Oct 12, 2017 at 8:06 AM, Erick Erickson

Re: Inconsistent results for facet queries

2017-10-12 Thread Erick Erickson
If it's _only_ on a particular replica, here's what you could do: Just DELETEREPLICA on it, then ADDREPLICA to bring it back. You can define the "node" parameter on ADDREPLICA to get it back on the same node. Then the normal replication process would pull the entire index down from the leader. My

Re: Parsing of rq queries in LTR

2017-10-12 Thread Michael Alcorn
It turns out my last comment on that Jira was mistaken. Multi-term EFI arguments still exhibit unexpected behavior. Binoy is trying to help me figure out what the issue is. I plan on updating the Jira once we've figured out the problem. On Thu, Oct 12, 2017 at 3:41 AM, alessandro.benedetti

[Solr 6.6 w/SolrCloud]: Subqueries - Solr returning a 400 status code, Bad Request when attempting to use the [subquery] transformer

2017-10-12 Thread Damien Hawes
Good day, *Context and background:* I have a set of documents, that initially are quite deeply nested, but as part of the pre-index step the documents are flattened, such that they are at most 2 levels deep - a root document and a list of child documents. Each child document is given some

Re: Inconsistent results for facet queries

2017-10-12 Thread Chris Ulicny
I thought that decision would come back to bite us somehow. At the time, we didn't have enough space available to do a fresh reindex alongside the old collection, so the only course of action available was to index over the old one, and the vast majority of its use worked as expected. We're

Re: tf function query

2017-10-12 Thread Dmitry Kan
sorry guys to have not been responding & thanks a lot for answers. @Erick Erickson: what I would ideally like to have is tf-idf value for user's query. The thing is that we have two searchable fields. While boost works just fine for one, there is no easy way to have it multiplied by boost from

Re: Need help with Slow Query Logging

2017-10-12 Thread Atita Arora
Indeed , the trouble hasn't got over yet. So we got https://issues.apache.org/jira/browse/SOLR-11453 created meantime. I'll look forward to your updates. Thanks again , Atita On Thu, Oct 12, 2017 at 2:08 PM, Emir Arnautović < emir.arnauto...@sematext.com> wrote: > Hi Atita, > I did not have

RE: Parsing of rq queries in LTR

2017-10-12 Thread alessandro.benedetti
I don't think this is actually that much related to LTR Solr Feature. In the Solr feature I see you specify a query with a specific query parser (field). Unless there is a bug in the SolrFeature for LTR, I expect the query parser you defined to be used[1]. This means : "rawquerystring":"{!field

Re: Need help with Slow Query Logging

2017-10-12 Thread Emir Arnautović
Hi Atita, I did not have time to try it out, but will try to do it over the weekend if you are still having troubles with it. Regards, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 10 Oct 2017,

Suggester highlighter offsets inaccurate

2017-10-12 Thread Timothy Hill
Hello, I am using Solr 6.6's Suggester functionality to power an autosuggest widget that returns lists of people's names. One requirement that we have is that the suggester be punctuation-insensitive. For example, entering: 'Dr Joh' should provide the suggestion 'Dr. John', despite the fact

Re: Indexing files from HDFS

2017-10-12 Thread István
Hi Erik, The question is not about Hue but about why file_path is in the schema for HDFS files when using search-mr. I am wondering what is the standard way of indexing files on HDFS. THanks, Istvan On Wed, Oct 11, 2017 at 4:53 PM, Erick Erickson wrote: > You probably

SOLR cores are getting locked

2017-10-12 Thread Gunalan V
Hello, I'm using SOLR 6.5.1 and I have 2 SOLR nodes in SOLRCloud and created collection using the below [1] and it was created successfully during initial time but next day I tried to restart the nodes in SOLR cloud. When I start the first node the collection health is active but when I start the