Managing leaders when recycling a cluster

2020-08-11 Thread Adam Woods
Hi, We've just recently gone through the process of upgrading Solr the 8.6 and have implemented an automated rolling update mechanism to allow us to more easily make changes to our cluster in the future. Our process for this looks like this: 1. Cluster has 3 nodes. 2. Scale out to 6 nodes. 3. Pro

Re: Cannot add replica during backup

2020-08-11 Thread Ashwin Ramesh
Hey Matthew, Unfortunately, our shard leaders are across multiple nodes thus a single EBS couldn't work. Did you manage to get around this issue yourself? Regards, Ash On Tue, Aug 11, 2020 at 9:00 PM matthew sporleder wrote: > I can already tell you it is EFS that is slow. I had to switch to

Re: Survey on ManagedResources feature

2020-08-11 Thread Noble Paul
The end point is served by restlet. So, your rules are not going to be honored. The rules work only if it is served by a Solr request handler On Wed, Aug 12, 2020, 12:46 AM Jason Gerlowski wrote: > Hey Noble, > > Can you explain what you mean when you say it's not secured? Just for > those of u

Re: Incorrect Insecure Settings Check in CoreContainer

2020-08-11 Thread Jason Gerlowski
Yikes, yeah it's hard to argue with that. I'm a little confused because I remember testing this, but maybe it snuck in at the last minute? In any case, I'll reopen that jira to fix the check there. Sorry guys. Jason On Wed, Aug 5, 2020 at 9:22 AM Jan Høydahl wrote: > > This seems to have bee

massive numbers of threads with name that includes commitScheduler

2020-08-11 Thread Schwartz, Tony
>From time to time I see massive number of threads that have commitScheduler in >the name of the thread. When this happens, solr is pegging the disk IO and >querying becomes unusable for a while. I have many collections (240 shards). >It happens once in a while, I'm really not sure what is ca

Re: Backups in SolrCloud using snapshots of individual cores?

2020-08-11 Thread Bram Van Dam
On 11/08/2020 13:15, Erick Erickson wrote: > CDCR is being deprecated. so I wouldn’t suggest it for the long term. Ah yes, thanks for pointing that out. That makes Dominique's alternative less attractive. I guess I'll stick to my original proposal! Thanks Erick :-) - Bram

Re: Slow query response from SOLR 5.4.1

2020-08-11 Thread Jason Gerlowski
Hey Abhijit, The information you provided isn't really enough for anyone else on the mailing list to debug the problem. If you'd like help, please provide some more information. Good places to start would be: what is the query, what does Solr tell you when you add a "debug=timing" parameter to y

Re: Survey on ManagedResources feature

2020-08-11 Thread Jason Gerlowski
Hey Noble, Can you explain what you mean when you say it's not secured? Just for those of us who haven't been following the discussion so far? On the surface of things users taking advantage of our RuleBasedAuth plugin can secure this API like they can any other HTTP API. Or are you talking abo

Re: Multiple "df" fields

2020-08-11 Thread Erick Erickson
Have you explored edismax? > On Aug 11, 2020, at 10:34 AM, Alexandre Rafalovitch > wrote: > > I can't remember if field aliasing works with df but it may be worth a try: > > https://lucene.apache.org/solr/guide/8_1/the-extended-dismax-query-parser.html#field-aliasing-using-per-field-qf-overrid

Re: Multiple "df" fields

2020-08-11 Thread Alexandre Rafalovitch
I can't remember if field aliasing works with df but it may be worth a try: https://lucene.apache.org/solr/guide/8_1/the-extended-dismax-query-parser.html#field-aliasing-using-per-field-qf-overrides Another example: https://github.com/arafalov/solr-indexing-book/blob/master/published/languages/co

Re: Multiple "df" fields

2020-08-11 Thread Edward Turner
Hi David, We tried using copyfields, and we can get this to work, but it's not exactly what we want because we need to use a common type. E.g., Then if our "df" is specified as the "content" field, we can search over "

Re: Multiple "df" fields

2020-08-11 Thread David Hastings
why not use a copyfield for indexing? On Tue, Aug 11, 2020 at 9:59 AM Edward Turner wrote: > Hi all, > > Is it possible to have multiple "df" fields? (We think the answer is no > because our experiments did not work when adding multiple "df" values to > solrconfig.xml -- but we just wanted to do

Multiple "df" fields

2020-08-11 Thread Edward Turner
Hi all, Is it possible to have multiple "df" fields? (We think the answer is no because our experiments did not work when adding multiple "df" values to solrconfig.xml -- but we just wanted to double check with those who know better.) The reason we would like to do this is that we have two main fi

Survey on ManagedResources feature

2020-08-11 Thread Noble Paul
Hi all, The end-point for Managed resources is not secured. So it needs to be fixed/eliminated. I would like to know what is the level of adoption for that feature and if it is a critical feature for users. Another possibility is to offer a replacement for the feature using a different API Your

Study On Rejected Refactorings

2020-08-11 Thread Jevgenija Pantiuchina
Dear contributors, As part of a research team from Università della Svizzera italiana (Switzerland) and University of Sannio (Italy), we have analyzed refactoring pull requests in apache/lucene-solr repository and are looking for developers for a short 5-10 min survey (https://usi.eu.qualtrics

Re: Backups in SolrCloud using snapshots of individual cores?

2020-08-11 Thread Dominique Bejean
An idea could be use autoscaling API in order to add a PULL replica for each shard located in one or more low resource backup dedicated nodes in separate hardware. However, we need to exclude these "PULL backup replica" from searches. Unfortunately, I am not aware of this possibility. For better RP

Re: Backups in SolrCloud using snapshots of individual cores?

2020-08-11 Thread Erick Erickson
Dominique: Alternatives are under discussion, there isn’t a recommendation yet. Erick > On Aug 11, 2020, at 7:49 AM, Dominique Bejean > wrote: > > I missed that ! > Are you aware about an alternative ? > > Regards > > Dominique > > > Le mar. 11 août 2020 à 13:15, Erick Erickson a > écrit

Re: Backups in SolrCloud using snapshots of individual cores?

2020-08-11 Thread Dominique Bejean
I missed that ! Are you aware about an alternative ? Regards Dominique Le mar. 11 août 2020 à 13:15, Erick Erickson a écrit : > CDCR is being deprecated. so I wouldn’t suggest it for the long term. > > > On Aug 10, 2020, at 9:33 PM, Ashwin Ramesh > wrote: > > > > I would love an answer to th

Re: Backups in SolrCloud using snapshots of individual cores?

2020-08-11 Thread Erick Erickson
CDCR is being deprecated. so I wouldn’t suggest it for the long term. > On Aug 10, 2020, at 9:33 PM, Ashwin Ramesh wrote: > > I would love an answer to this too! > > On Fri, Aug 7, 2020 at 12:18 AM Bram Van Dam wrote: > >> Hey folks, >> >> Been reading up about the various ways of creating b

Re: Production Issue: TIMED_WAITING - Will net.ipv4.tcp_tw_reuse=1 help?

2020-08-11 Thread Doss
Hi Dominique, Our issues are similar to the one discussed here. https://github.com/eclipse/jetty.project/issues/4105 Your views on this. Thanks, Mohandoss. On Tue, Aug 11, 2020 at 7:06 AM Doss wrote: > Hi Dominique, > > Thanks for the response. > > I don't think I would use a JVM version 14.

Re: Cannot add replica during backup

2020-08-11 Thread matthew sporleder
I can already tell you it is EFS that is slow. I had to switch to an ebs disk for backups on a different project because efs couldn't keep up. > On Aug 10, 2020, at 9:43 PM, Ashwin Ramesh wrote: > > Hey Aroop, the general process for our backup is: > - Connect all machines to an EFS drive (AW

Re: Solrcloud tlog are not deleted

2020-08-11 Thread Dominique Bejean
Hi, Did you disable CDCR buffer ? solr//cdcr?action=DISABLEBUFFER You can check with "cdcr?action=STATUS" Regards Dominique Le mar. 11 août 2020 à 10:57, Michel Bamouni a écrit : > Hello, > > > We had setup a synchronization between our solr instances on 2 datacenters > by using the CDCR.

Re: Backups in SolrCloud using snapshots of individual cores?

2020-08-11 Thread Dominique Bejean
Hi, This procedure looks fine but it is a little complexe to automatize. Why not consider backup based on CDCR for Solrcloud or Replication for Solr standalone ? For Solrcloud, CDCR can be configured with source and target collections in the same Solrcloud cluster. The target collection can ha

Solrcloud tlog are not deleted

2020-08-11 Thread Michel Bamouni
Hello, We had setup a synchronization between our solr instances on 2 datacenters by using the CDCR. until now, every thing worked fine but after an upgrade from solr 7.3 to solr 7.7, we are facing an issue. Indeed, our tlog files are not deleted even if we see the new values on the two solr