The error was a zookeeper connect timeout. Which apparently is hardcoded to 180 seconds.
I've recently succeeded in the restore. It may well have been a connection issue, since the environment is a shared VM environment. Outside pressure is possible. The timeout source code(in version 7.6.0): https://github.com/apache/lucene-solr/blob/1d85cd783863f75cea133fb9c452302214165a4d/solr/core/src/java/org/apache/solr/handler/admin/CollectionsHandler.java#L301 And eventually it gets used here: https://github.com/apache/lucene-solr/blob/1d85cd783863f75cea133fb9c452302214165a4d/solr/core/src/java/org/apache/solr/handler/admin/CollectionsHandler.java#L359 This makes me wonder: is there a hard limit? The restore can only take 180 seconds or it fails? Or is that timeout per connection attempt to zookeeper? On Thu, Oct 17, 2019 at 11:16 AM Jon Kjær Amundsen <j...@udbudsvagten.dk> wrote: > As a restore is server agnostic (i.e. you can restore to a totally > different host than the backup was taken from) hat shouldn't be the > problem. > Also a collection as such has no leader, only shards have. > > Do you have any kind of logs stating the errors encountered? > > Venlig hilsen/Best regards > > *Jon Kjær Amundsen* > Developer > > > Phone: +45 7023 9080 > E-mail: j...@udbudsvagten.dk > Web: www.udbudsvagten.dk > Parken - Tårn D - 5. Sal > Øster Allé 48 | DK - 2100 København > > <http://dk.linkedin.com/in/JonKjaerAmundsen/> > > Intelligent Offentlig Samhandel > *Før, under og efter udbud* > > *Følg UdbudsVagten og markedet her Linkedin > <http://www.linkedin.com/groups?groupDashboard=&gid=1862353> * > > > Den tor. 17. okt. 2019 kl. 10.06 skrev Koen De Groote < > koen.degro...@limecraft.com>: > > > Yes, both servers back up to a network drive. > > > > However, that is not the point of my question. > > > > The point of my question is: If I execute the curl command, that contacts > > the collections API, to perform the backup, does it matter that the > leader > > is on a different host from the one where the backup command was > executed? > > > > > > > > On Thu, Oct 17, 2019 at 9:30 AM Jon Kjær Amundsen <j...@udbudsvagten.dk> > > wrote: > > > > > Hi Koen > > > > > > A quick sanity check: > > > Do you use a network drive accessible from both servers to make the > > backup > > > to? > > > If you've backed up server2's collection to a local disk, then when > > you're > > > trying to restore it via server1 it does not know anything about the > > > backup. > > > > > > Venlig hilsen/Best regards > > > > > > *Jon Kjær Amundsen* > > > Developer > > > > > > > > > Phone: +45 7023 9080 > > > E-mail: j...@udbudsvagten.dk > > > Web: www.udbudsvagten.dk > > > Parken - Tårn D - 5. Sal > > > Øster Allé 48 | DK - 2100 København > > > > > > <http://dk.linkedin.com/in/JonKjaerAmundsen/> > > > > > > Intelligent Offentlig Samhandel > > > *Før, under og efter udbud* > > > > > > *Følg UdbudsVagten og markedet her Linkedin > > > <http://www.linkedin.com/groups?groupDashboard=&gid=1862353> * > > > > > > > > > Den ons. 16. okt. 2019 kl. 17.42 skrev Koen De Groote < > > > koen.degro...@limecraft.com>: > > > > > > > I'm trying to restore a couple of collections, and 1 keeps feeling. > > This > > > > happens to be the only one who's leader isn't on the host that the > > backup > > > > was taken from. > > > > > > > > > > > > The backup was done on server1, for all collections. > > > > > > > > For this collection that is failing, the Leader was on server2. All > > other > > > > collections had their leader on server1. All collections had 1 > replica, > > > on > > > > the other server. > > > > > > > > I would think that having the replica there would be enough to > perform > > a > > > > restore. > > > > > > > > Or does the backup need to happen on the actual leader? > > > > > > > > Kind regards, > > > > Koen De Groote > > > > > > > > > >