> We have 16 shards each approx 30GB - total is ~480GB. I'm also pretty sure > it's a network issue. Very interesting that you can index 20x the data in > 15 min! Not index but backup an index in 15min.
>>> It would also help to ensure your overseer is on a node with a role that > exempts it from any Solr index responsibilities. > How would I ensure this? First I'm hearing about this! Lookup roles and snitches and tags here: https://lucene.apache.org/solr/guide/7_7/rule-based-replica-placement.html <https://lucene.apache.org/solr/guide/7_7/rule-based-replica-placement.html> > On Aug 10, 2020, at 6:54 PM, Ashwin Ramesh <ash...@canva.com.INVALID > <mailto:ash...@canva.com.INVALID>> wrote: > > Hi Aroop, > > We have 16 shards each approx 30GB - total is ~480GB. I'm also pretty sure > it's a network issue. Very interesting that you can index 20x the data in > 15 min! > >>> It would also help to ensure your overseer is on a node with a role that > exempts it from any Solr index responsibilities. > How would I ensure this? First I'm hearing about this! > > Thanks for all the help!! > > On Tue, Aug 11, 2020 at 11:48 AM Aroop Ganguly > <aroopgang...@icloud.com.invalid <mailto:aroopgang...@icloud.com.invalid>> > wrote: > >> Hi Ashwin >> >> Thanks for sharing this detail. >> Do you mind sharing how big are each of these indices ? >> I am almost sure this is network capacity and constraints related per your >> aws setup. >> >> Yes if you can confirm that the backup is complete, or you just want the >> system to move on discarding the backup process, your removal of the backup >> flag from zookeeper will help Solr in moving on to the next task in the >> queue. >> >> It would also help to ensure your overseer is on a node with a role that >> exempts it from any Solr index responsibilities. >> >> >>> On Aug 10, 2020, at 6:43 PM, Ashwin Ramesh <ash...@canva.com.INVALID >>> <mailto:ash...@canva.com.INVALID>> >> wrote: >>> >>> Hey Aroop, the general process for our backup is: >>> - Connect all machines to an EFS drive (AWS's NFS service) >>> - Call the collections API to backup into EFS >>> - ZIP the directory once the backup is completed >>> - Copy the ZIP into an s3 bucket >>> >>> I'll probably have to see which part of the process is the slowest. >>> >>> On another note, can you simply remove the task from the ZK path to >>> continue the execution of tasks? >>> >>> Regards, >>> >>> Ash >>> >>> On Tue, Aug 11, 2020 at 11:40 AM Aroop Ganguly >>> <aroopgang...@icloud.com.invalid <mailto:aroopgang...@icloud.com.invalid>> >>> wrote: >>> >>>> 12 hours is extreme, we take backups of 10TB worth of indexes in 15 mins >>>> using the collection backup api. >>>> How are you taking the backup? >>>> >>>> Do you actually see any backup progress or u are just seeing the task in >>>> the overseer queue linger ? >>>> I have seen restore tasks hanging in the queue forever despite process >>>> completing in Solr 77 so wouldn’t be surprised this happens with backup >> as >>>> well. And also observed that unless that unless that task is removed >> from >>>> the overseer-collection-queue the next ones do not proceed. >>>> >>>> Also adding replicas while backup seems like overkill, why don’t you >> just >>>> have the appropriate replication factor in the first place and have >>>> autoAddReplicas=true for indemnity? >>>> >>>>> On Aug 10, 2020, at 6:32 PM, Ashwin Ramesh <ash...@canva.com.INVALID >>>>> <mailto:ash...@canva.com.INVALID>> >>>> wrote: >>>>> >>>>> Hi everybody, >>>>> >>>>> We are using solr 7.6 (SolrCloud). We notices that when the backup is >>>>> running, we cannot add any replicas to the collection. By the looks of >>>> it, >>>>> the job to add the replica is put into the Overseer queue, but it is >> not >>>>> being processed. Is this expected? And are there any workarounds? >>>>> >>>>> Our backups take about 12 hours. Maybe we should try optimize that too. >>>>> >>>>> Regards, >>>>> >>>>> Ash >>>>> >>>>> -- >>>>> ** >>>>> ** <https://www.canva.com/ <https://www.canva.com/>>Empowering the world >>>>> to design >>>>> Share accurate >>>>> information on COVID-19 and spread messages of support to your >> community. >>>>> >>>>> Here are some resources >>>>> < >>>> >> https://about.canva.com/coronavirus-awareness-collection/?utm_medium=pr&utm_source=news&utm_campaign=covid19_templates >> >> <https://about.canva.com/coronavirus-awareness-collection/?utm_medium=pr&utm_source=news&utm_campaign=covid19_templates> >>> >>>> >>>>> that can help. >>>>> <https://twitter.com/canva> <https://facebook.com/canva> >>>>> <https://au.linkedin.com/company/canva> <https://twitter.com/canva> >>>>> <https://facebook.com/canva> <https://au.linkedin.com/company/canva> >>>>> <https://instagram.com/canva> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>> >>>> >>> >>> -- >>> ** >>> ** <https://www.canva.com/>Empowering the world to design >>> Share accurate >>> information on COVID-19 and spread messages of support to your community. >>> >>> Here are some resources >>> < >> https://about.canva.com/coronavirus-awareness-collection/?utm_medium=pr&utm_source=news&utm_campaign=covid19_templates> >> >>> that can help. >>> <https://twitter.com/canva> <https://facebook.com/canva> >>> <https://au.linkedin.com/company/canva> <https://twitter.com/canva> >>> <https://facebook.com/canva> <https://au.linkedin.com/company/canva> >>> <https://instagram.com/canva> >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> >> >> > > -- > ** > ** <https://www.canva.com/ <https://www.canva.com/>>Empowering the world to > design > Share accurate > information on COVID-19 and spread messages of support to your community. > > Here are some resources > <https://about.canva.com/coronavirus-awareness-collection/?utm_medium=pr&utm_source=news&utm_campaign=covid19_templates > > <https://about.canva.com/coronavirus-awareness-collection/?utm_medium=pr&utm_source=news&utm_campaign=covid19_templates>> > > that can help. > <https://twitter.com/canva <https://twitter.com/canva>> > <https://facebook.com/canva <https://facebook.com/canva>> > <https://au.linkedin.com/company/canva > <https://au.linkedin.com/company/canva>> <https://twitter.com/canva > <https://twitter.com/canva>> > <https://facebook.com/canva <https://facebook.com/canva>> > <https://au.linkedin.com/company/canva > <https://au.linkedin.com/company/canva>> > <https://instagram.com/canva <https://instagram.com/canva>> > > > > > > > > > >