Re: Looking for suggestions to deal with large backups not completing in 24-hours

2018-07-05 Thread Skylar Thompson
We've implemented file count quotas in addition to our existing byte
quotas to try to avoid this situation. You can improve some things
(metadata on SSDs, maybe get an accelerator node if Isilon still offers
those) but the fact is that metadata is expensive in terms of CPU (both
client and server) and disk.

We chose 1 million objects/TB of allocated disk space. We sort of compete
with a storage system offered by our central IT organization, and picked a
limit higher than what they would provide.

To be honest, though, we're retiring our Isilon systems because the
performance/scalability/cost ratios just aren't as great as they used to
be. Our new storage is GPFS and mmbackup works much better with huge number
of files, though it's still not great. In particular, the filelist
generation is based around UNIX sort which is definitely a memory pig,
though it can be split across multiple systems so can scale out pretty
well.

On Thu, Jul 05, 2018 at 02:52:27PM -0400, Zoltan Forray wrote:
> As I have mentioned in the past, we have gone through large migrations to
> DFS based storage on EMC ISILON hardware.  As you may recall, we backup
> these DFS mounts (about 90 at last count) using multiple Windows servers
> that run multiple ISP nodes (about 30-each) and they access each DFS
> mount/filesystem via -object=\\rams.adp.vcu.edu\departmentname.
>
> This has lead to lots of performance issue with backups and some
> departments are now complain that their backups are running into
> multiple-days in some cases.
>
> One such case in a department with 2-nodes with over 30-million objects for
> each node.  In the past, their backups were able to finish quicker since
> they were accessed via dedicated servers and were able to use Journaling to
> reduce the scan times.  Unless things have changed, I believe Journling is
> not an option due to how the files are accessed.
>
> FWIW, average backups are usually <50k files and <200GB once it finished
> scanning.
>
> Also, the idea of HSM/SPACEMANAGEMENT has reared its ugly head since many
> of these objects haven't been accessed in many years old. But as I
> understand it, that won't work either given our current configuration.
>
> Given the current DFS configuration (previously CIFS), what can we do to
> improve backup performance?
>
> So, any-and-all ideas are up for discussion.  There is even discussion on
> replacing ISP/TSM due to these issues/limitations.
>
> --
> *Zoltan Forray*
> Spectrum Protect (p.k.a. TSM) Software & Hardware Administrator
> Xymon Monitor Administrator
> VMware Administrator
> Virginia Commonwealth University
> UCC/Office of Technology Services
> www.ucc.vcu.edu
> zfor...@vcu.edu - 804-828-4807
> Don't be a phishing victim - VCU and other reputable organizations will
> never use email to request that you reply with your password, social
> security number or confidential personal information. For more details
> visit http://phishing.vcu.edu/

--
-- Skylar Thompson (skyl...@u.washington.edu)
-- Genome Sciences Department, System Administrator
-- Foege Building S046, (206)-685-7354
-- University of Washington School of Medicine


Re: Looking for suggestions to deal with large backups not completing in 24-hours

2018-07-05 Thread Harris, Steven
Zoltan

I kind of agree with Ung Yi

What is the purpose of your TSM  backups?  DR?  Long term retention for 
auditability/sarbox/other regulation?

It may well be that a daily or even more frequent snapshot regime might be the 
best way to get back that recently lost/deleted/corrupted file.
Use a TSM backup of a weekly point-of-consistency snapshot as your long term 
strategy.

Of course a better option would be an embedded TSM client on the Isilon itself, 
but the commercial realities are that will never happen.

Cheers

Steve
Steven Harris
TSM Admin
Canberra Australia 

-Original Message-
From: ADSM: Dist Stor Manager [mailto:ADSM-L@VM.MARIST.EDU] On Behalf Of Yi, Ung
Sent: Friday, 6 July 2018 6:36 AM
To: ADSM-L@VM.MARIST.EDU
Subject: Re: [ADSM-L] Looking for suggestions to deal with large backups not 
completing in 24-hours

Hello,
I don’t know much about Isilon.
There might be SAN level snap backups option for Isilon.

For our Data domain, we replicate from Main site to DR site, then take snap at 
our DR site every night. Each snap is consider a backup.

Thank you.

-Original Message-
From: ADSM: Dist Stor Manager [mailto:ADSM-L@VM.MARIST.EDU] On Behalf Of Zoltan 
Forray
Sent: Thursday, July 05, 2018 2:52 PM
To: ADSM-L@VM.MARIST.EDU
Subject: Looking for suggestions to deal with large backups not completing in 
24-hours

As I have mentioned in the past, we have gone through large migrations to DFS 
based storage on EMC ISILON hardware.  As you may recall, we backup these DFS 
mounts (about 90 at last count) using multiple Windows servers that run 
multiple ISP nodes (about 30-each) and they access each DFS mount/filesystem 
via -object=\\rams.adp.vcu.edu\departmentname.

This has lead to lots of performance issue with backups and some departments 
are now complain that their backups are running into multiple-days in some 
cases.

One such case in a department with 2-nodes with over 30-million objects for 
each node.  In the past, their backups were able to finish quicker since they 
were accessed via dedicated servers and were able to use Journaling to reduce 
the scan times.  Unless things have changed, I believe Journling is not an 
option due to how the files are accessed.

FWIW, average backups are usually <50k files and <200GB once it finished 
scanning.

Also, the idea of HSM/SPACEMANAGEMENT has reared its ugly head since many of 
these objects haven't been accessed in many years old. But as I understand it, 
that won't work either given our current configuration.

Given the current DFS configuration (previously CIFS), what can we do to 
improve backup performance?

So, any-and-all ideas are up for discussion.  There is even discussion on 
replacing ISP/TSM due to these issues/limitations.

--
*Zoltan Forray*
Spectrum Protect (p.k.a. TSM) Software & Hardware Administrator Xymon Monitor 
Administrator VMware Administrator Virginia Commonwealth University UCC/Office 
of Technology Services 
https://urldefense.proofpoint.com/v2/url?u=http-3A__www.ucc.vcu.edu&d=DwIBaQ&c=k9MF1d71ITtkuJx-PdWme51dKbmfPEvxwt8SFEkBfs4&r=p7OfdQbObZllF-mnDqjrQg&m=jX8fSV2xXtioczSetX1viQa6EzVNOcZlBX9ddwWGXRM&s=KsUuBwu8G3pWJ7R7hedi0ZISk0CjIRrWQMJneyjNxD4&e=
zfor...@vcu.edu - 804-828-4807
Don't be a phishing victim - VCU and other reputable organizations will never 
use email to request that you reply with your password, social security number 
or confidential personal information. For more details visit 
https://urldefense.proofpoint.com/v2/url?u=http-3A__phishing.vcu.edu_&d=DwIBaQ&c=k9MF1d71ITtkuJx-PdWme51dKbmfPEvxwt8SFEkBfs4&r=p7OfdQbObZllF-mnDqjrQg&m=jX8fSV2xXtioczSetX1viQa6EzVNOcZlBX9ddwWGXRM&s=xiPt_TkUv02i1b7VQfKybQwokZGIKegAHQtBFG_G19U&e=

This message and any attachment is confidential and may be privileged or 
otherwise protected from disclosure. You should immediately delete the message 
if you are not the intended recipient. If you have received this email by 
mistake please delete it from your system; you should not copy the message or 
disclose its content to anyone. 

This electronic communication may contain general financial product advice but 
should not be relied upon or construed as a recommendation of any financial 
product. The information has been prepared without taking into account your 
objectives, financial situation or needs. You should consider the Product 
Disclosure Statement relating to the financial product and consult your 
financial adviser before making a decision about whether to acquire, hold or 
dispose of a financial product. 

For further details on the financial product please go to http://www.bt.com.au 

Past performance is not a reliable indicator of future performance.


Re: Looking for suggestions to deal with large backups not completing in 24-hours

2018-07-05 Thread Yi, Ung
Hello,
I don’t know much about Isilon.
There might be SAN level snap backups option for Isilon.

For our Data domain, we replicate from Main site to DR site, then take snap at 
our DR site every night. Each snap is consider a backup.

Thank you.

-Original Message-
From: ADSM: Dist Stor Manager [mailto:ADSM-L@VM.MARIST.EDU] On Behalf Of Zoltan 
Forray
Sent: Thursday, July 05, 2018 2:52 PM
To: ADSM-L@VM.MARIST.EDU
Subject: Looking for suggestions to deal with large backups not completing in 
24-hours

As I have mentioned in the past, we have gone through large migrations to
DFS based storage on EMC ISILON hardware.  As you may recall, we backup
these DFS mounts (about 90 at last count) using multiple Windows servers
that run multiple ISP nodes (about 30-each) and they access each DFS
mount/filesystem via -object=\\rams.adp.vcu.edu\departmentname.

This has lead to lots of performance issue with backups and some
departments are now complain that their backups are running into
multiple-days in some cases.

One such case in a department with 2-nodes with over 30-million objects for
each node.  In the past, their backups were able to finish quicker since
they were accessed via dedicated servers and were able to use Journaling to
reduce the scan times.  Unless things have changed, I believe Journling is
not an option due to how the files are accessed.

FWIW, average backups are usually <50k files and <200GB once it finished
scanning.

Also, the idea of HSM/SPACEMANAGEMENT has reared its ugly head since many
of these objects haven't been accessed in many years old. But as I
understand it, that won't work either given our current configuration.

Given the current DFS configuration (previously CIFS), what can we do to
improve backup performance?

So, any-and-all ideas are up for discussion.  There is even discussion on
replacing ISP/TSM due to these issues/limitations.

--
*Zoltan Forray*
Spectrum Protect (p.k.a. TSM) Software & Hardware Administrator
Xymon Monitor Administrator
VMware Administrator
Virginia Commonwealth University
UCC/Office of Technology Services
https://urldefense.proofpoint.com/v2/url?u=http-3A__www.ucc.vcu.edu&d=DwIBaQ&c=k9MF1d71ITtkuJx-PdWme51dKbmfPEvxwt8SFEkBfs4&r=p7OfdQbObZllF-mnDqjrQg&m=jX8fSV2xXtioczSetX1viQa6EzVNOcZlBX9ddwWGXRM&s=KsUuBwu8G3pWJ7R7hedi0ZISk0CjIRrWQMJneyjNxD4&e=
zfor...@vcu.edu - 804-828-4807
Don't be a phishing victim - VCU and other reputable organizations will
never use email to request that you reply with your password, social
security number or confidential personal information. For more details
visit 
https://urldefense.proofpoint.com/v2/url?u=http-3A__phishing.vcu.edu_&d=DwIBaQ&c=k9MF1d71ITtkuJx-PdWme51dKbmfPEvxwt8SFEkBfs4&r=p7OfdQbObZllF-mnDqjrQg&m=jX8fSV2xXtioczSetX1viQa6EzVNOcZlBX9ddwWGXRM&s=xiPt_TkUv02i1b7VQfKybQwokZGIKegAHQtBFG_G19U&e=


Looking for suggestions to deal with large backups not completing in 24-hours

2018-07-05 Thread Zoltan Forray
As I have mentioned in the past, we have gone through large migrations to
DFS based storage on EMC ISILON hardware.  As you may recall, we backup
these DFS mounts (about 90 at last count) using multiple Windows servers
that run multiple ISP nodes (about 30-each) and they access each DFS
mount/filesystem via -object=\\rams.adp.vcu.edu\departmentname.

This has lead to lots of performance issue with backups and some
departments are now complain that their backups are running into
multiple-days in some cases.

One such case in a department with 2-nodes with over 30-million objects for
each node.  In the past, their backups were able to finish quicker since
they were accessed via dedicated servers and were able to use Journaling to
reduce the scan times.  Unless things have changed, I believe Journling is
not an option due to how the files are accessed.

FWIW, average backups are usually <50k files and <200GB once it finished
scanning.

Also, the idea of HSM/SPACEMANAGEMENT has reared its ugly head since many
of these objects haven't been accessed in many years old. But as I
understand it, that won't work either given our current configuration.

Given the current DFS configuration (previously CIFS), what can we do to
improve backup performance?

So, any-and-all ideas are up for discussion.  There is even discussion on
replacing ISP/TSM due to these issues/limitations.

--
*Zoltan Forray*
Spectrum Protect (p.k.a. TSM) Software & Hardware Administrator
Xymon Monitor Administrator
VMware Administrator
Virginia Commonwealth University
UCC/Office of Technology Services
www.ucc.vcu.edu
zfor...@vcu.edu - 804-828-4807
Don't be a phishing victim - VCU and other reputable organizations will
never use email to request that you reply with your password, social
security number or confidential personal information. For more details
visit http://phishing.vcu.edu/


Re: [EXT] [ADSM-L] VMFolder in VMware backups

2018-07-05 Thread Matthew McGeary
Good morning Hans,

Subfolders have never worked with the -vmfolder option, which has driven me 
crazy for years.  I gave up and went to cluster-level backups.

__
Matthew McGeary
Senior Advisor, Datacenter
Information Technology
T: (306) 933-8921
www.nutrien.com


-Original Message-
From: ADSM: Dist Stor Manager [mailto:ADSM-L@VM.MARIST.EDU] On Behalf Of Hans 
Christian Riksheim
Sent: Wednesday, July 4, 2018 12:21 AM
To: ADSM-L@VM.MARIST.EDU
Subject: [EXT] [ADSM-L] VMFolder in VMware backups

WARNING: This email originated from outside of the organization. Exercise 
caution when viewing attachments, clicking links, or responding to requests.


Any tip to include any sub folder? I have always believed they were included 
but now I see they are not. Not sure if this behavior has changed with 
versions. Not helping that the documentation tells nothing about this very 
important matter and turning yet another simple thing into a research project.

Hans C. Riksheim
For more information on Nutrien's email policy or to unsubscribe, click here: 
https://www.nutrien.com/important-notice
Pour plus de renseignements sur la politique de courrier électronique d’Nutrien 
ou pour vous désabonnez, cliquez ici : https://www.nutrien.com/avis-important