Re: [lustre-discuss] SC19 bof slides

2020-01-15 Thread Spitz, Cory James
Hi, Ken. Thanks for sharing these. Peter, I have a question and a comment/suggestion for you about the "Lustre 2.13 Contributions" pie charts. For the LINES OF CODE contributors, I don't see ORNL. I've got to figure that there must be some oversight. I'm sure that James Simmons alone contri

Re: [lustre-discuss] [Lwg] SC19 bof slides

2020-01-16 Thread Spitz, Cory James
ce. I hope that James will forgive me this oversight! Peter On 2020-01-15, 10:37 AM, "Spitz, Cory James" wrote: Hi, Ken. Thanks for sharing these. Peter, I have a question and a comment/suggestion for you about the "Lustre 2.13 Contributi

Re: [lustre-discuss] LUSTRE - Installation on DEBIAN 10.x ????

2020-02-11 Thread Spitz, Cory James
No, you are not the only one. There is help for Debian users at https://wiki.whamcloud.com/display/PUB/Building+Lustre+from+Source. Unfortunately, the guide at http://wiki.debian.org/Lustre is ancient old. https://wiki.whamcloud.com/display/PUB/Build+Lustre+MASTER+client+on+Debian+10.1.0+from+

Re: [lustre-discuss] LNET ports and connections

2020-02-19 Thread Spitz, Cory James
Hello, Aurélien. I'm guessing that if you have modern Lustre then idle clients may disconnect, and so you might regularly see Lustre servers initiate the socket connection again. I'm not sure how to show that that it is the case or not. Perhaps someone else can chime in on whether that could

Re: [lustre-discuss] DF bug with lustre 2.12.4

2020-02-27 Thread Spitz, Cory James
Hello, Kevin. I see from LU-13285 that Nathan D. pointed you at LU-13296. I left a comment in the ticket as well. I think that you can try the patch from LU-13296 with your reproducer. -Cory On 2/21/20, 10:08 AM, "lustre-discuss on behalf of Konzem, Kevin P" mailto:lustre-discuss-boun...@li

Re: [lustre-discuss] DNE2 settings are not propagated?

2020-03-20 Thread Spitz, Cory James
As Andreas says, it isn’t really recommended, but you can get sub-directories to inherit the striping by using the -D, --default option to `lfs mkdir` or `lfs setdirstripe`. From the man page: -D, --default Set the default striping pattern of subdirectories. Newly cre-

Re: [lustre-discuss] Linux 5.6 Kernel Support

2020-04-06 Thread Spitz, Cory James
> I notice that there have been various commits needed for support of recent > Linux kernels FYI, 2.13.53 was tagged today, but it may not have included what you are looking for. See the lustre/ChangeLog file (https://git.whamcloud.com/?p=fs/lustre-release.git;a=blob;f=lustre/ChangeLog;h=dfb5f

Re: [lustre-discuss] mlx4 and mxl5 mix environment

2020-06-26 Thread Spitz, Cory James
Megan, You wrote: PS. [I am willing to add/contribute to the http://wiki.lustre.org/Infiniband_Configuration_Howto but I think my account for wiki editing has expired (at least the one I thought I had did not work). Thank you for your offe

Re: [lustre-discuss] Permission denied on lfs getstripe

2020-07-06 Thread Spitz, Cory James
> Could you point me to the Jira site? https://jira.whamcloud.com -Cory On 7/2/20, 6:01 PM, "lustre-discuss on behalf of Chang, Christopher" mailto:lustre-discuss-boun...@lists.lustre.org> on behalf of christopher.ch...@nrel.gov> wrote: Hi Andreas, Inde

Re: [lustre-discuss] tgt grant error ..

2020-08-01 Thread Spitz, Cory James
+ Vladimir I _think_ that the proposed fix for LU-13766 came from Vladimir under LU-12687. Is there a different proposed patch from Whamcloud? It seems that we all do have other non-direct IO grant problems to sort out. I guess I’m kinda fishing to see if there is another grant patch out ther

Re: [lustre-discuss] Lustre optimize for spares data files ?

2020-09-17 Thread Spitz, Cory James
Hello, T.H.Hsieh. You asked, "... is it possible to enable file compression on the ZFS backend only without any side effect in the whole Lustre file system ?" The answer is, yes. As Robert Redl explained earlier, the OST can deal with some objects compressed and others not, even while toggling

Re: [lustre-discuss] setdirstripe

2020-10-23 Thread Spitz, Cory James
Hello, Michael. I noticed in a previous message to the list you said that you were “running clusterstor 3.2”. I assume that your 2.11 servers are really running the Cray/HPE 2.11 as part of Neo 3.2. If so, please know that Neo 3.2 does not support DNE2 striped directories due to numerous poss

Re: [lustre-discuss] setdirstripe

2020-10-30 Thread Spitz, Cory James
me was the phrasing of the error message. had it been "this is disabled" i would have remembered thanks On Sat, Oct 24, 2020 at 1:33 AM Spitz, Cory James wrote: > > Hello, Michael. > > > > I noticed in a previous message to the list you said that you were “running > c

Re: [lustre-discuss] LNET IB intermittent connection

2021-02-11 Thread Spitz, Cory James
Hi, Nate. You asked, “can LNET be easily configured to go over the @tcp connection when the @o2ib flakes out?” Yes, you can use LNet Multi-Rail for it and that _is_ covered in the “fine manual”, chapter 16 ☺ https://doc.lustre.org/lustre_manual.xhtml#lnetmr -Cory On 2/10/21, 4:54 PM, "lustre-

Re: [lustre-discuss] LNET IB intermittent connection

2021-02-12 Thread Spitz, Cory James
ith @o2ib. You need the user defined selection policy feature for that, and that feature is not slated to arrive until after 2.14 (afaik). Chris Horn From: lustre-discuss mailto:lustre-discuss-boun...@lists.lustre.org>> on behalf of "Spitz, Cory James" mailto:cory.sp...@hpe.c

Re: [lustre-discuss] File Level Redundancy - Data Movement

2021-02-18 Thread Spitz, Cory James
Good questions, Indivar. 1) The Lustre Operations Manual doesn’t make this clear at https://doc.lustre.org/lustre_manual.xhtml#flr.operations.resyncmirror , but a mirror sync will be completed in the context of the client that it was executed on. 2) Mirror resync is entirely manual today. Aga

Re: [lustre-discuss] File Level Redundancy - Data Movement

2021-02-21 Thread Spitz, Cory James
edicated client will have to be very fast too. 3. In future it may be possible to move these files using HSM agents. Regards, Indivar Nair On Fri, Feb 19, 2021 at 1:51 AM Spitz, Cory James mailto:cory.sp...@hpe.com>> wrote: Good questions, Indivar. 1) The Lustre Operations Manual do

Re: [lustre-discuss] Cannot move data after upgrading to Lustre 2.12.6

2021-02-22 Thread Spitz, Cory James
Hello, T.H.Hsieh. Your report sounds familiar to me. Although you are concerned about upgrades from 1.8.x, there were some other troubles reported when updating from earlier 2.x. You might want to take a closer look at https://jira.whamcloud.com/browse/LU-13392. I didn’t review it deeply and

Re: [lustre-discuss] Stray files after failed lfs_migrate

2021-03-05 Thread Spitz, Cory James via lustre-discuss
> lfsck needs to be done with the whole volume offline? No, in Lustre 2.x lfsck is an online tool. Per https://doc.lustre.org/lustre_manual.xhtml#idm139675950896912: Disaster recovery tool: The Lustre file system provides an online distributed file system check (LFSCK) that can restore consistenc

Re: [lustre-discuss] Experience with DDN AI400X

2021-03-30 Thread Spitz, Cory James via lustre-discuss
Hello, Megan. I was curious why you made this comment: > A general example is a box with lustre-client 2.10.4 is not going to be > completely happy with a new 2.12.x on the lustre network In general, I think that the two LTS release are very interoperable. What incompatibility are you referring

Re: [lustre-discuss] Unable to mount new OST

2021-07-06 Thread Spitz, Cory James via lustre-discuss
What OST index (number) were you trying to add? Andreas is right: Note that your "--index=0051" value is probably interpreted as an octal number "41", it should be "--index=0x0051" or "--index=0x51" (hex, to match the OST device name) or "--index=81" (decimal). And you said: I'm aware that inde

Re: [lustre-discuss] Full OST

2021-09-16 Thread Spitz, Cory James via lustre-discuss
What versions do you have on your servers and clients? Do you have some wide gap in versions? Is your sever very old? There was a change to the object deletion protocol that you may need to contend with. It was related to LU-5814. If you don't have an older server then this is not your prob

Re: [lustre-discuss] Upgrading lustre servers

2022-02-23 Thread Spitz, Cory James via lustre-discuss
Kurt, Also, please be aware of https://jira.whamcloud.com/browse/LU-15177. A version interop check won’t allow MDT-MDT version skew > 3 minor versions. Note, that check is just about MDT versions. -Cory On 2/21/22, 8:59 PM, "lustre-discuss" wrote: Kurt, The phrasing is a little confusing

Re: [lustre-discuss] Upgrading lustre servers

2022-02-23 Thread Spitz, Cory James via lustre-discuss
No, you won’t have MDT interop to worry about in that case. -Cory On 2/23/22, 8:44 AM, "Kurt Strosahl" wrote: We only have a single, combined, mdt-mds... Would that impact it? ____ From: Spitz, Cory James Sent: Wednesday, February 23, 2022 9:38 AM T

Re: [lustre-discuss] question regarding du vs df on lustre

2022-04-23 Thread Spitz, Cory James via lustre-discuss
If it is taking too long for targets to sync-up you can tune the activity and speed things up by adjusting some osp tunables. First, monitor osp sync_in_progress and destroys_in_flight to see if that’s what’s going on. Then you can tune up the MDS’s osp’s max_rpcs_in_progress if necessary. -C

Re: [lustre-discuss] Changing default recovery window time settings

2022-08-09 Thread Spitz, Cory James via lustre-discuss
The classical way to put a limit on recovery is to use the recovery_time_soft and recovery_time_hard mount options. See the mount.lustre options: https://doc.lustre.org/lustre_manual.xhtml#idm139974521647280 recovery_time_soft=timeout Allows timeout seconds for clients to reconnect for recovery

Re: [lustre-discuss] Lustre recycle bin

2022-10-17 Thread Spitz, Cory James via lustre-discuss
What version(s) are you using? Do you have an old client and a new-ish server? Very old client versions will disagree with the MDSes about how to clean up objects, resulting in orphans. -Cory On 10/17/22, 3:44 AM, "lustre-discuss" wrote: Thank-you! -Original Message- From: Alastair

Re: [lustre-discuss] Changing OST servicenode

2022-11-16 Thread Spitz, Cory James via lustre-discuss
If you are only changing the nid then you do not need to follow the full writeconf procedure. See https://doc.lustre.org/lustre_manual.xhtml#lustremaint.changingservernid : If you need to change only the NID of the MDT or OST, the replace_nids command can simplify this process. The replace_nids