> -----Original Message----- > From: lustre-discuss-boun...@lists.lustre.org [mailto:lustre-discuss- > boun...@lists.lustre.org] On Behalf Of Dilger, Andreas > Sent: 8. november 2014 00:26 > To: Dragseth Roy Einar; lustre-discuss@lists.lustre.org > Subject: Re: [Lustre-discuss] Interpreting stats files > > On 2014/11/07, 4:06 PM, "Dragseth Roy Einar" <roy.drags...@uit.no> wrote: > > >Many thanks for the quick replies. lltop seems to be a good start for > >a tool to single out the heaviest IO users. Just need to create a > >wrapper that maps the node names to torque jobids. > > > >Have a nice weekend! > > If you have Lustre 2.4 or later, you can enable the "Jobstats" aka "JobID" > functionality in Lustre and it will handle the mapping of RPC statistics to > Torque jobids already. > > This is described in the Lustre User Manual. >
The job stats seems to be a really nice feature, but we are still on Lustre 2.1. I will look into it when we upgrade. I created a little python script that restructures the lltop output and identifies which jobs (and users) running on the nodes that hit the lustre file system hardest. It has been accepted into the contrib section of lltop, it only works for torque though. https://github.com/jhammond/lltop/tree/master/contrib/torque Again, thanks for the help and comments, Roy. r. _______________________________________________ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustre.org/mailman/listinfo/lustre-discuss