[lustre-discuss] Metrics Gathering into ELK stack

2020-12-09 Thread Sid Young
G'Day all, I am about to commission a new HPC over the holiday break, and in planning I am looking at gathering metrics from the Lustre cluster, most likely into an Elastic/Kibana stack. Are there any reliable Lustre-specific metrics tools that can push data to ELK or can generate JSON

Re: [lustre-discuss] More issues with cur_grant_bytes

2020-12-09 Thread Nathan Dauchy - NOAA Affiliate
On Tue, Dec 8, 2020 at 11:46 AM Kevin M. Hildebrand wrote:
> We appear to be tripping over the same issues reported recently by
> Tung-Han Hsieh and Simon Guilbault, namely that cur_grant_bytes is being
> reduced to a very small value and causing abysmal performance.
> I'm curious if anyone

Re: [lustre-discuss] More issues with cur_grant_bytes

2020-12-09 Thread Aaron Knister
Hi Kevin, Meg, Nathan,
I'm interested in some of the details about how you're hitting this with an eye to reproducing. Could you share any details about the following:
- Interconnect
- Client count
- Workload
Thanks!
-Aaron
On 12/8/20 1:46 PM, Kevin M. Hildebrand wrote: We appear to be

Re: [lustre-discuss] changelogs stop working

2020-12-09 Thread Stephane Thiell
Hi Thomas, Nodemap’s audit_mode is not defaulting to 1 on upgrade to Lustre 2.12. If you recently hit this issue after upgrading a filesystem from Lustre 2.10 to 2.12, please check that flag.