Hi Shamim, Thank you for providing additional information on the issue. Let's address your questions one by one:
* Memory Usage If the error message is about "grid structure consistency," that usually points to an issue setting up the Carpet grids rather than a lack of memory. Maybe Carpet is having a problem breaking up the grids beyond 8 cores? I suggest you submit a ticket to https://bitbucket.org/einsteintoolkit/tickets/ with the full instructions on reproducing the problem. * Invalid MIT-MAGIC-COOKIE-1 Key You're correct in your assumption that the Invalid MIT-MAGIC-COOKIE-1 key issue might not be directly causing the problem you're facing with the BNSM runs, especially since the simulations ran successfully despite the warning. However, I'd still recommend addressing this issue to eliminate it as a potential confounding factor. -Zach * * * Zachariah Etienne Assoc. Prof. of Physics, U. of Idaho Adjunct Assoc. Prof. of Physics & Astronomy, West Virginia U. https://etienneresearch.com https://blackholesathome.net On Sat, Oct 14, 2023 at 6:36 AM Shamim Haque 1910511 <sham...@iiserb.ac.in> wrote: > Hi Zach, > > I retried it on a workstation with fairly good configuration (80 threads, > 256 GB RAM), where I ran test BNSM simulations. The problem remains the > same. > > I cannot use more than 8 procs (num-threads 1) for this parfile. > Otherwise, it says, 'grid structure consistency check failed'. So, I > guess this simulation is not demanding too much memory. But is there a > better way to check how much memory is expected? > > Secondly, I checked the error files from my old BNSM runs, those files > also contain the following lines indicating Invalid MIT-MAGIC-COOKIE-1 key: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > *+ mpirun --use-hwthread-cpus -np 10 > /home/astro208/simulations/t1.4_had_1.4M/SIMFACTORY/exe/cactus_sim -L 3 > /home/astro208/simulations/t1.4_had_1.4M/output-0000/t1.4_had_1.4M.parInvalid > MIT-MAGIC-COOKIE-1 > key--------------------------------------------------------------------------WARNING: > No preset parameters were found for the device that Open MPIdetected: > Local host: astro Device name: irdma0 Device vendor > ID: 0x8086 Device vendor part ID: 14289Default device parameters will > be used, which may result in lowerperformance. You can edit any of the > files specified by thebtl_openib_device_param_files MCA parameter to set > values for yourdevice.NOTE: You can turn off this warning by setting the > MCA parameter btl_openib_warn_no_device_params_found to > 0.--------------------------------------------------------------------------No > OpenFabrics connection schemes reported that they were able to beused on a > specific port. As such, the openib BTL (OpenFabricssupport) will be > disabled for this port. Local host: astro Local device: > irdma1 Local port: 1 CPCs attempted: > udcm--------------------------------------------------------------------------Open > MPI failed an OFI Libfabric library call (fi_endpoint). This is > highlyunusual; your job may behave unpredictably (and/or abort) after > this. Local host: astro Location: mtl_ofi_component.c:629 Error: > Unspecified error > (256)--------------------------------------------------------------------------* > > These simulations ran successfully. Even though Invalid-MAGIC-COOKIE could > be a separate issue, it may not be the source of this particular problem, > but I could be totally wrong here. > > Please let me know your thoughts on this. > > Regards > Shamim Haque > Senior Research Fellow (SRF) > Department of Physics > IISER Bhopal > Shamim Haque > Senior Research Fellow (SRF) > Department of Physics > IISER Bhopal > > ᐧ > > On Thu, Oct 12, 2023 at 8:15 PM Zach Etienne <zache...@gmail.com> wrote: > >> Hi Shamim, >> >> Invalid MIT-MAGIC-COOKIE-1 key: This is related to X11 forwarding and >> authentication. The MIT-MAGIC-COOKIE-1 is an authentication scheme used by >> X11, the Linux windowing system. When you're trying to run a program that >> requires graphical output on a remote machine, the X11 system uses these >> "magic cookies" to authenticate the user. If there's a mismatch or the key >> is invalid, you will be denied permission. >> >> We believe the segmentation fault is probably due to running a parameter >> file on a computer that doesn't have enough memory. The error file seemed >> to indicate running on a laptop. >> >> -Zach >> >> * * * >> Zachariah Etienne >> Assoc. Prof. of Physics, U. of Idaho >> Adjunct Assoc. Prof. of Physics & Astronomy, West Virginia U. >> https://etienneresearch.com >> https://blackholesathome.net >> >> >> On Thu, Oct 12, 2023 at 6:34 AM Shamim Haque 1910511 < >> sham...@iiserb.ac.in> wrote: >> >>> Hello all, >>> >>> I am trying to use the bbig.par parameter file to extract the metric and >>> thermodynamic information for an isolated star ID from Lorene. This parfile >>> is available in Meudon_Mag_NS/par folder. >>> >>> I tried this par file with the given Lorene ID. It is supposed to exit >>> after iter 0 with IO outfiles. However, it does not give the requested >>> outputs upon completion of the simulation. I can see the Lorene information >>> is read correctly in the out file, but I am unable to find out the problem. >>> >>> I need some help with this. I have attached the parfile, ID, outfile and >>> error file for reference. >>> >>> Regards >>> Shamim Haque >>> Senior Research Fellow (SRF) >>> Department of Physics >>> IISER Bhopal >>> ᐧ >>> _______________________________________________ >>> Users mailing list >>> Users@einsteintoolkit.org >>> http://lists.einsteintoolkit.org/mailman/listinfo/users >>> >>
_______________________________________________ Users mailing list Users@einsteintoolkit.org http://lists.einsteintoolkit.org/mailman/listinfo/users