Alex,

How are your values "random" if you provide correct values? Even for negative values you could use MIN (`MPI_MIN`) as the reduction operation to pick one value and return it. What is the problem with `MPI_Abort`? It does seem to do what you want.
George.

On Tue, Jul 18, 2023 at 4:38 AM Alexander Stadik via users <users@lists.open-mpi.org> wrote:

> Hey everyone,
>
> I have been working with CUDA-aware OpenMPI for some time now, and a while
> back I developed a small exception-handling framework covering MPI and
> CUDA exceptions.
> Currently I am using MPI_Abort with custom error numbers to terminate
> everything elegantly, which works well: in case of a crash I just read
> the logfile.
>
> Now I was wondering how one can handle return / exit codes properly
> between processes, since we would like to filter non-zero exits by return
> code.
>
> One way is a simple Allreduce (in my case) + exit instead of Abort. But
> the problem seems to be that the values are always "random" (since I was
> using negative codes); only with MPI error codes does it seem to work
> correctly. But the usage of those is limited.
>
> Any suggestions on how to do this / how it can work properly?
>
> BR Alex
>
> DI Alexander Stadik
> Head of Large Scale Solutions
> Research & Development | Large Scale Solutions
> ESS Engineering Software Steyr GmbH
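
A minimal sketch of the Allreduce-with-MIN-then-exit approach discussed above, in C. It rests on two assumptions that the thread does not confirm. First, the "random" values may come from the fact that a POSIX parent process only sees the low 8 bits of an exit status, so a negative code such as -3 shows up as 253; the sketch therefore folds the agreed code into 1..255 before returning it. Second, the error convention is taken to be 0 for success and negative for failure. The names `to_exit_status`, `run_application` and the `USE_MPI` macro are hypothetical and exist only for this illustration; the MPI part is compiled only when `USE_MPI` is defined, so the mapping helper can be checked on its own.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical helper: fold an application error code (0 = success,
 * non-zero = failure, negative by convention here) into the 0..255
 * range that a POSIX parent process can observe. */
int to_exit_status(int code)
{
    if (code == 0)
        return EXIT_SUCCESS;
    /* Use the magnitude: -1 -> 1, -2 -> 2, ...; saturate at 255. */
    long long magnitude = code < 0 ? -(long long)code : (long long)code;
    int status = magnitude > 255 ? 255 : (int)magnitude;
    assert(status >= 1 && status <= 255);
    return status;
}

#ifdef USE_MPI
#include <mpi.h>

/* Hypothetical stand-in for the real work: returns 0 or a negative code. */
static int run_application(int rank)
{
    return rank == 1 ? -3 : 0;  /* pretend rank 1 fails with code -3 */
}

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);

    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    int local = run_application(rank);
    int global = 0;

    /* MPI_MIN selects the most negative code, so every rank ends up
     * with the same value and exits with the same status. */
    MPI_Allreduce(&local, &global, 1, MPI_INT, MPI_MIN, MPI_COMM_WORLD);

    MPI_Finalize();
    return to_exit_status(global);
}
#endif
```

Built with something like `mpicc -DUSE_MPI exit_code.c -o app` (the file name is arbitrary) and started with `mpirun -np 4 ./app`, every rank returns the same status, so it should not matter which rank's exit code the launcher reports. Note that this only works when all ranks actually reach the `MPI_Allreduce`; if a rank can fail in a way that prevents it from participating, `MPI_Abort` remains the safer way to bring the job down.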