-- Bas van der Vlies
> On 9 dec. 2015, at 22:44, Michel Béland <michel.bel...@calculquebec.ca> wrote:
>
> Hi Joseph,
>
>> For whatever it is worth, Maui has some serious bugs when it is in
>> full use.
>
> We had problems initially when we first used Maui on our big cluster.
> Most of them were fixed by increasing some limits in the include files.
> We also had a problem with some idle jobs not running (showstart would
> show they should run immediately, but they would not). This was fixed by
> commenting out some code. This was in a patch published on this list
> many years ago, but it never made it to a release.
>
> The bugs caused by the Torque 5 change in attribute format are really
> the show stoppers for us, hence my desire to fix them.
>
> The other bugs I can live with, for now.
>

We use Maui with Torque 5, and for us everything is OK. We also have Moab, but it does not support the Maui features that we have implemented and need in our environment.

>> I had Maui running for a VERY long time and it would behave
>> differently when it was mostly idle than when it was under heavy use -
>> we have thousands of cores.
>>
>> In my frustration I downloaded and enabled the "moab" eval and, as if by
>> magic, all of the weirdness we were seeing in Maui went away over a
>> 2-month period. After two months of use, when I reverted back to
>> Maui, all of the same weirdness came back.
>>
>> We eventually dropped Maui and went with Son of Grid Engine, as Moab
>> was price prohibitive for us. Grid Engine has been working very well,
>> albeit via several home-grown custom modifications.
>
> Good for you, but Torque still needs a free alternative to Moab.
> pbs_sched is out of the question, unless it is heavily modified to add
> missing features like backfilling. Maui is the closest approximation to
> a usable free scheduler for Torque. It would be nice if users helped to
> fix the bugs instead of giving up, but I understand that users do not
> necessarily have time, skill or will to do so.
>

I totally agree. If you have any patches, please send them; we can then apply them or put the result on the git server.

>
> --
> Michel Béland, scientific computing analyst
> michel.bel...@calculquebec.ca
> office S-250, pavillon Roger-Gaudry (principal), Université de Montréal
> telephone: 514 343-6111 ext. 3892 fax: 514 343-2155
> Calcul Québec (www.calculquebec.ca)
> Calcul Canada (calculcanada.ca)
>
> _______________________________________________
> mauiusers mailing list
> mauiusers@supercluster.org
> http://www.supercluster.org/mailman/listinfo/mauiusers