Re: [Breaking change] Move nxmutex to sched

Jukka Laitinen Fri, 14 Apr 2023 05:19:07 -0700

Hi,

I am not sure whether it is necessary to separate mutex and semaphore (although I do see the performance gain that it would give for mutex), but there is another related topic I would like to raise.

Currently, the semaphores don't work (at all) for CONFIG_BUILD_KERNEL. The simple reason is that the semaphores are allocated from the user-mapped memory, which is not always available for the kernel while scheduling or in interrupts. At the time when it is needed, there may be another memory map active for mmu.

There is also an issue with performance; every semaphore access needs to go to the kernel through syscall, although in principle the semaphore counter handling alone doesn't need that if the compiler & hw has the necessary atomic support.
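
A minimal sketch of such a counter fast path, assuming C11 <stdatomic.h> is available and assuming a negative count means that tasks are waiting. The type usem_counter_t and the functions usem_wait_fast / usem_post_fast are hypothetical names used only for illustration, not existing NuttX interfaces; the syscall fallback is only indicated by the return value.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical user-side counter; this is not the NuttX sem_t layout. */

typedef struct
{
  atomic_int count;   /* > 0: free slots, < 0: number of waiters (assumption) */
} usem_counter_t;

/* Decrement the counter without entering the kernel.
 * Returns true if the semaphore was taken on the fast path.
 * Returns false if the caller has to block, which is where a real
 * implementation would make the syscall.
 */

static bool usem_wait_fast(usem_counter_t *s)
{
  int old = atomic_fetch_sub_explicit(&s->count, 1, memory_order_acquire);
  return old > 0;
}

/* Increment the counter without entering the kernel.
 * Returns true if nobody was waiting.
 * Returns false if a waiter exists and must be woken via a syscall.
 */

static bool usem_post_fast(usem_counter_t *s)
{
  int old = atomic_fetch_add_explicit(&s->count, 1, memory_order_release);
  return old >= 0;
}
```

In the uncontended case both functions complete with a single atomic operation; only the contended cases (return value false) would need the kernel.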

We are especially interested in having real-time behaviour (priority based scheduling, priority inheritance...) AND running CONFIG_BUILD_KERNEL. We have used some methods to circumvent the issue, but for those I am not going into details as we don't have a publishable implementation ready.

A tempting way to fix the problem (which we didn't try out yet) would be separating the semaphores in two parts, kernel side structure and the user side structure. Something that zyfeier also did with the "futex" linux-like implementation. But, also this kind of implementation should be real-time - so when there is access to the semaphore via syscall (e.g. when the semaphore blocks), or when scheduling, the kernel must have O(1) access to the kernel side structure - no hashing / allocating etc. at runtime.

So to summarize, for CONFIG_BUILD_KERNEL the semaphores could *perhaps* work like this (this is not yet tried out, so please forgive me if something is forgotten):

- User-side semaphore handle would have the counter and a direct pointer (handle) to the kernel side structure (which can be passed to kernel in syscall).
- Kernel side structure would have the needed wait queue and sem holder structures (and flags?)
- Kernel side structure would be allocated at sem_init (AND if it was not initialized, allocate it at the time when it is needed?). To achieve real-time behaviour one should just call sem_init properly at startup of the application.
- Kernel side structures would be listed in tcb and cleaned up at task_group exit. Also some hard limit/management for how much kernel memory can one process eat from kernel heap is needed.
- Counter manipulation can be handled directly in libc in case compiler supports proper atomic operations, or syscall to kernel when there is no support available (this would be just performance optimization - next phase)
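
The split described above could be sketched roughly as follows. This is only an illustration under stated assumptions: all names (struct usem_s, struct ksem_s, struct group_s, usem_init, group_release_ksems) are hypothetical and not NuttX APIs, calloc/free stand in for the kernel heap allocator, and the wait queue and holder list are reduced to placeholder pointers.

```c
#include <stdatomic.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical kernel-side part: lives in kernel memory only. */

struct ksem_s
{
  struct ksem_s *flink;   /* Link in the owning group's list (for cleanup) */
  void *waitq;            /* Placeholder for the wait queue */
  void *holders;          /* Placeholder for the sem holder structures */
  unsigned int flags;
};

/* Hypothetical user-side part: lives in user-mapped memory. */

struct usem_s
{
  atomic_int count;       /* Counter, can be handled with atomics in libc */
  struct ksem_s *khandle; /* Direct handle passed to the kernel in syscalls */
};

/* Hypothetical per-group bookkeeping with a hard limit on kernel memory. */

struct group_s
{
  struct ksem_s *ksems;   /* All kernel-side structures of this group */
  size_t nksems;          /* Number currently allocated */
  size_t max_ksems;       /* Hard limit for this group */
};

/* Allocate the kernel-side part once, at init time, so that later
 * accesses through khandle are O(1) with no hashing or allocation.
 * Returns 0 on success, -1 if the limit is reached or memory is exhausted.
 */

int usem_init(struct group_s *group, struct usem_s *sem, int value)
{
  struct ksem_s *k;

  if (group->nksems >= group->max_ksems)
    {
      return -1;
    }

  k = calloc(1, sizeof(*k));  /* Stand-in for a kernel heap allocation */
  if (k == NULL)
    {
      return -1;
    }

  k->flink = group->ksems;
  group->ksems = k;
  group->nksems++;

  atomic_init(&sem->count, value);
  sem->khandle = k;
  return 0;
}

/* Free every kernel-side structure when the task group exits. */

void group_release_ksems(struct group_s *group)
{
  while (group->ksems != NULL)
    {
      struct ksem_s *k = group->ksems;
      group->ksems = k->flink;
      free(k);
    }

  group->nksems = 0;
}
```

Since the handle is stored in user-writable memory, a real kernel would presumably also have to validate it before use; that check is left out of the sketch.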

Whether it is feasible to do it only for CONFIG_BUILD_KERNEL, or as a common implementation for all build modes, I didn't think of yet. I am also not sure whether the re-design of semaphore could also lead to better wrapping of it for mutex use, but this is also possible. In that case it could *maybe* solve the performance issue zyfeier tried to tackle.

This is just one idea, but somehow the problem of not working semaphores in CONFIG_BUILD_KERNEL should be tackled. I wonder if this is something we should experiment with? If someone is interested in such an experiment, please let me know. Or if someone is interested in doing this experiment, please let me know as well, so we don't end up doing duplicate work :)


Br,
Jukka

Ps. I think that in the current implementation the nxmutex code is inlined everywhere, increasing code size. Not a huge issue for me, but increasing code size should be managed....


On 7.4.2023 5.18, zyfeier wrote:

Thank you very much for the example you provided. What I want to point out is that this is not just about "just delete / replace what is already out there working fine". Because the counting semaphore has to support multiple holders, the performance of the mutex is much worse than in other RTOSes (with a performance gap of 10%), but these operations are not necessary for the mutex. That's why there is an idea to separate the mutex and semaphore.
However, if everyone thinks that separating the mutex and semaphore is a bad idea, then we need to think of other methods. Do you have any better methods to offer?
Sent from Mail for Windows <https://go.microsoft.com/fwlink/?LinkId=550986>

*From: *Tomek CEDRO <mailto:to...@cedro.info>
*Sent: *April 6, 2023 22:36
*To: *dev@nuttx.apache.org
*Subject: *Re: [Breaking change] Move nxmutex to sched

On Thu, Apr 6, 2023 at 2:58 PM Gregory Nutt wrote:
> Oh my God!  That sounds terrible!  Does this change actually do
> /anything /positive.
Look Zyfeier, it's not that we oppose development, we want the development to be done the right way that will bring an elegant, coherent, standard-compliant solution as a result :-)
Aside from my previous remark on Linux (along with other commercial OS) "enforced changes", let's think about Greg's "does this change actually do /anything /positive" question with another example.
Take a look at the WS2812 RGB Smart LED. They decided to introduce "an innovation" by changing the Pin1 marking on the casing and put that mark on pin 3 instead. The whole world uses the Pin1 marking to quickly align a component pinout, so at first glance you can see where pin 1 of the component is; also most chips use VCC there so you can quickly measure things, nothing fancy, everyone knows that. Now take a look at the pcb design footprint (bottom layer mirrored) and the led datasheet.
You can clearly see that putting the Pin1 casing mark on pin 3 is a terrible idea, even more so as the chip is symmetrical, so it will lead to bad placing and reversed power supply. Sure, this is some innovation, but the world does not work that way and everyone just gets confused. When you make such changes to other components a design becomes incoherent and no one will then know anything, but look how many (fake) "innovations" just showed up.
This is why solid coherent standardized fundamentals / foundations of technology are so important. So we "just know" things intuitively, and we can work together to improve things worldwide in a systematic fashion, solid brick after solid brick, evolution not revolution. You cannot just delete / replace what is already out there working fine.
The example above is about an electronic component, but with software it is exactly the same: it is good to stick to well adapted standards, add your own brick on top of solid inviolable fundamentals / foundation, not necessarily following the quickly changing fashions and trends with the lifespan of a yogurt, not spreading bad habits from other environments. That will result in a far better solution that is coherent and long term maintainable. That results in a solid foundation for a good system / device / solution / product.
--
CeDeROM, SQ7MHZ, http://www.tomek.cedro.info
