Re: [Breaking change] Move nxmutex to sched

Gregory Nutt Mon, 17 Apr 2023 08:50:33 -0700

Linux uses functions like copy_to_user() and copy_from_user() to get information to/from user space.

But I think there is an easier way in NuttX. All user memory comes from a pool of pages that are also mapped in kernel space. I think that is true for all architectures. And there should be a function to convert a user virtual address to a physical address. I am not sure what all is in place.

Couldn't accessing user memory from the kernel address alias avoid the problem you describe? Of course, you would have to be careful at page boundaries because contiguous virtual pages may not be physically contiguous.
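
To make the page-boundary caveat concrete, here is a minimal sketch in C. It is not existing NuttX code: copy_from_user_alias(), the user_to_kernel_fn hook, DEMO_PAGE_SIZE and the demo mapping are all hypothetical names made up for illustration. The only idea it shows is that the copy is split at every page boundary and each page is translated separately.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define DEMO_PAGE_SIZE 4096u    /* Assumed page size */
#define DEMO_USER_BASE 0x10000u /* Assumed start of the demo user mapping */

/* Hypothetical hook: returns the kernel-space alias of a user virtual
 * address, or NULL if the address is not mapped.
 */

typedef void *(*user_to_kernel_fn)(uintptr_t uaddr);

/* Copy len bytes from user space through the kernel alias.  The address
 * is re-translated at every page boundary because virtually contiguous
 * pages may not be physically contiguous.
 */

static int copy_from_user_alias(void *dst, uintptr_t usrc, size_t len,
                                user_to_kernel_fn translate)
{
  uint8_t *out = dst;

  assert(dst != NULL && translate != NULL);

  while (len > 0)
    {
      /* Bytes left before the end of the current user page */

      size_t room = DEMO_PAGE_SIZE - (size_t)(usrc % DEMO_PAGE_SIZE);
      size_t chunk = len < room ? len : room;
      const void *kaddr = translate(usrc);

      if (kaddr == NULL)
        {
          return -1; /* Unmapped user address */
        }

      memcpy(out, kaddr, chunk);
      out += chunk;
      usrc += chunk;
      len -= chunk;
    }

  return 0;
}

/* Demo mapping only: two user pages backed by frames in swapped order,
 * so the two pages are deliberately not contiguous in "kernel" memory.
 */

static uint8_t demo_frames[2][DEMO_PAGE_SIZE];

static void *demo_translate(uintptr_t uaddr)
{
  if (uaddr < DEMO_USER_BASE ||
      uaddr >= DEMO_USER_BASE + 2 * DEMO_PAGE_SIZE)
    {
      return NULL;
    }

  size_t page = (uaddr - DEMO_USER_BASE) / DEMO_PAGE_SIZE;
  size_t off = (uaddr - DEMO_USER_BASE) % DEMO_PAGE_SIZE;
  return &demo_frames[1 - page][off];
}
```

A read that straddles the two demo pages is served from two unrelated frames, which is exactly the case a single memcpy() on the translated start address would get wrong.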


On 4/14/2023 6:18 AM, Jukka Laitinen wrote:

Hi,
I am not sure whether it is necessary to separate mutex and semaphore (although I do see the performance gain that it would give for mutex), but there is another related topic I would like to raise.
Currently, the semaphores don't work (at all) for CONFIG_BUILD_KERNEL. The simple reason is that the semaphores are allocated from the user-mapped memory, which is not always available for the kernel while scheduling or in interrupts. At the time when it is needed, there may be another memory map active for mmu.
There is also an issue with performance; every semaphore access needs to go to the kernel through syscall, although in principle the semaphore counter handling alone doesn't need that if the compiler & hw has the necessary atomic support.
We are especially interested in having real-time behaviour (priority based scheduling, priority inheritance...) AND running CONFIG_BUILD_KERNEL. We have used some methods to circumvent the issue, but for those I am not going into details as we don't have a publishable implementation ready.
A tempting way to fix the problem (which we didn't try out yet) would be separating the semaphores in two parts, kernel side structure and the user side structure. Something that zyfeier also did with the "futex" linux-like implementation. But, also this kind of implementation should be real-time - so when there is access to the semaphore via syscall (e.g. when the semaphore blocks), or when scheduling, the kernel must have O(1) access to the kernel side structure - no hashing / allocating etc. at runtime.
So to summarize, for CONFIG_BUILD_KERNEL the semaphores could *perhaps* work like this (this is not yet tried out, so please forgive me if something is forgotten):
- User-side semaphore handle would have the counter and a direct pointer (handle) to the kernel side structure (which can be passed to kernel in syscall).
- Kernel side structure would have the needed wait queue and sem holder structures (and flags?)
- Kernel side structure would be allocated at sem_init (AND if it was not initialized, allocate it at the time when it is needed?). To achieve real-time behaviour one should just call sem_init properly at startup of the application.
- Kernel side structures would be listed in tcb and cleaned up at task_group exit. Also some hard limit/management for how much kernel memory can one process eat from kernel heap is needed.
- Counter manipulation can be handled directly in libc in case compiler supports proper atomic operations, or syscall to kernel when there is no support available (this would be just performance optimization - next phase)
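
The split described in the list above could be sketched roughly as follows. This is an illustration only, not an existing or proposed NuttX API: struct usem_s, struct ksem_s, usem_init(), usem_wait(), usem_post() and the demo_sys_*() stand-ins are hypothetical names. The kernel-side object is passed in by the caller here, whereas the proposal would allocate it in the kernel at sem_init. The sketch assumes C11 atomics and the convention that a negative count means there are waiters.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical kernel-side part.  The real one would hold the wait queue,
 * the holder structures and flags; a waiter count stands in for them.
 */

struct ksem_s
{
  int nwaiters;
};

/* Hypothetical user-side handle: the counter plus a direct pointer to the
 * kernel-side structure, so the kernel needs no lookup or hashing.
 */

struct usem_s
{
  atomic_int count;
  struct ksem_s *khandle;
};

/* Stand-ins for the syscalls.  A real kernel would block the caller or
 * wake a waiter here; these only record that the slow path was taken.
 */

static int demo_sys_sem_wait(struct ksem_s *ksem)
{
  ksem->nwaiters++;
  return 0;
}

static int demo_sys_sem_post(struct ksem_s *ksem)
{
  if (ksem->nwaiters > 0)
    {
      ksem->nwaiters--;
    }

  return 0;
}

static int usem_init(struct usem_s *sem, struct ksem_s *ksem, int value)
{
  assert(sem != NULL && ksem != NULL && value >= 0);

  atomic_init(&sem->count, value);
  ksem->nwaiters = 0;
  sem->khandle = ksem;
  return 0;
}

static int usem_wait(struct usem_s *sem)
{
  /* Fast path: the old count was positive, so no syscall is needed */

  if (atomic_fetch_sub(&sem->count, 1) > 0)
    {
      return 0;
    }

  /* Slow path: the semaphore was not available, enter the kernel */

  return demo_sys_sem_wait(sem->khandle);
}

static int usem_post(struct usem_s *sem)
{
  /* Fast path: the old count was not negative, so nobody is waiting */

  if (atomic_fetch_add(&sem->count, 1) >= 0)
    {
      return 0;
    }

  /* Slow path: at least one waiter must be woken by the kernel */

  return demo_sys_sem_post(sem->khandle);
}
```

The sketch leaves out everything that makes the real problem hard: the race between the user-side counter update and the kernel-side wait queue, priority inheritance, the fallback for targets without atomic support, and cleanup at task_group exit.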
Whether it is feasible to do it only for CONFIG_BUILD_KERNEL, or as a common implementation for all build modes, I didn't think of yet. I am also not sure whether the re-design of semaphore could also lead to better wrapping of it for mutex use, but this is also possible. In that case it could *maybe* solve the performance issue zyfeier tried to tackle.
This is just one idea, but somehow the problem of not working semaphores in CONFIG_BUILD_KERNEL should be tackled. I wonder if this is something we should experiment with? If someone is interested in such an experiment, please let me know. Or if someone is interested in doing this experiment, please let me know as well, so we don't end up doing duplicate work :)
Br,
Jukka
Ps. I think that in the current implementation the nxmutex code is inlined everywhere, increasing code size. Not a huge issue for me, but increasing code size should be managed....
On 7.4.2023 5.18, zyfeier wrote:
Thank you very much for the example you provided. What I want to point out is that this is not just about " just delete / replace what is already out there working fine ". Due to the multi-holder of the count semaphore, the performance of the mutex is much worse than other RTOS (with a performance gap of 10%), but these operations are not necessary for the mutex. That's why there is an idea to separate the mutex and semaphore.
However, if everyone thinks that separating the mutex and semaphore is a bad idea, then we need to think of other methods. Do you have any better methods to offer?
Sent from Mail <https://go.microsoft.com/fwlink/?LinkId=550986> for Windows

*From: *Tomek CEDRO <mailto:[email protected]>
*Sent: *April 6, 2023 22:36
*To: *[email protected]
*Subject: *Re: [Breaking change] Move nxmutex to sched

On Thu, Apr 6, 2023 at 2:58 PM Gregory Nutt wrote:
> Oh my God!  That sounds terrible!  Does this change actually do
> /anything /positive.
Look Zyfeier, it's not that we oppose development, we want the development to be done the right way that will bring an elegant, coherent, standard compliant solution as a result :-)
Aside from my previous remark on Linux (along with other commercial OS) "enforced changes", let's think about Greg's "does this change actually do /anything /positive" question with another example.
Take a look at the WS2812 RGB Smart LED. They decided to introduce "an innovation" by changing the Pin1 marking on the casing and put that mark on pin 3 instead. The whole world uses the Pin1 marking to quickly align a component pinout, so at first glance you can see where pin 1 of the component is. Also, most chips use VCC there so you can quickly measure things, nothing fancy, everyone knows that. Now take a look at the pcb design footprint (bottom layer mirrored) and the led datasheet.
You can clearly see that putting the Pin1 casing mark on pin 3 is a terrible idea, even more so because the chip is symmetrical, so it will lead to bad placing and reversed power supply. Sure, this is some innovation, but the world does not work that way and everyone just gets confused. When you make such changes to other components a design becomes incoherent and no one will then know anything, but look how many (fake) "innovations" just showed up.
This is why solid, coherent, standardized fundamentals / foundations of technology are so important. So we "just know" things intuitively, and we can work together to improve things worldwide in a systematic fashion, solid brick after solid brick, evolution not revolution. You cannot just delete / replace what is already out there working fine.
The example above is about an electronic component, but with software it is exactly the same. It is good to stick to well adapted standards and add your own brick on top of solid inviolable fundamentals / foundation, not necessarily following the quickly changing fashions and trends with the lifespan of a yogurt, and not spreading bad habits from other environments. That will result in a far better solution that is coherent and long term maintainable. That results in a solid foundation for a good system / device / solution / product.
--
CeDeROM, SQ7MHZ, http://www.tomek.cedro.info
