I discovered that the system wasn't effectively utilizing the cpu's. Initially I thought it was caused by cpu throttling due to overheating, but after reading out the temperatures, this hypotheses is not correct.
I have increased the number of threads that generate requests. And the a saw pattern on the cpu load disappears (forking and joining) and it remains constant at roughly 80%. And currently I'm at a aggregating 55 GB/s. I also played with 50GB offheap chunk and I'm up to 68 GB/s. It would still be interesting to know if there is a tool that can show the maximum bandwidth of the memory bus. On Sunday, January 14, 2018 at 8:44:00 PM UTC+2, Peter Veentjer wrote: > > I'm working on some very simple aggregations on huge chunks of offheap > memory (500GB+) for a hackaton. This is done using a very simple stride; > every iteration the address increases with 20 bytes. So the prefetcher > should not have any problems with it. > > According to my calculations I'm currently processing 35 GB/s. However I'm > not sure if I'm close to the maximum bandwidth of this machine. Specs: > 2133 MHz, 24x HP 32GiB 4Rx4 PC4-2133P > 2x Intel(R) *Xeon*(R) CPU E5-2687W v3, 3.10GHz, 10 cores per socket > > What is the best tool to determine the maximum bandwidth of a machine > running Linux (RHEL 7) > -- You received this message because you are subscribed to the Google Groups "mechanical-sympathy" group. To unsubscribe from this group and stop receiving emails from it, send an email to mechanical-sympathy+unsubscr...@googlegroups.com. For more options, visit https://groups.google.com/d/optout.