Hi All,
I tried to ran some simulation using multi-core. Benchkmarks that I am using
are single thread code.
I ran one of them with 4 cores and I got following stats for user:
OOO_0_0: cycles 40156294
OOO_1_1: cycles 19384
OOO_2_2: cycles 101236
OOO_2_2: cycles 0
For the Kernel, load distributed among cores in a good way. Is this because my
code is single thread?
How can I fix that without changing code to multi-thread?
How can I distribute the load in a fine grain way?
Thanks in advance for your help and comment.
Regards,
Ali Ahmadi
_______________________________________________
http://www.marss86.org
Marss86-Devel mailing list
[email protected]
https://www.cs.binghamton.edu/mailman/listinfo/marss86-devel