Hi All,

I tried to ran some simulation using multi-core. Benchkmarks that I am using 
are single thread code. 
I ran one of them with 4 cores and I got following stats for user:

OOO_0_0: cycles    40156294
OOO_1_1: cycles    19384

OOO_2_2: cycles    101236

OOO_2_2: cycles    0



For Kernel the load distributed among cores in a good way. Is this because my 
code is single thread?
How can I fix that without changing code to multi-thread?

How can I distribute the load in a fine grain way?


Thanks in advance for your help  and comment.


Regards,
Ali Ahmadi
_______________________________________________
http://www.marss86.org
Marss86-Devel mailing list
[email protected]
https://www.cs.binghamton.edu/mailman/listinfo/marss86-devel

Reply via email to