Hi all,

I tried to run some simulations using multiple cores. The benchmarks I am using are single-threaded. I ran one of them with 4 cores and got the following user-mode cycle counts:

OOO_0_0: cycles 40156294
OOO_1_1: cycles 19384
OOO_2_2: cycles 101236
OOO_2_2: cycles 0

For kernel mode, the load is distributed well among the cores. Is the user-mode imbalance because my code is single-threaded? How can I fix that without rewriting the code to be multi-threaded? Is there a way to distribute the load at a fine granularity?

Thanks in advance for your help and comments.

Regards,
Ali Ahmadi
_______________________________________________
http://www.marss86.org
Marss86-Devel mailing list
[email protected]
https://www.cs.binghamton.edu/mailman/listinfo/marss86-devel
