Re: [m5-dev] Fixing MESI CMP directory protocol

Arkaprava Basu Tue, 04 Jan 2011 11:25:35 -0800

These are the following step I use:

1. First run with whatever default values of threshold are.

2. If deadlocked, take trace and try to find out is there evident reasonfor deadlock or not.

3. If no, double the default threshold value and run again.

4. If the same test passes with larger threshold, then it means thedeadlock was actually not there. So life is good. If not, need to digmore into trace to see whats going on.


@Nilay:

By end of today, I will share with you the patch that seems like fixedthat protocol.


Thanks
Arka

On 01/04/2011 12:51 PM, Nilay Vaish wrote:

What threshold do you use?

On Tue, 4 Jan 2011, Arkaprava Basu wrote:
Hi Nilay,

  On deadlock issue with MESI_CMP_directory :
Yes, this can happen as ruby_tester or Sequencer only reports*possible* deadlocks. With higher number of processors there is morecontention (and thus latency) and it can mistakenly report deadlock.I generally look at the protocol trace to figure out whether there isactually any deadlock or not. You can also try doubling the Sequencerdeadlock threshold and see if the problem goes away. If its a truedeadlock, it will break again.
On some related note, as Brad has pointed out MESI_CMP_directory hasits share of issues. Recently one of Prof. Sarita Adve's studente-mailed us (Multifacet) about 6 bugs he found while model checkingthe MESI_CMP_directory (including a major one). I took some time tolook at them and it seems like MESI_CMP_directory is now fixed(hopefully). The modified protocol is now passing 1M checks with 16processors with multiple random seeds. I can locally coordinate withyou on this, if you want.
Thanks
Arka

_______________________________________________
m5-dev mailing list
m5-dev@m5sim.org
http://m5sim.org/mailman/listinfo/m5-dev

Re: [m5-dev] Fixing MESI CMP directory protocol

Reply via email to