Re: [gem5-dev] Review Request: Forward invalidations from Ruby to O3 CPU

Nilay Vaish Wed, 09 Nov 2011 15:25:59 -0800

Brad, your reply clears some air.

The current patch allows us to use the existing O3 CPU with Ruby. Sincethe O3 CPU already provides Alpha's memory model, we get that for free.Now that we would like to have TSO as well, we need to work out how thetwo models would co-exist. I'll think more about this, but we need abroader consensus on this.


--
Nilay

On Wed, 9 Nov 2011, Beckmann, Brad wrote:

I see. It sounds like you're still worried about how the RubyPort cansupport multiple M5 cpu ports and still adhere to a stronger consistencymodel. Sorry for not directly responding to that question earlier, butto me that seems like an orthogonal issue that you've already solved.If I recall correctly, the patch you sent out for review essentiallyattaches the multiple M5 cpu ports, representing simultaneous cpurequests, to the single RubyPort that represents the CPUs connection tothe L1 caches. That seems reasonable to me and I don't see any problemwith it. The key is that the cpu LSQ cannot blindly issue simultaneousrequests to the memory system without expecting and acting upon probesthat occur between issue and retirement. Furthermore, the CPU needs tocommunicate to Ruby when the instructions associated with the memoryoperations retire (for loads) or reach the head of the store buffer (forstores). Once Ruby receives that notification, it can stop monitoringthat location and move the cache block to a base state.
Now to answer your specific question: We are definitely interested in aTSO model and in my opinion that is the only consistency model that wehave to implement. Remember TSO is a valid implementation of Alpha's orARM's weaker models. We can certainly implement subsequent models, butthat should not be a short term goal.
I know this can be a complicated subject so please send me questions ifyou disagree or are confused. I certainly may be overlooking somethingand my thoughts are constantly evolving as well as I page more of thisinto my memory. For instance, I realize that my previous mail wasincorrect because I confused the LSQ, which contains pre-retirementmemory instructions, with the store buffer, which containspost-retirement store instruction values. If a probe hits in the storebuffer, the CPU doesn't (it can't) reissue the store instruction. Thestore buffer shields the CPU from that probe. As long as the cache haswrite permission when the store reaches the head of the store buffer,stores have a global order and TSO is maintained. Of course probingloads in the LSQ also needs to occur, along with several other featuresfor supporting locks, fences, etc.
If you do have further questions, please be specific as possible. It ishard to talk about this subject using generalities.
Brad

_______________________________________________
gem5-dev mailing list
[email protected]
http://m5sim.org/mailman/listinfo/gem5-dev

Re: [gem5-dev] Review Request: Forward invalidations from Ruby to O3 CPU

Reply via email to