Re: [m5-users] SMT bug? ... Re: ALPHA_SE SMT + EIO traces

Korey Sewell Sun, 06 Jul 2008 18:50:14 -0700

Hey Rick,
are you saying that you are trying to get 2 smt threads to fetch on
the same cycle? If so, I dont think we ever did that with O3 yet.


Also, it sounds like adding a "no-op" on fault was some of the work
Gabe did on adding the address translation/TLB to SE mode. As far as I
remember, when we fault in SMT mode, we stop at that instruction and
no no-op is needed to be placed in the decode stage.

If anything, the instruction in the "toDecode" buffer should be
squashed either by one of the pipeline stage squash functions or
manually by accessing that instruction in the buffer and setting it to
squashed (e.g. toDecode[numInst]->setSquashed()).

Adding a "no-op" and that type of stuff is a little foreign to the
implementation of O3 I was working on...

On Sun, Jul 6, 2008 at 8:20 PM, Rick Strong <[EMAIL PROTECTED]> wrote:
> It does work for smtNumFetchingThreads=1.
>
> -Rick
>
> Steve Reinhardt wrote:
>> I'm just wondering if you configure it to only fetch from at most one
>> thread per cycle if this problem will go away.
>>
>> On Sun, Jul 6, 2008 at 4:00 PM, Rick Strong <[EMAIL PROTECTED]> wrote:
>>
>>> There is a round robin policy that fetches from multiple threads.
>>>
>>> -Rick
>>>
>>> Steve Reinhardt wrote:
>>>
>>>> Thanks for tracking this down... it validates with my theory that the
>>>> new SE-mode address translation is the culprit, since before that was
>>>> added you never would have had a fault in SE mode.
>>>>
>>>> Unfortunately I'm no O3 expert, so I can't give you any pointers on
>>>> how to solve the problem off the top of my head.  I am a little
>>>> curious about why both threads would be fetching in the same cycle
>>>> though; I'd expect the SMT model to choose one thread or the other to
>>>> fetch from but not both.  Is there a policy to enable this behavior?
>>>> I'm more familiar with the SMT that Steve Raasch added to the old
>>>> obsolete SimpleScalar-derived CPU though, and not so much with what
>>>> Korey's done to O3.
>>>>
>>>> Steve
>>>>
>>>> On Sun, Jul 6, 2008 at 3:24 PM, Rick Strong <[EMAIL PROTECTED]> wrote:
>>>>
>>>>
>>>>> After diving into this problem more, it seems that there might be a
>>>>> problem in src/cpu/o3/fetch_impl.hh for SMT in general.  At the bottom
>>>>> of function fetch(), if a fault has occurred, a no-op instruction is
>>>>> formed and placed in the toDecode wire. However, the reference that is
>>>>> used is "toDecode->insts[numInst] = instruction;" at line 1256 where
>>>>> numInsts is never incremented. The end result is that if two SMT threads
>>>>> both fault on the fetch tick when using EIO traces, they both write to
>>>>> the same location in the insts field of the toDecode struct.  Attempted
>>>>> to solutions that have not worked.
>>>>>
>>>>> 1) If I increment numInst afterwards, it seems that it is possible to
>>>>> fetch more instructions than the width due to the insertion of the
>>>>> no-ops.
>>>>>
>>>>> 2) If I use toDecode->size value before it is increment on line 1258
>>>>> (approx.), this leads to a null access sometime later down the road.
>>>>>
>>>>> If someone has a good understanding with O3 or SMT implementation, your
>>>>> help would be appreciated.
>>>>>
>>>>> -Rick
>>>>>
>>>>>
>>>>>
>>>>
>>>
>>
>>
>
> _______________________________________________
> m5-users mailing list
> [email protected]
> http://m5sim.org/cgi-bin/mailman/listinfo/m5-users
>



-- 
----------
Korey L Sewell
Graduate Student - PhD Candidate
Computer Science & Engineering
University of Michigan
_______________________________________________
m5-users mailing list
[email protected]
http://m5sim.org/cgi-bin/mailman/listinfo/m5-users

Re: [m5-users] SMT bug? ... Re: ALPHA_SE SMT + EIO traces

Reply via email to