On 12/29/2011 5:13 AM, a wrote:
The needless loads and stores would make it impossible to write an efficient
SIMD add function even if the functions containing asm blocks could be
inlined.

This does what you're asking for:

void test(ref float a, ref float b)
{
    asm
    {
        naked;
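        movaps  XMM0,[RSI];   // load four packed floats (needs 16-byte alignment)
        addps   XMM0,[RDI];   // add the four packed floats at [RDI]
        movaps  [RSI],XMM0;   // store the sums back to [RSI]
        movaps  XMM0,[RSI];   // same load, add, store sequence repeated
        addps   XMM0,[RDI];
        movaps  [RSI],XMM0;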
        ret;
    }
}
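
For comparison, here is a minimal sketch of a single four-float add written with vector types instead of an asm block. It assumes a compiler and target where core.simd defines float4 (x86-64, for example); addVectors is a hypothetical name, not something from this thread. Because it is an ordinary function, the compiler is free to inline it and keep the operands in registers.

```d
import core.simd;

// Hypothetical helper, not from the original post.
// Assumes core.simd provides float4 on the target platform.
float4 addVectors(float4 a, float4 b)
{
    // Element-wise addition of the four float lanes.
    return a + b;
}
```

Calling it with float4 values built from [1, 2, 3, 4] and [10, 20, 30, 40] gives a vector whose .array property holds the four lane sums.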
