On 03/09/2019 at 18:04, Segher Boessenkool wrote:
On Tue, Sep 03, 2019 at 04:28:09PM +0200, Christophe Leroy wrote:
On 03/09/2019 at 15:04, Segher Boessenkool wrote:
On Tue, Sep 03, 2019 at 03:23:57PM +1000, Alastair D'Silva wrote:
+ asm volatile(
+ " mtctr %2;"
+ " mtmsr %3;"
+ " isync;"
+ "0: dcbst 0, %0;"
+ " addi %0, %0, %4;"
+ " bdnz 0b;"
+ " sync;"
+ " mtctr %2;"
+ "1: icbi 0, %1;"
+ " addi %1, %1, %4;"
+ " bdnz 1b;"
+ " sync;"
+ " mtmsr %5;"
+ " isync;"
+ : "+r" (loop1), "+r" (loop2)
+ : "r" (nb), "r" (msr), "i" (bytes), "r" (msr0)
+ : "ctr", "memory");
This outputs as one huge assembler statement, all on one line. That's
going to be fun to read or debug.
Do you mean that a \n has to be added after each ; ?
Something like that. There is no really satisfying way for doing huge
inline asm, and maybe that is a good thing ;-)
Often people write \n\t at the end of each line of inline asm. This works
pretty well (but then there are labels, oh joy).
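For illustration only, here is a minimal sketch of the \n\t convention with a label on its own line. The function name nop_twice is hypothetical, and it assumes the target assembler accepts a plain nop mnemonic (true for most GAS targets, but not guaranteed). It is spelled __asm__ so it also builds in strict ISO C modes.

```c
/* Hypothetical example: each instruction ends in \n\t, so the compiler's
 * assembly output has one instruction per line instead of one long line. */
int nop_twice(int x)
{
	__asm__ volatile(
		"1:\n\t"	/* numeric local label, on a line of its own */
		"nop\n\t"	/* assumes the assembler knows "nop" */
		"nop"
		: "+r" (x));	/* x is passed through untouched */
	return x;
}
```

Numeric local labels such as 1: may be defined more than once in a file, so the statement stays valid if the compiler duplicates it, for example by inlining the function in several places.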
loop1 and/or loop2 can be assigned the same register as msr0 or nb. They
need to be made earlyclobbers. (msr is fine, all of its reads are before
any writes to loop1 or loop2; and bytes is fine, it's not a register).
Can you elaborate, please? Doesn't '+r' mean that they are input and
output at the same time?
That is what + means, yes -- that this output is an input as well. It is
the same to write
asm("mov %1,%0 ; mov %0,42" : "+r"(x), "=r"(y));
or to write
asm("mov %1,%0 ; mov %0,42" : "=r"(x), "=r"(y) : "0"(x));
(So not "at the same time" as in "in the same machine instruction", but
more loosely, as in "in the same inline asm statement").
"to be made earlyclobbers": what does this mean exactly? How do I do that?
You write &, like "+&r" in this case. It means the machine code writes
to this register before it has consumed all asm inputs (remember, GCC
does not understand (or even parse!) the assembler string).
So just
: "+&r" (loop1), "+&r" (loop2)
will do. (Why are they separate though? It could just be one loop var).
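As a rough sketch of why the & matters, consider a small loop of the same shape. This is not the code from the patch: the helper name advance is hypothetical, the PowerPC branch is only compiled when __powerpc__ is defined, and other targets get a plain C fallback with the same result.

```c
/* Hypothetical helper: adds `step` to `addr` `n` times (n must be >= 1). */
unsigned long advance(unsigned long addr, unsigned long n, unsigned long step)
{
#if defined(__powerpc__)
	__asm__ volatile(
		"	mtctr	%1\n"		/* CTR = n */
		"0:	add	%0,%0,%2\n"	/* writes %0 while %2 is still needed */
		"	bdnz	0b\n"		/* loop until CTR reaches zero */
		: "+&r" (addr)			/* & = earlyclobber */
		: "r" (n), "r" (step)
		: "ctr");
#else
	/* Portable fallback for other targets. */
	while (n--)
		addr += step;
#endif
	return addr;
}
```

Without the &, the compiler would be free to put addr and step in the same register whenever it knows they hold the same value on entry; the first add would then overwrite step, and every later iteration would add the wrong amount.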
Yes it could just be a single loop var, but in that case it would have
to be reset at the start of the second loop, which means we would have
to pass 'addr' for resetting the loop anyway, so I opted to do it
outside the inline asm by using two separate loop vars set to their
starting value outside the inline asm.
Christophe