> It seems that when set in a loop, the program is able to perform some > type of optimization to actually get the use of the offsets where as > in the case of no loop, we have twice the calculations of instructions > for each address calculations.
I suggest you look at the dumps for i386 to see which pass does the changes, and then see what happens in your port. Paolo