[order instructions to minimize ROB read port and other stalls. No measurable speedup, though peter@cordes.ca**20080314212019] { hunk ./rshift.asm 109 - movd %ecx, %xmm0 C %0=64-cnt=left count=lc; %1=cnt; - - pxor %xmm2, %xmm2 C we need this for later, in L(out) - hunk ./rshift.asm 110 - shlq %cl, %rax C return value=limb0<2 + C seems to make no diff where we put pxor, so move it to function start if that helps alignment + pxor %xmm2, %xmm2 C we need this for later, in L(out). }