SSE2 prefetch example (M. Lüscher)
Using inlining assembler in GCC:
#define _prefetch_su3(addr) \
__asm__ __volatile__ ("prefetcht0 %0 \n\t" \
"prefetcht0 %1" \
: \
: \
"m" (*(((char*)((unsigned int)(addr))))), \
"m" (*(((char*)((unsigned int)(addr)))+128)))
...
su3 *um;
....
um=&gauge_field[iy][0];
_prefetch_su3(um);
Previous slide
Next slide
Back to first slide
View graphic version