Prefetching Examples?
Here’s an actual piece of code that I’ve pulled out of a larger project. (Sorry, it’s the shortest one I could find that had a noticeable speedup from prefetching.) This code performs a very large data transpose. This example uses the SSE prefetch instructions, which may be the same as the ones that GCC emits.
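As a rough illustration of the idea, and not the code from that project, here is a minimal sketch of an out-of-place transpose that issues a software prefetch for the source element it will need a few iterations later. The function name `transpose_prefetch`, the `PREFETCH` macro, and the `PREFETCH_ROWS` distance are all made up for this example. The distance of 8 is only a placeholder, and whether any speedup appears depends on the matrix size and the CPU, so it should be measured.

```c
#include <assert.h>
#include <stddef.h>

/* Use the SSE intrinsic where available; otherwise fall back to the
 * GCC/Clang builtin, or to a no-op on other compilers. */
#if defined(__SSE__) || defined(_M_X64)
#include <xmmintrin.h>
#define PREFETCH(p) _mm_prefetch((const char *)(p), _MM_HINT_T0)
#elif defined(__GNUC__)
#define PREFETCH(p) __builtin_prefetch((p))
#else
#define PREFETCH(p) ((void)0)
#endif

/* Hypothetical tuning knob: how many source rows ahead to prefetch. */
#define PREFETCH_ROWS 8

/* Out-of-place transpose of a row-major matrix.
 * src is rows x cols, dst is cols x rows. */
static void transpose_prefetch(const double *src, double *dst,
                               size_t rows, size_t cols)
{
    assert(src != dst); /* in-place transposition is not supported here */

    for (size_t c = 0; c < cols; c++) {
        for (size_t r = 0; r < rows; r++) {
            /* Reads walk down a column, so each one lands a full row
             * (cols * sizeof(double) bytes) past the previous one.
             * Hint the cache line needed PREFETCH_ROWS iterations from
             * now, staying inside the array bounds. */
            if (r + PREFETCH_ROWS < rows)
                PREFETCH(&src[(r + PREFETCH_ROWS) * cols + c]);

            /* Writes to dst are sequential. */
            dst[c * rows + r] = src[r * cols + c];
        }
    }
}
```

The prefetch is only a hint: removing it does not change the result, which makes it easy to time both variants against each other on a large matrix.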