Writing code that uses instruction pipelining as a form of parallelism
Does anyone have a good example of code that exploits out-of-order (OoO) execution and instruction pipelining as a form of parallelism, in conjunction with SIMD, like Fabian described at the end of the compression session at HandmadeCon?
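
To make the question concrete, here is the kind of thing I have in mind. This is only a minimal sketch in C under my own assumptions, not code from the talk, and the function names `sum_one_chain` and `sum_four_chains` are made up for illustration. The first loop has a single dependency chain, so each add has to wait for the previous one. The second splits the work across four independent accumulators, which should let a pipelined, out-of-order core keep several adds in flight at once.

```c
#include <assert.h>
#include <stddef.h>

/* One accumulator: every add depends on the result of the previous add,
   so the loop is limited by floating-point add latency, not throughput. */
float sum_one_chain(const float *a, size_t n)
{
    assert(a != NULL || n == 0);
    float s = 0.0f;
    for (size_t i = 0; i < n; ++i)
        s += a[i];
    return s;
}

/* Four independent accumulators: the four adds in each iteration do not
   depend on each other, so the CPU is free to overlap them in the pipeline. */
float sum_four_chains(const float *a, size_t n)
{
    assert(a != NULL || n == 0);
    float s0 = 0.0f, s1 = 0.0f, s2 = 0.0f, s3 = 0.0f;
    size_t i = 0;

    /* Main loop: four elements per iteration, one per chain. */
    for (; i + 4 <= n; i += 4) {
        s0 += a[i + 0];
        s1 += a[i + 1];
        s2 += a[i + 2];
        s3 += a[i + 3];
    }

    /* Tail: leftover elements when n is not a multiple of 4. */
    for (; i < n; ++i)
        s0 += a[i];

    /* Combine the chains once at the end. */
    return (s0 + s1) + (s2 + s3);
}
```

The second version adds in a different order, so for general float inputs its result can differ slightly from the first; that is also why a compiler normally will not do this transformation for you unless you pass something like `-ffast-math`. My understanding is that the same idea carries over to SIMD by making each accumulator a vector register instead of a scalar, but that is the part I would like to see a real example of.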