I found slides for a talk where they try to get max perf out of an OOO machine:
https://deplinenoise.files.wordpr...6/03/gdc16_fredriksson_jaguar.pdf
though one key point to make is that the exact numbers (like the instructions/cycle or latency of instructions) depend heavily on the machine.