Just to be clear, it was SIMD stuff we did (Single Instruction Multiple Data), not VLIW (Very Long…
Awesome catch! Yes, I think that is a bug. This probably serves me right for violating my own rul…
All of this is true. I've found that "1.5x speedup" is a reasonably good rule of thumb for most g…
Unfortunately no. Clang does the same thing that GCC does for these vector types:
elle —
I noticed that LLVM is less strict about the vector types than MSVC by default. For example, it do…
Nick —
Ah I see. Thank you both for the explanation.
It's hard to ignore 8 processors. After all, they're just sitting there, unused. I'm pretty sure h…
robert —
Hello all, I was watching episode 124 and I think I have spotted a possible bug in the threading c…
There's more stuff about MESI on ryg's blog:
Kladdehelvete, did you read all the text I posted or just the first sentence? If what you are sayi…
The problem here is strictly that there is no compression happening, and the dynamic range of the …
Ah! Good :) Although there's probably still a smarter way to do it if I thought about it for a b…
The problem with any of this is that you really don't know until you actually test it with your wo…
Well, that may be true, but we cannot get rid of it until we duplicate the function, because keep …
Hyperthreading definitely does allow you to potentially double the work done per core. The reason…
Please take note that I am 100% confident that Casey will do extremely well with his game eventual…
I think you are not understanding hyperthreading well. Using 4 CPUs at 100% doesn't mean that CPU i…
Marco —
Hey everyone, I hope Casey won't ban me for posting CppCon references :) Here are some videos and …
elle —
I think we can save 1 multiply, 1 add, and 1 shift at the end of the function when we unnecessaril…
I don't know how applicable this is to optimization via hyperthreading, but the two links below se…
"Has anyone else noticed this too, or is this a bug in my version?" No, it seems to me it s…
You were not tired - you looked up whether there is a shift by a variable amount for a 128-bit register. The…
For the last few versions of Visual Studio (at least from VS2005, maybe even older) all CRT functions …
Yeah, the _mm_slli_si128 instruction is for shifting by an immediate, which means it has to be bak…
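To make the immediate-versus-variable distinction concrete, here is a minimal sketch in C, assuming an x86 target with SSE2. The function names (`shift_left_4_bytes`, `shift_lanes_left`, `store_lanes`) are made up for illustration and are not from the stream's code. `_mm_slli_si128` shifts the whole register by a number of bytes that must be a compile-time constant, while `_mm_sll_epi32` shifts each 32-bit lane by a bit count that can be chosen at runtime.

```c
#include <assert.h>
#include <emmintrin.h>  // SSE2 intrinsics
#include <stdint.h>

// Whole-register byte shift. The count (4 here) is an immediate:
// it is encoded into the instruction, so it cannot be a runtime variable.
static __m128i shift_left_4_bytes(__m128i v) {
    return _mm_slli_si128(v, 4);
}

// Per-lane bit shift. The count is read from the low 64 bits of a second
// register, so it can be a runtime value.
static __m128i shift_lanes_left(__m128i v, int bits) {
    assert(bits >= 0);
    return _mm_sll_epi32(v, _mm_cvtsi32_si128(bits));
}

// Hypothetical helper: copy the four 32-bit lanes out to memory.
static void store_lanes(__m128i v, uint32_t out[4]) {
    _mm_storeu_si128((__m128i *)out, v);
}
```

If a whole-register shift by a runtime amount is really needed, the usual workarounds are a `switch` over the possible constants or a byte shuffle, both of which cost more than the immediate form.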
I have never had a problem using CreateThread, but that is probably because I don't generally use …
Nick —
I just finished episode 122 so forgive me if this has already been answered. I've been reading a b…
Personally, Andy, I think what you've done will suffice. By which I actually mean, you've totally …