We are currently in the process of converting the website to the new design. Some pages, like this one, are still broken. We appreciate your patience.
elle
After debugging my vectorized version for a long time, I thought of a few minor things that might …
»
Mārtiņš Možeiko
Yes, for variables allocated on stack compiler aligns them automatically. It gets tricky with stuf…
»
Marius Adaškevičius
MSDN states that parameters for InterlockedCompareExchange and InterlockedIncrement functions must…
»
Benjamin Kloster
Not much that hasn't already been discussed in these forums or on the streams, but interesting re…
»
noxy_key
Yes, I got VLIW somehow confused with SIMD, not sure why. SIMD executes one instruction on several…
»
Casey Muratori
Just to be clear, it was SIMD stuff we did (Single Instruction Multiple Data), not VLIW (Very Long…
»
Casey Muratori
Awesome catch! Yes I think that is a bug. This probably serves me write for violating my own rul…
»
Andrew Bromage
All of this is true. I've found that "1.5x speedup" is a reasonably good rule of thumb for most g…
»
Mārtiņš Možeiko
Unfortunately no. Clang does same thing that GCC does for these vector types:
»
elle
I noticed that LLVM is less strict about the vector types than MSVC by default. For example, it do…
»
Nick
Ah I see. Thank you both for the explanation.
»
noxy_key
It's hard to ignore 8 processors. After all, they're just sitting there, unused. I'm pretty sure h…
»
robert
Hello All, I was watching episode 124 and i think i ahve spotted a possible bug in the threading c…
»
Mārtiņš Možeiko
There's more stuff about MESI on ryg blog:
»
Mārtiņš Možeiko
Kladdehelvete, did you read all the text I posted or just the first sentence? If what you are sayi…
»
Casey Muratori
The problem here is strictly that there is no compression happening, and the dynamic range of the …
»
Casey Muratori
Ah! Good :) Although there's probably still a smarter way to do it if I thought about it for a b…
»
Casey Muratori
The problem with any of this is that you really don't know until you actually test it with your wo…
»
Casey Muratori
Well, that may be true, but we cannot get rid of it until we duplicate the function, because keep …
»
Casey Muratori
Hyperthreading definitely does allow you to potentially double the work done per core. The reason…
»
Livet Ersomen Strøm
Please take note that I am 100% confident that Casey will do extremely well with his game eventual…
»
Livet Ersomen Strøm
I think you are not understanding hyperthreading well. Using 4 CPU's 100% doesn't mean that CPU i…
»
Marco
Hey everyone, I hope Casey won't ban me for posting CppCon references :) Here are some videos and …
»
elle
I think we can save 1 multiply, 1 add, and 1 shift at the end of the function when we unnecessaril…
»
Abner Coimbre
Hear, hear! B)
»
Mārtiņš Možeiko
I think you are not understanding hyperthreading well. Using 4 CPU's 100% doesn't mean that CPU i…
»
»
noxy_key
I don't know how applicable this is to optimization via hyperthreading, but the two links below se…
»
Livet Ersomen Strøm
Has anyone else noticed this too, or is this a bug in my version?[/quote] No, it seems to me it s…
»
Mārtiņš Možeiko
You were not tired - you looked up if there are shift by variable amount for 128-bit register. The…
»