To clarify on the anti-aliasing: Since we are always drawing from bitmaps which have alpha-blended…
There's really no such thing as a non-32-bit-aligned 32-bit value on the stack unless you really go …
What other platforms are you talking about? You also have working TLS on Linux, OSX, Android and i…
The point of the queue here is not to maximize the amount of sleeping, but rather minimize the amo…
We haven't really gotten to the point where we need to start caring about thread_context. It's re…
I was wondering what the purpose of thread_context is now that multi-threading has been added to HH. Do…
Such an approach is fine. It will work. Just don't forget to increase the max count. Semaphore value has …
If I was to implement something like this I'd probably go the route of the WaitForSingleObjectEx e…
Oh, you're right. Value will be 0 if no other jobs are being added. I don't know what I was thinki…
Ok, well I grabbed the source code and, looking through it properly, I can see there's no bug. I don…
Why do you think the semaphore is at 0 at the 4th step? The semaphore decreases only when WaitForSingleObjectE…
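To make the counting rules in this exchange concrete, here is a rough single-threaded model of a counting semaphore's bookkeeping. It is not the Win32 API: `model_semaphore`, `model_release` and `model_try_wait` are hypothetical names, and a real semaphore updates its count atomically inside the kernel. The model assumes only two documented Win32 behaviors: a satisfied wait lowers the count by one, and a release that would push the count past the maximum fails and leaves the count unchanged.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical model of a semaphore's counter. Not thread-safe. */
typedef struct {
    long count;      /* current count; waits succeed while this is > 0 */
    long max_count;  /* upper bound fixed when the semaphore is created */
} model_semaphore;

/* Models a release: fails without changing the count if the release
   count is not positive or the new count would exceed max_count. */
static bool model_release(model_semaphore *sem, long release_count)
{
    assert(sem->count >= 0 && sem->count <= sem->max_count);
    if (release_count <= 0 || sem->count + release_count > sem->max_count) {
        return false;
    }
    sem->count += release_count;
    return true;
}

/* Models a wait with a zero timeout: the count only goes down when the
   wait is actually satisfied. */
static bool model_try_wait(model_semaphore *sem)
{
    assert(sem->count >= 0 && sem->count <= sem->max_count);
    if (sem->count == 0) {
        return false;  /* a real thread would block here instead */
    }
    sem->count -= 1;
    return true;
}
```

In this model, adding jobs without any thread waking up just raises the count, which is why the maximum has to be large enough for the number of jobs that can be outstanding at once.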
Apologies if I'm mistaken, but I just watched episode 126 and spotted what might be a possible rar…
then there is which will take care of alignment to arbitrary boundaries for you: that is, it wil…
_mm_mul_epu32 multiplies only two 32-bit integers and stores the result as two 64-bit integers. So you wou…
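A small sketch of that widening behavior, assuming an x86 target with SSE2. The helper names (`mul_even_lanes`, `mul_all_lanes_lo`, `mul_demo`) are made up for illustration. The second helper shows one common way to get a low-32-bit product for all four lanes out of two `_mm_mul_epu32` calls; it is not necessarily what the post goes on to suggest.

```c
#include <assert.h>
#include <emmintrin.h> /* SSE2 intrinsics */
#include <stdint.h>

/* _mm_mul_epu32 reads only lanes 0 and 2 of each input and writes each
   product as a full 64-bit value. */
static void mul_even_lanes(const uint32_t a[4], const uint32_t b[4], uint64_t out[2])
{
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);
    _mm_storeu_si128((__m128i *)out, _mm_mul_epu32(va, vb));
}

/* All four lanes, keeping only the low 32 bits of each product. */
static void mul_all_lanes_lo(const uint32_t a[4], const uint32_t b[4], uint32_t out[4])
{
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);
    /* Products of lanes 0 and 2. */
    __m128i even = _mm_mul_epu32(va, vb);
    /* Shift by 4 bytes so lanes 1 and 3 move into positions 0 and 2. */
    __m128i odd = _mm_mul_epu32(_mm_srli_si128(va, 4), _mm_srli_si128(vb, 4));
    /* Gather the low halves of both 64-bit products into lanes 0 and 1. */
    __m128i even_lo = _mm_shuffle_epi32(even, _MM_SHUFFLE(0, 0, 2, 0));
    __m128i odd_lo = _mm_shuffle_epi32(odd, _MM_SHUFFLE(0, 0, 2, 0));
    /* Interleave back into the original lane order: p0, p1, p2, p3. */
    _mm_storeu_si128((__m128i *)out, _mm_unpacklo_epi32(even_lo, odd_lo));
}

/* Usage example with values chosen so lane 0 overflows 32 bits. */
static void mul_demo(void)
{
    const uint32_t a[4] = {0xFFFFFFFFu, 7u, 3u, 9u};
    const uint32_t b[4] = {2u, 5u, 4u, 11u};
    uint64_t wide[2];
    uint32_t low[4];

    mul_even_lanes(a, b, wide);
    assert(wide[0] == 0x1FFFFFFFEull); /* lane 0 product, not truncated */
    assert(wide[1] == 12u);            /* lane 2 product; lanes 1 and 3 ignored */

    mul_all_lanes_lo(a, b, low);
    assert(low[0] == 0xFFFFFFFEu && low[1] == 35u && low[2] == 12u && low[3] == 99u);
}
```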
elle —
After debugging my vectorized version for a long time, I thought of a few minor things that might …
Yes, for variables allocated on the stack, the compiler aligns them automatically. It gets tricky with stuff…
MSDN states that the parameters for the InterlockedCompareExchange and InterlockedIncrement functions must…
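One cheap way to guard against a misaligned target is to check the address before handing it to an interlocked function. The helper below is a sketch with a made-up name (`is_aligned`); it is plain C and does not call any Win32 function itself.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Returns nonzero when ptr is a multiple of alignment.
   alignment must be a power of two (4 for a 32-bit boundary). */
static int is_aligned(const volatile void *ptr, size_t alignment)
{
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    return ((uintptr_t)ptr & (alignment - 1)) == 0;
}
```

In a debug build one could then write something like `assert(is_aligned(&queue->next_entry, 4));` right before the interlocked call, where `queue->next_entry` stands in for whatever 32-bit variable is being updated. Ordinary locals and struct members will normally pass; the check mostly catches values reached through packed structs or hand-computed byte offsets.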
Not much that hasn't already been discussed in these forums or on the streams, but interesting re…
Yes, I got VLIW somehow confused with SIMD, not sure why. SIMD executes one instruction on several…
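For anyone who wants to see the "one instruction, several values" idea in code, here is a minimal sketch assuming an x86 target with SSE. The function names `add4` and `add4_demo` are made up for illustration. A single `_mm_add_ps` performs four float additions at once.

```c
#include <assert.h>
#include <xmmintrin.h> /* SSE intrinsics */

/* Adds four pairs of floats with one SIMD add instead of four scalar adds. */
static void add4(const float a[4], const float b[4], float out[4])
{
    __m128 va = _mm_loadu_ps(a);  /* load 4 floats (no alignment required) */
    __m128 vb = _mm_loadu_ps(b);
    _mm_storeu_ps(out, _mm_add_ps(va, vb));
}

/* Usage example; the values are exactly representable, so == is safe. */
static void add4_demo(void)
{
    const float a[4] = {1.0f, 2.0f, 3.0f, 4.0f};
    const float b[4] = {10.0f, 20.0f, 30.0f, 40.0f};
    float sum[4];

    add4(a, b, sum);
    assert(sum[0] == 11.0f && sum[1] == 22.0f && sum[2] == 33.0f && sum[3] == 44.0f);
}
```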
Just to be clear, it was SIMD stuff we did (Single Instruction Multiple Data), not VLIW (Very Long…
Awesome catch! Yes I think that is a bug. This probably serves me right for violating my own rul…
All of this is true. I've found that "1.5x speedup" is a reasonably good rule of thumb for most g…
Unfortunately no. Clang does the same thing that GCC does for these vector types:
elle —
I noticed that LLVM is less strict about the vector types than MSVC by default. For example, it do…
Nick —
Ah I see. Thank you both for the explanation.
It's hard to ignore 8 processors. After all, they're just sitting there, unused. I'm pretty sure h…
robert —
Hello all, I was watching episode 124 and I think I have spotted a possible bug in the threading c…
There's more stuff about MESI on ryg's blog:
Kladdehelvete, did you read all the text I posted or just the first sentence? If what you are sayi…
The problem here is strictly that there is no compression happening, and the dynamic range of the …