On Wednesday, 17 December 2014 at 14:58:13 UTC, Adam D. Ruppe wrote:
On Wednesday, 17 December 2014 at 14:12:16 UTC, Trollgeir wrote:
I'd expect the bt function to be up to 32 times faster, as I thought it only compared two bits rather than the entire run of bits in the uint.

The processor doesn't work in terms of bits like that - it still needs to look at the whole integer. In fact, according to my (OLD) asm reference, the bt instruction is slower than the and instruction at the cpu level.

I think it has to do a wee bit more work: translating the 16 into a mask, then moving the result into the carry flag... then moving the flag back into a register to return the value. (That last step could probably be skipped if you do an if() on it and the compiler optimizes the branch, and the first step might be skipped too if the index is a constant, since the compiler can rewrite the instruction. So given that, I'd expect exactly what you saw: no difference when they optimize down to the same thing or when the CPU's stars align right, and & a bit faster when bt isn't optimized.)

bt() and friends are special instructions for specialized use cases. Probably useful for threading and stuff.


Thanks for the explanation, I suspected it might work something like that.

For my implementation, I have bits shifting to the right every update, and I want to check whether they have reached certain markers. Hence, it felt really inefficient to check every single bit in the uint when I was only interested in a few specific ones. Is an alternative (optimized) version of bt even possible?
