2009/11/28 Matt Turner <matts...@gmail.com>:
> On Sat, Nov 28, 2009 at 2:13 PM, Yang Zhao <y...@yangman.ca> wrote:
>> The speed-up is definitely there, but __builtin_popcount() will still
>> be drastically faster when architecture-specific optimizations are
>> enabled:
>
> I don't think this is the case (except for with SSE4's popcnt
> instruction, which your CFLAGS seem to be enabling.)
>
> Even when compiling with the Intel CC, which can undoubtedly can
> optimize code for Core 2 better than gcc, fast_bitcount is
> significantly faster.

Right.  The point is, it's not going to be a consistent speedup across
the board, hence it would be unwise to remove the __builtin_popcount()
option.

-- 
Yang Zhao
http://yangman.ca

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Mesa3d-dev mailing list
Mesa3d-dev@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/mesa3d-dev

Reply via email to