2009/11/28 Matt Turner <matts...@gmail.com>:
> On Sat, Nov 28, 2009 at 2:13 PM, Yang Zhao <y...@yangman.ca> wrote:
>> The speed-up is definitely there, but __builtin_popcount() will still
>> be drastically faster when architecture-specific optimizations are
>> enabled:
>
> I don't think this is the case (except with SSE4's popcnt
> instruction, which your CFLAGS seem to be enabling.)
>
> Even when compiling with the Intel CC, which can undoubtedly
> optimize code for Core 2 better than gcc, fast_bitcount is
> significantly faster.

Right. The point is that the speedup won't be consistent across the board, so it would be unwise to remove the __builtin_popcount() option.

--
Yang Zhao
http://yangman.ca

_______________________________________________
Mesa3d-dev mailing list
Mesa3d-dev@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/mesa3d-dev
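
For illustration only, here is a minimal sketch of what keeping both paths selectable at build time might look like. The macro name USE_BUILTIN_POPCOUNT and the helper names are hypothetical, and the fallback is the well-known parallel (SWAR) bit count, not necessarily the fast_bitcount from the patch, whose body is not shown in this message.

```c
#include <stdint.h>

/* Portable fallback: the classic parallel (SWAR) population count.
 * Assumption: this stands in for the patch's fast_bitcount, which is
 * not reproduced in this thread. */
static unsigned bitcount_portable(uint32_t v)
{
    v = v - ((v >> 1) & 0x55555555u);                 /* 2-bit partial sums */
    v = (v & 0x33333333u) + ((v >> 2) & 0x33333333u); /* 4-bit partial sums */
    v = (v + (v >> 4)) & 0x0f0f0f0fu;                 /* 8-bit partial sums */
    return (unsigned)((v * 0x01010101u) >> 24);       /* add the four bytes */
}

/* USE_BUILTIN_POPCOUNT is a hypothetical build-time switch. With it set,
 * GCC-compatible compilers can lower the builtin to a single popcnt
 * instruction when the target flags allow it (e.g. -mpopcnt). */
static unsigned bitcount(uint32_t v)
{
#if defined(USE_BUILTIN_POPCOUNT) && defined(__GNUC__)
    return (unsigned)__builtin_popcount(v);
#else
    return bitcount_portable(v);
#endif
}
```

Either path returns the same count, so which one is faster on a given target can be settled by benchmarking rather than by removing one of them.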