On Thu, Jul 9, 2009 at 8:02 AM, Pauli Virtanen<pav...@iki.fi> wrote:
>
> I don't think we want to go the ATNumPy route, or even have
> tunable parameters chosen at build or compile time.

Detecting things like cache size at compile time should not be too
difficult, at least for common platforms. Even detecting it at runtime
should be relatively simple in some particular cases (x86).

BTW, one good baseline for those summations is to use dot:

    np.ones((80000, 256)).sum(axis=0)

vs

    np.dot(np.ones((1, 80000)), np.ones((80000, 256)))

Assuming dot uses an optimized BLAS, this is generally one order of
magnitude faster than sum.

> (Unless, of
> course, we want to bring a monster into the world -- think about
> cross-breeding distutils with the ATLAS build system :)

Kill me now :)

David
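
For anyone who wants to try the sum-versus-dot baseline mentioned above, here is a minimal timing sketch. The helper names (sum_axis0, sum_via_dot, compare) are made up for illustration and are not part of NumPy. Whether dot actually wins, and by how much, depends on the BLAS that NumPy is linked against and on the NumPy version, so treat the numbers as a rough check only.

```python
import timeit

import numpy as np


def sum_axis0(a):
    """Column sums using the plain reduction."""
    return a.sum(axis=0)


def sum_via_dot(a):
    """Column sums expressed as a (1, n) x (n, m) matrix product."""
    ones = np.ones((1, a.shape[0]), dtype=a.dtype)
    return np.dot(ones, a)[0]


def compare(rows=80000, cols=256, repeat=3):
    """Return best-of-`repeat` timings in seconds as (sum, dot).

    The default shape matches the one used in the message above.
    """
    a = np.ones((rows, cols))
    t_sum = min(timeit.repeat(lambda: sum_axis0(a), number=1, repeat=repeat))
    t_dot = min(timeit.repeat(lambda: sum_via_dot(a), number=1, repeat=repeat))
    return t_sum, t_dot
```

Calling compare() with no arguments allocates an array of roughly 160 MB; pass smaller rows and cols for a quick check. Note that sum_via_dot also times the allocation of the ones vector, as the one-liner above does.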