Re: [Numpy-discussion] Comparing NumPy/IDL Performance

Johann Cohen-Tanugi Mon, 26 Sep 2011 07:38:53 -0700

hi Keith,

I do not think that your primary concern should be with this kind ofspeed test at this stage :1/ rest assured that this sort of tests have been performed in othercontexts, and you can always do some hard work on high level computinglanguages like IDL and python to improve performance

2/ "early optimization is the root of all evil" (Knuth?)

3/ I believe that your primary motivation is to provide an alternativelibrary to a proprietary software. If this is so, then your effort ismost welcome and I would suggest first to port an interesting but smallpiece of the IDL solar physics lib and then study the path to speedimprovements on such a concrete use case.

As for your python time_test3, if it is a benchmark code proprietary tothe IDL codebas, there is no wonder it performs well there! :)

At any rate, I would suggest simplifying your code with ipython :

In [1]: import numpy as np
In [2]: a = np.zeros([512, 512], dtype=np.uint8)
In [3]: a[200:250, 200:250] = 10
In [4]: from scipy import ndimage
In [5]: %timeit ndimage.filters.median_filter(a, size=(5, 5))
10 loops, best of 3: 98 ms per loop

I am not sure what unit is your vertical axis....

best,
Johann

On 09/26/2011 04:19 PM, Keith Hughitt wrote:

Hi all,
Myself and several colleagues have recently started work on a Pythonlibrary for solar physics <http://www.sunpy.org/>, in order to providean alternative to the current mainstay for solar physics<http://www.lmsal.com/solarsoft/>, which is written in IDL.
One of the first steps we have taken is to create a Python port<https://github.com/sunpy/sunpy/blob/master/benchmarks/time_test3.py>of a popular benchmark for IDL (time_test3) which measures performancefor a variety of (primarily matrix) operations. In our initialattempt, however, Python performs significantly poorer than IDL forseveral of the tests. I have attached a graph which shows the resultsfor one machine: the x-axis is the test # being compared, and they-axis is the time it took to complete the test, in milliseconds.While it is possible that this is simply due to limitations inPython/Numpy, I suspect that this is due at least in part to our lackin familiarity with NumPy and SciPy.
So my question is, does anyone see any places where we are doingthings very inefficiently in Python?
In order to try and ensure a fair comparison between IDL and Pythonthere are some things (e.g. the style of timing and output) which wehave deliberately chosen to do a certain way. In other cases, however,it is likely that we just didn't know a better method.
Any feedback or suggestions people have would be greatly appreciated.Unfortunately, due to the proprietary nature of IDL, we cannot sharethe original version of time_test3, but hopefully the comments intime_test3.py will be clear enough.
Thanks!
Keith

--
This message has been scanned for viruses and
dangerous content by *MailScanner* <http://www.mailscanner.info/>, and is
believed to be clean.


_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion@scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion@scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion

Re: [Numpy-discussion] Comparing NumPy/IDL Performance

Reply via email to