subject:"Re\: \[julia\-users\] Re\: Performance variability \- can we expect Julia to be the fastest \(best\) language\?"



On Friday, May 1, 2015 at 9:16:43 AM UTC-4, Tim Holy wrote:

 On Friday, May 01, 2015 03:19:03 AM Scott Jones wrote: 
  As the string grows, Julia's internals end up having to reallocate the 
  memory and sometimes copy it to a new location, hence the O(n^2) nature 
 of 
  the code. 

 Small correction: push! is not O(n^2), it's O(nlogn). Internally, the 
 storage 
 array grows by factors of 2 [1]; after one allocation of size 2n you can 
 add n 
 more elements without reallocating. 


Good to know, I hate to say it, but the performance looked so bad to me, I 
didn't bother to see if it even had that optimization (which is exactly 
what I did for strings for the language I used to develop)

Does it always grow by factors of 2?  That might not be so good...  we 
found that after a certain point, it was better to increase in chunks, say 
of 64K, or 1M, because increasing the size that way of large LOBs could 
make you run out of memory fairly quickly...

 

 That said, O(nlogn) can be pretty easily beat by O(2n): make one pass 
 through 
 and count how many you'll need, allocate the whole thing, and then stuff 
 in 
 elements. As you seem to be planning to do. 


Yes, and have very nice performance improvements to show for it (most were 
around 4-10x faster, go look at what I put in my gist), and that's even 
with my pure Julia
version... :-)
 


 --Tim 

 [1] Last I looked, that is; there was some discussion about switching it 
 to 
 something like 1.5 because of various discussions of memory fragmentation 
 and 
 reuse. 


Still, same issue as I described above... probably better to increase by 2x 
up to a point, and then by chunk sizes, where the chunk sizes might slowly 
get larger...

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

On Friday, May 01, 2015 08:03:31 AM Scott Jones wrote:
 Still, same issue as I described above... probably better to increase by 2x 
 up to a point, and then by chunk sizes, where the chunk sizes might slowly
 get larger...

I see your point, but it will also break the O(nlogn) scaling. We couldn't 
hard-code the cutoff, because some people run julia on machines with 4GB of RAM 
and others with 1TB of RAM. So, we could query the amount of RAM available and 
switch based on that result, but since all this would only make a difference 
for operations that consume between 0.5x and 1x the user's RAM (which to me 
seems like a very narrow window, on the log scale), is it really worth the 
trouble?

--Tim

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

2015-05-01 Thread Jeff Bezanson

Steven -- I agree and I find it very refreshing that you're willing to
judge a language by more than just performance. Any given language can
always be optimized better, so ideally you want to compare them by
more robust criteria.

Obviously a particular system might have a well-tuned library routine
that's faster than our equivalent. But think about it: is having a
slow interpreter, and relying on code to spend all its time in
pre-baked library kernels the *right* way to get performance? That's
just the same boring design that has been used over and over again, in
matlab, IDL, octave, R, etc. In those cases the language isn't
bringing much to the table, except a pile of rules about how important
code must still be written in C/Fortran, and how your code must be
vectorized or shame on you.

On Fri, May 1, 2015 at 11:48 AM, Tim Holy tim.h...@gmail.com wrote:
 On Friday, May 01, 2015 08:03:31 AM Scott Jones wrote:
 Still, same issue as I described above... probably better to increase by 2x
 up to a point, and then by chunk sizes, where the chunk sizes might slowly
 get larger...

 I see your point, but it will also break the O(nlogn) scaling. We couldn't
 hard-code the cutoff, because some people run julia on machines with 4GB of 
 RAM
 and others with 1TB of RAM. So, we could query the amount of RAM available and
 switch based on that result, but since all this would only make a difference
 for operations that consume between 0.5x and 1x the user's RAM (which to me
 seems like a very narrow window, on the log scale), is it really worth the
 trouble?

 --Tim

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

2015-05-01 Thread Jameson Nash

The threshold would likely be most beneficial if it was based on pagesize
(which is constant relative to RAM size). For small allocations (less than
several megabytes), a modern malloc implementation typically uses a pool,
so growing a allocation (except by a small amount) will probably result in
a copy anyways, and no memory reuse. Once malloc switches to direct mmap
calls, then it probably makes sense to add pages at a more gradual rate.

On Fri, May 1, 2015 at 11:48 AM Tim Holy tim.h...@gmail.com wrote:

 On Friday, May 01, 2015 08:03:31 AM Scott Jones wrote:
  Still, same issue as I described above... probably better to increase by
 2x
  up to a point, and then by chunk sizes, where the chunk sizes might
 slowly
  get larger...

 I see your point, but it will also break the O(nlogn) scaling. We couldn't
 hard-code the cutoff, because some people run julia on machines with 4GB
 of RAM
 and others with 1TB of RAM. So, we could query the amount of RAM available
 and
 switch based on that result, but since all this would only make a
 difference
 for operations that consume between 0.5x and 1x the user's RAM (which to me
 seems like a very narrow window, on the log scale), is it really worth the
 trouble?

 --Tim

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

Of course I'm not saying loops should not be benchmarked and I do use loops
in julia also. I'm just saying that when doing performance comparison one
should try to write the programs in each language in their most optimal
style rather than similar style which is optimal for one language but very
suboptimal in another language.
Ah I didn't know the article was rebutted by Stefan. I read that article
before that happened and just looked it up again now as an example.

I guess the conclusion is that cross-language performance benchmarks are
very tricky which was kinda my point :)

On Friday, May 1, 2015 at 3:13:24 PM UTC+2, Tim Holy wrote:

Hi Steven,

I understand your point---you're saying you'd be unlikely to write those
algorithms in that manner, if your goal were to do those particular
computations. But the important point to keep in mind is that those
benchmarks
are simply toys for the purpose of testing performance of various
language
constructs. If you think it's irrelevant to benchmark loops for scientific
code, then you do very, very different stuff than me. Not all algorithms
reduce
to BLAS calls. I use julia to write all kinds of algorithms that I used to
write MEX functions for, back in my Matlab days. If all you need is A*b,
then
of course basically any scientific language will be just fine, with
minimal
differences in performance.

Moreover, that R benchmark on cumsum is simply not credible. I'm not sure
what
was happening (and that article doesn't post its code or procedures used
to
test), but julia's cumsum reduces to efficient machine code (basically, a
bunch
of addition operations). If they were computing cumsum across a specific
dimension, then this PR:
https://github.com/JuliaLang/julia/pull/7359
changed things. But more likely, someone forgot to run the code twice (so
it
got JIT-compiled), had a type-instability in the code they were testing,
or
some other mistake. It's too bad one can make mistakes, of course, but
then it
becomes a comparison of different programmers rather than different
programming
languages.

Indeed, if you read the comments in that post, Stefan already rebutted
that
benchmark, with a 4x advantage for Julia:

https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyone-else-wanna-challenge-r/comment-page-1/#comment-89

--Tim

On Friday, May 01, 2015 01:25:50 AM Steven Sagaert wrote:
I think the performance comparisons between Julia Python are flawed.
They
seem to be between standard Python Julia but since Julia is all about
scientific programming it really should be between SciPi Julia. Since
SciPi uses much of the same underlying libs in Fortran/C the performance
gap will be much smaller and to be really fair it should be between
numba
compiled SciPi code julia. I suspect the performance will be very
close
then (and close to C performance).

Similarly the standard benchmark (on the opening page of julia website)
between R julia is also flawed because it takes the best case scenario
for julia (loops mutable datastructures) the worst case scenario for
R.
When the same R program is rewritten in vectorised style it beat julia
see

https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyon
e-else-wanna-challenge-r/.

So my interest in julia isn't because it is the fastest scientific high
level language (because clearly at this stage you can't really claim
that)
but because it's a clean interesting language (still needs work for some
rough edges of course) with clean(er) clear(er) libraries and that
gives
reasonable performance out of the box without much tweaking.

On Friday, May 1, 2015 at 12:10:58 AM UTC+2, Scott Jones wrote:
Yes... Python will win on string processing... esp. with Python 3... I
quickly ran into things that were 800x faster in Python...
(I hope to help change that though!)

Scott

On Thursday, April 30, 2015 at 6:01:45 PM UTC-4, Páll Haraldsson
wrote:
I wouldn't expect a difference in Julia for code like that (didn't
check). But I guess what we are often seeing is someone comparing a
tuned
Python code to newbie Julia code. I still want it faster than that
code..
(assuming same algorithm, note row vs. column major caveat).

The main point of mine, *should* Python at any time win?

2015-04-30 21:36 GMT+00:00 Sisyphuss zhengw...@gmail.com:
This post interests me. I'll write something here to follow this
post.

The performance gap between normal code in Python and badly-written
code
in Julia is something I'd like to know too.
As far as I know, Python interpret does some mysterious
optimizations.
For example `(x**2)**2` is 100x faster than `x**4`.

On Thursday, April 30, 2015 at 9:58:35 PM UTC+2, Páll Haraldsson
wrote:
Hi,

[As a best

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

2015-05-01 Thread Stefan Karpinski

I'll quote one of my comments on this StackOverflow question
http://stackoverflow.com/questions/9968578/speeding-up-julias-poorly-written-r-examples
:

That all depends on what you are trying to measure. Personally, I'm not at
all interested in how fast one can compute Fibonacci numbers. Yet that is
one of our benchmarks. Why? Because I am very interested in how well
languages support recursion – and the doubly recursive algorithm happens to
be a great test of recursion, precisely because it is such a terrible way
to compute Fibonacci numbers. So what would be learned by comparing an
intentionally slow, excessively recursive algorithm in C and Julia against
a tricky, clever, vectorized algorithm in R? Nothing at all.

On Fri, May 1, 2015 at 12:58 PM, Steven Sagaert steven.saga...@gmail.com
wrote:

Of course I'm not saying loops should not be benchmarked and I do use
loops in julia also. I'm just saying that when doing performance comparison
one should try to write the programs in each language in their most optimal
style rather than similar style which is optimal for one language but very
suboptimal in another language.
Ah I didn't know the article was rebutted by Stefan. I read that article
before that happened and just looked it up again now as an example.

I guess the conclusion is that cross-language performance benchmarks are
very tricky which was kinda my point :)

On Friday, May 1, 2015 at 3:13:24 PM UTC+2, Tim Holy wrote:

Hi Steven,

I understand your point---you're saying you'd be unlikely to write those
algorithms in that manner, if your goal were to do those particular
computations. But the important point to keep in mind is that those
benchmarks
are simply toys for the purpose of testing performance of various
language
constructs. If you think it's irrelevant to benchmark loops for
scientific
code, then you do very, very different stuff than me. Not all algorithms
reduce
to BLAS calls. I use julia to write all kinds of algorithms that I used
to
write MEX functions for, back in my Matlab days. If all you need is A*b,
then
of course basically any scientific language will be just fine, with
minimal
differences in performance.

Indeed, if you read the comments in that post, Stefan already rebutted
that
benchmark, with a 4x advantage for Julia:

https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyone-else-wanna-challenge-r/comment-page-1/#comment-89

--Tim

On Friday, May 01, 2015 01:25:50 AM Steven Sagaert wrote:
I think the performance comparisons between Julia Python are flawed.
They
seem to be between standard Python Julia but since Julia is all about
scientific programming it really should be between SciPi Julia. Since
SciPi uses much of the same underlying libs in Fortran/C the
performance
gap will be much smaller and to be really fair it should be between
numba
compiled SciPi code julia. I suspect the performance will be very
close
then (and close to C performance).

Similarly the standard benchmark (on the opening page of julia website)
between R julia is also flawed because it takes the best case
scenario
for julia (loops mutable datastructures) the worst case scenario
for R.
When the same R program is rewritten in vectorised style it beat julia
see

https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyon
e-else-wanna-challenge-r/.

So my interest in julia isn't because it is the fastest scientific high
level language (because clearly at this stage you can't really claim
that)
but because it's a clean interesting language (still needs work for
some
rough edges of course) with clean(er) clear(er) libraries and that
gives
reasonable performance out of the box without much tweaking.

On Friday, May 1, 2015 at 12:10:58 AM UTC+2, Scott Jones wrote:
Yes... Python will win on string processing... esp. with Python 3...
I
quickly ran into things that were 800x faster in Python...
(I hope to help change that though!)

Scott

On Thursday, April 30, 2015 at 6:01:45 PM UTC-4, Páll Haraldsson
wrote:
I wouldn't expect a difference in Julia for code like that (didn't
check). But I guess what we are often seeing is someone

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

2015-05-01 Thread Steven G. Johnson



On Thursday, April 30, 2015 at 6:10:58 PM UTC-4, Scott Jones wrote:

 Yes... Python will win on string processing... esp. with Python 3... I 
 quickly ran into things that were  800x faster in Python...
 (I hope to help change that though!)


The 800x faster example that you've referred to several times, if I 
recall correctly, is one where you repeatedly concatenate strings.  In 
CPython, under certain circumstances, this is optimized to mutating one of 
the strings in-place and is consequently O(n) where n is the final length, 
although this is not guaranteed by the language itself.  In Julia, Ruby, 
Java, Go, and many other languages, concatenation allocates a new string 
and hence building a string by repeated concatenation is O(n^2).   That 
doesn't mean that those other languages lose on string processing to 
Python, it just means that you have to do things slightly differently (e.g. 
write to an IOBuffer in Julia).

You can't always expect the *same code* (translated as literally as 
possible) to be the optimal approach in different languages, and it is 
inflammatory to compare languages according to this standard.

A fairer question is whether it is *much harder* to get good performance in 
one language vs. another for a certain task.   There will certainly be 
tasks where Python is still superior in this sense simply because there are 
many cases where Python calls highly tuned C libraries for operations that 
have not been as optimized in Julia.  Julia will tend to shine the further 
you stray from built-in operations in your performance-critical code.

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

2015-05-01 Thread Steven G. Johnson

On Friday, May 1, 2015 at 1:12:00 PM UTC-4, Steven Sagaert wrote: 

 That wasn't what I was saying. I like the philosophy behind julia. But in 
 practice (as of now) even in julia you still have to code in a certain 
 style if you want very good performance and that's no different than in any 
 other language.


The goal of Julia is not to be a language in which it is *impossible* to 
write slow code, or a language in which all programming styles are equally 
fast.   The goal (or at least, one of the goals) is to be an expressive, 
high-level dynamic language, in which it is also *possible* to write 
performance-critical inner-loop code.

That *is* different from other high-level languages, in which it is 
typically *not* possible to write performance-critical inner-loop code 
without dropping down to a lower-level language (C, Fortran, Cython...).   
If you are coding exclusively in Python or R, and there isn't an optimized 
function appropriate for the innermost loops of your task at hand, you are 
out of luck.

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?





 Obviously a particular system might have a well-tuned library routine 
 that's faster than our equivalent. But think about it: is having a 
 slow interpreter, and relying on code to spend all its time in 
 pre-baked library kernels the *right* way to get performance? That's 
 just the same boring design that has been used over and over again, in 
 matlab, IDL, octave, R, etc. In those cases the language isn't 
 bringing much to the table, except a pile of rules about how important 
 code must still be written in C/Fortran, and how your code must be 
 vectorized or shame on you.


That wasn't what I was saying. I like the philosophy behind julia. But in 
practice (as of now) even in julia you still have to code in a certain 
style if you want very good performance and that's no different than in any 
other language. Ideally of course the compiler should be able to optimize 
the code so that different styles (e.g. functional/vectorized style vs 
imperative/loops style) gives the same performance and the programmer 
doesn't have to think about it and maybe one day it will be like that in 
julia but we're not quite there yet AFAIK.

Having said that, I like Julia and hopefully it will keep on getting 
better/faster. So good job and keep up the good work.



 On Fri, May 1, 2015 at 11:48 AM, Tim Holy tim@gmail.com javascript: 
 wrote: 
  On Friday, May 01, 2015 08:03:31 AM Scott Jones wrote: 
  Still, same issue as I described above... probably better to increase 
 by 2x 
  up to a point, and then by chunk sizes, where the chunk sizes might 
 slowly 
  get larger... 
  
  I see your point, but it will also break the O(nlogn) scaling. We 
 couldn't 
  hard-code the cutoff, because some people run julia on machines with 4GB 
 of RAM 
  and others with 1TB of RAM. So, we could query the amount of RAM 
 available and 
  switch based on that result, but since all this would only make a 
 difference 
  for operations that consume between 0.5x and 1x the user's RAM (which to 
 me 
  seems like a very narrow window, on the log scale), is it really worth 
 the 
  trouble? 
  
  --Tim

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?



On Friday, May 1, 2015 at 1:23:40 PM UTC-4, Steven G. Johnson wrote:

 On Friday, May 1, 2015 at 1:12:00 PM UTC-4, Steven Sagaert wrote: 

 That wasn't what I was saying. I like the philosophy behind julia. But in 
 practice (as of now) even in julia you still have to code in a certain 
 style if you want very good performance and that's no different than in any 
 other language.


 The goal of Julia is not to be a language in which it is *impossible* to 
 write slow code, or a language in which all programming styles are equally 
 fast.   The goal (or at least, one of the goals) is to be an expressive, 
 high-level dynamic language, in which it is also *possible* to write 
 performance-critical inner-loop code.


Yep, totally agree!  I had to deal with people (smart people too, who went 
to MIT also ;-) ) who expected the compiler/interpreter to magically 
improve their O(n^2) code!
 

 That *is* different from other high-level languages, in which it is 
 typically *not* possible to write performance-critical inner-loop code 
 without dropping down to a lower-level language (C, Fortran, Cython...).   
 If you are coding exclusively in Python or R, and there isn't an optimized 
 function appropriate for the innermost loops of your task at hand, you are 
 out of luck.


Also, very true...  I do hope that any issues that make my C version of UTF 
conversion routines faster than my equivalent Julia versions will be 
addressed before too long.
(and I don't even think it is that far off, or hard for any particular 
reason)

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?



On Friday, May 1, 2015 at 11:48:21 AM UTC-4, Tim Holy wrote:

 On Friday, May 01, 2015 08:03:31 AM Scott Jones wrote: 
  Still, same issue as I described above... probably better to increase by 
 2x 
  up to a point, and then by chunk sizes, where the chunk sizes might 
 slowly 
  get larger... 

 I see your point, but it will also break the O(nlogn) scaling. We couldn't 
 hard-code the cutoff, because some people run julia on machines with 4GB 
 of RAM 
 and others with 1TB of RAM. So, we could query the amount of RAM available 
 and 
 switch based on that result, but since all this would only make a 
 difference 
 for operations that consume between 0.5x and 1x the user's RAM (which to 
 me 
 seems like a very narrow window, on the log scale), is it really worth the 
 trouble? 

 --Tim 


For what I was doing, yes, it was definitely worth the trouble, because 
you'd have systems with 10s of thousands of processes (the limit was 64K on 
a single node), and you had to be very careful about not using up too much 
memory, and ending up thrashing...
Very different than when you maybe have a process for each core, and you 
have lots of memory for each one...
Different usage... different performance issues...

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?



On Friday, May 1, 2015 at 12:42:57 PM UTC-4, Steven G. Johnson wrote:



 On Thursday, April 30, 2015 at 6:10:58 PM UTC-4, Scott Jones wrote:

 Yes... Python will win on string processing... esp. with Python 3... I 
 quickly ran into things that were  800x faster in Python...
 (I hope to help change that though!)


 The 800x faster example that you've referred to several times, if I 
 recall correctly, is one where you repeatedly concatenate strings.  In 
 CPython, under certain circumstances, this is optimized to mutating one of 
 the strings in-place and is consequently O(n) where n is the final length, 
 although this is not guaranteed by the language itself.  In Julia, Ruby, 
 Java, Go, and many other languages, concatenation allocates a new string 
 and hence building a string by repeated concatenation is O(n^2).   That 
 doesn't mean that those other languages lose on string processing to 
 Python, it just means that you have to do things slightly differently (e.g. 
 write to an IOBuffer in Julia).


I just don't think that IOBuffers are a very good way to do that...  what I 
really need are mutable strings... and I know there is a package, and I 
need to investigate that further...
it's something that would be nice to have as part of the core of the 
language, instead of having to use either Vectors or IOBuffers...
As a new users, I would think, if I'm not doing IO, why should be using an 
IOBuffer...
 

 You can't always expect the *same code* (translated as literally as 
 possible) to be the optimal approach in different languages, and it is 
 inflammatory to compare languages according to this standard.


I was not intending to be inflammatory, just relating what my first 
experience was, which let me to investigate much more deeply, into the good 
and bad issues in Julia wrt performance (more good than bad, by a long 
shot).
 

 A fairer question is whether it is *much harder* to get good performance 
 in one language vs. another for a certain task.   There will certainly be 
 tasks where Python is still superior in this sense simply because there are 
 many cases where Python calls highly tuned C libraries for operations that 
 have not been as optimized in Julia.  Julia will tend to shine the further 
 you stray from built-in operations in your performance-critical code.


Yes, that is true... and that is why I'm betting on Julia in the long run 
(the other option for a lot of the code would have been Python or C++11, 
and I've already found Julia easier to deal with than either of them, even 
in it's pre 1.0 state)

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?



On Friday, May 1, 2015 at 12:38:33 PM UTC-4, Jeff Bezanson wrote:

 Steven -- I agree and I find it very refreshing that you're willing to 
 judge a language by more than just performance. Any given language can 
 always be optimized better, so ideally you want to compare them by 
 more robust criteria. 

 Obviously a particular system might have a well-tuned library routine 
 that's faster than our equivalent. But think about it: is having a 
 slow interpreter, and relying on code to spend all its time in 
 pre-baked library kernels the *right* way to get performance? That's 
 just the same boring design that has been used over and over again, in 
 matlab, IDL, octave, R, etc. In those cases the language isn't 
 bringing much to the table, except a pile of rules about how important 
 code must still be written in C/Fortran, and how your code must be 
 vectorized or shame on you. 


That's a very good point... and is one of the things I like a lot about 
Julia...
Even with my initial surprise about a single performance issue (the 
building up a string by concatenation), I did NOT judge Julia by that alone,
and have been quite happy with it overall [and I've been converting all of 
the developers at the startup where I'm consulting to Julia fans].
I also have faith, from what I've seen so far, is that performance issues 
*will* be addressed, as best as possible considering the architecture and 
goals of the language,
by a number of pretty smart people, both in and outside of the core team.

Scott

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?



On Friday, May 1, 2015 at 7:23:40 PM UTC+2, Steven G. Johnson wrote:

 On Friday, May 1, 2015 at 1:12:00 PM UTC-4, Steven Sagaert wrote: 

 That wasn't what I was saying. I like the philosophy behind julia. But in 
 practice (as of now) even in julia you still have to code in a certain 
 style if you want very good performance and that's no different than in any 
 other language.


 The goal of Julia is not to be a language in which it is *impossible* to 
 write slow code, or a language in which all programming styles are equally 
 fast. 


I didn't say that was a goal of Julia but it sure  would be nice to have 
though :) but probably an impossible dream.
 

   The goal (or at least, one of the goals) is to be an expressive, 
 high-level dynamic language, in which it is also *possible* to write 
 performance-critical inner-loop code.

 That *is* different from other high-level languages, in which it is 
 typically *not* possible to write performance-critical inner-loop code 
 without dropping down to a lower-level language (C, Fortran, Cython...).   
 If you are coding exclusively in Python or R, and there isn't an optimized 
 function appropriate for the innermost loops of your task at hand, you are 
 out of luck.

like I said: I like Julia and I am rooting for it but just to play devil's 
advocate: I believe it's also a goal ( possibility) of numba to write 
c-level efficient code in Python. All you have to do add an annotation here 
and there.

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

2015-05-01 Thread Jameson Nash

I believe that both are actually very similar in that manner. I think the
main difference comes from the fact that Julia is an attempt to design the
core library to support and use the efficient constructs, while Numba and
other related projects are, for better or worse, inheriting the default
python semantics and built-in libraries.

Sometimes a new language is better than an old language simply because it
can drop compatibility concerns. For example, Java is known for providing
far more consistent multi-threading support than C, since it is a language
construct and not an add-on feature. It was possible in both, one just made
it easier for the programmer to access. Similarly, Node made it feasible to
write programs without any concept of a blocking operation. Again, this was
already possible in languages like Python and C, but Node (with it's legacy
in Javascript), made it a feature of the language and designed all of the
core API's to deal with it.


On Fri, May 1, 2015 at 2:27 PM Steven G. Johnson stevenj@gmail.com
wrote:



 On Friday, May 1, 2015 at 2:04:44 PM UTC-4, Steven Sagaert wrote:

 like I said: I like Julia and I am rooting for it but just to play
 devil's advocate: I believe it's also a goal ( possibility) of numba to
 write c-level efficient code in Python. All you have to do add an
 annotation here and there.


 Numba is arguably a 2nd lower-level language that happens to be embedded
 in Python — it is telling that Numba's documentation explicitly states that
 it can only get good performance when it is able to JIT the inner loops in
 nopython mode — basically, code that doesn't stray outside a small set of
 types.

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

2015-05-01 Thread Steven G. Johnson



On Friday, May 1, 2015 at 2:04:44 PM UTC-4, Steven Sagaert wrote:

 like I said: I like Julia and I am rooting for it but just to play devil's 
 advocate: I believe it's also a goal ( possibility) of numba to write 
 c-level efficient code in Python. All you have to do add an annotation here 
 and there. 


Numba is arguably a 2nd lower-level language that happens to be embedded in 
Python — it is telling that Numba's documentation explicitly states that it 
can only get good performance when it is able to JIT the inner loops in 
nopython mode — basically, code that doesn't stray outside a small set of 
types.

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?



On Friday, May 1, 2015 at 1:25:41 AM UTC-4, Jeff Bezanson wrote:

 It is true that we have not yet done enough to optimize the worst and 
 worse performance cases. The bright side of that is that we have room 
 to improve; it's not that we've run out of ideas and techniques. 

 Tim is right that the complexity of our dispatch system makes julia 
 potentially slower than python. But in dispatch-heavy code I've seen 
 cases where we are faster or slower; it depends. 

 Python's string and dictionary operations, in particular, are really 
 fast. This is not surprising considering what the language was 
 designed for, and that they have a big library of well-tuned C code 
 for these things. 

 I still maintain that it is misleading to describe an *asymptotic* 
 slowdown as 800x slower. If you name a constant factor, it sounds 
 like you're talking about a constant factor slowdown. But the number 
 is arbitrary, because it depends on data size. In theory, of course, 
 an asymptotic slowdown is *much worse* than a constant factor 
 slowdown. However in the systems world constant factors are often more 
 important, and are often what we talk about. 


No, that was just my very first test comparing Julia  Python, using a size 
that matched the record sizes I'd typically seen from way too many years of
benchmarking (database / string processing operations)
 

 You say a lot of the algorithms are O(n) instead of O(1). Are there 
 any examples other than length()? 


Actually, it's worse than that... length, and getting finding a particular 
character by character position, and getting a substring by character 
position, some of the most frequent operations for what I deal with, are 
O(n) instead of O(1),  and things like conversions are O(n^2), not O(n) 
[and the conversions are much more complex, due to the string 
representation in Julia, unlike Python 3].
The conversions I am fixing, so that they are not O(n^2), but rather O(n) 
[slower than Python, again because of the representation, but not 
asymptotic].
The reason they are O(n^2), like the string concatenation problem I ran 
into right when I first started to evaluate Julia, is because of the way 
the conversion functions are written,
initially creating a 0-length array, and then doing push! to successively 
add characters to the array, and then finally calling UTF8String, 
UTF16String, or UTF32String to convert
the Vector{UInt8}, Vector{UInt16} or Vector{Char} respectively into an 
immutable string.
As the string grows, Julia's internals end up having to reallocate the 
memory and sometimes copy it to a new location, hence the O(n^2) nature of 
the code.

My changes, which hopefully will be accepted (after I check in my next 
round of pure Julia optimizations), solve that by first validating the 
input UTF-8, UTF-16, or UTF-32
string at the same time as calculating how many characters of the different 
ranges are present, so that the memory can be allocated once, exactly the 
size needed, and also
frequently allowing dispatching to simpler conversion code, when it is know 
that all of the characters in the string just need to be widened 
(zero-extended), or narrowed.

I disagree that UTF-8 has no space savings over UTF-32 when using the 
 full range of unicode. The reason is that strings often have only a 
 small percentage of non-BMP characters, with lots of spaces and 
 newlines etc. You don't want your whole file to use 4x the space just 
 to use one emoji. 


Please read my statement more carefully...

 UTF-8 *can* take up to 50% more storage than UTF-16 if you are just 
 dealing with BMP characters.
 If you have some field that needs to hold *a certain number of Unicode 
 characters*, for the full range of Unicode,
 you need to allocate 4 bytes for every character, so no savings compared 
 to UTF-16 or UTF-32.


My point was that if you have to allocate a buffer to hold a certain # of 
characters, say because you have a CHAR, NCHAR, or WCHAR, or VARCHAR, etc. 
field from a DBMS,
for UTF-8, you need to allocate at least 4 bytes per character, so no 
savings over UTF-16 or UTF-32 for those operations...

I spent over two years going back and forth to Japan, when I designed (and 
was the main implementor) for the Unicode support of a database system / 
language, and spent a lot of time looking at the just how much storage 
space different representations would take... Note, at that time, Unicode 
2.0 was not out, so the choice was between UCS-2 (no surrogates then), 
UTF-8, some combination thereof, or some new encoding.

My first version, released finally in 1997, used either 8-bit (ANSI Latin 
1) or UCS-2 to store data...  The next release, I came up with a new 
encoding for Unicode, that was much more compact (at the insistence of the 
Japanese customers, who didn't want their storage requirements to increase 
because of moving from SJIS and EUC to Unicode).
In memory, all strings were UCS-2 (or really UTF-16, but like Java, because 
I designed it

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

I think the performance comparisons between Julia Python are flawed. They
seem to be between standard Python Julia but since Julia is all about
scientific programming it really should be between SciPi Julia. Since
SciPi uses much of the same underlying libs in Fortran/C the performance
gap will be much smaller and to be really fair it should be between numba
compiled SciPi code julia. I suspect the performance will be very close
then (and close to C performance).

Similarly the standard benchmark (on the opening page of julia website)
between R julia is also flawed because it takes the best case scenario
for julia (loops mutable datastructures) the worst case scenario for R.
When the same R program is rewritten in vectorised style it beat julia
see
https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyone-else-wanna-challenge-r/.

So my interest in julia isn't because it is the fastest scientific high
level language (because clearly at this stage you can't really claim that)
but because it's a clean interesting language (still needs work for some
rough edges of course) with clean(er) clear(er) libraries and that gives
reasonable performance out of the box without much tweaking.

On Friday, May 1, 2015 at 12:10:58 AM UTC+2, Scott Jones wrote:

Yes... Python will win on string processing... esp. with Python 3... I
quickly ran into things that were 800x faster in Python...
(I hope to help change that though!)

Scott

On Thursday, April 30, 2015 at 6:01:45 PM UTC-4, Páll Haraldsson wrote:

I wouldn't expect a difference in Julia for code like that (didn't
check). But I guess what we are often seeing is someone comparing a tuned
Python code to newbie Julia code. I still want it faster than that code..
(assuming same algorithm, note row vs. column major caveat).

The main point of mine, *should* Python at any time win?

2015-04-30 21:36 GMT+00:00 Sisyphuss zhengw...@gmail.com:

This post interests me. I'll write something here to follow this post.

The performance gap between normal code in Python and badly-written code
in Julia is something I'd like to know too.
As far as I know, Python interpret does some mysterious optimizations.
For example `(x**2)**2` is 100x faster than `x**4`.

On Thursday, April 30, 2015 at 9:58:35 PM UTC+2, Páll Haraldsson wrote:

Hi,

[As a best language is subjective, I'll put that aside for a moment.]

Part I.

The goal, as I understand, for Julia is at least within a factor of two
of C and already matching it mostly and long term beating that (and C++).
[What other goals are there? How about 0.4 now or even 1.0..?]

While that is the goal as a language, you can write slow code in any
language and Julia makes that easier. :) [If I recall, Bezanson mentioned
it (the global problem) as a feature, any change there?]

I've been following this forum for months and newbies hit the same
issues. But almost always without fail, Julia can be speed up (easily as
Tim Holy says). I'm thinking about the exceptions to that - are there any
left? And about the first code slowness (see Part II).

Just recently the last two flaws of Julia that I could see where fixed:
Decimal floating point is in (I'll look into the 100x slowness, that is
probably to be expected of any language, still I think may be a
misunderstanding and/or I can do much better). And I understand the tuple
slowness has been fixed (that was really the only core language defect).
The former wasn't a performance problem (mostly a non existence problem
and
correctness one (where needed)..).

Still we see threads like this one recent one:

https://groups.google.com/forum/#!topic/julia-users/-bx9xIfsHHw
It seems changing the order of nested loops also helps

Obviously Julia can't beat assembly but really C/Fortran is already
close enough (within a small factor). The above row vs. column major
(caching effects in general) can kill performance in all languages.
Putting
that newbie mistake aside, is there any reason Julia can be within a small
factor of assembly (or C) in all cases already?

Part II.

Except for caching issues, I still want the most newbie code or
intentionally brain-damaged code to run faster than at least
Python/scripting/interpreted languages.

Potential problems (that I think are solved or at least not problems in
theory):

1. I know Any kills performance. Still, isn't that the default in
Python (and Ruby, Perl?)? Is there a good reason Julia can't be faster
than
at least all the so-called scripting languages in all cases (excluding
small startup overhead, see below)?

2. The global issue, not sure if that slows other languages down, say
Python. Even if it doesn't, should Julia be slower than Python because of
global?

3. Garbage collection. I do not see that as a problem, incorrect?
Mostly performance variability ([3D] games - subject for another post,
as
I'm not sure

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

On Friday, May 1, 2015 at 4:25:50 AM UTC-4, Steven Sagaert wrote:

I think the performance comparisons between Julia Python are flawed.
They seem to be between standard Python Julia but since Julia is all
about scientific programming it really should be between SciPi Julia.
Since SciPi uses much of the same underlying libs in Fortran/C the
performance gap will be much smaller and to be really fair it should be
between numba compiled SciPi code julia. I suspect the performance will
be very close then (and close to C performance).

Why should Julia be limited to scientific programming?
I think it can be a great language for general programming, for the most
part, I think it already is (it can use some changes for string handling
[I'd like to work on that ;-)], decimal floating point support [that is
currently being addressed, kudos to Steven G. Johnson], maybe some better
language constructs to allow better software engineering practices [that is
being hotly debated!], and definitely a real debugger [I think keno is
working on that]).

Comparing Julia to Python for general computing is totally valid and
interesting.
Comparing Julia to SciPy for scientific computing is also totally valid and
interesting.

Similarly the standard benchmark (on the opening page of julia website)
between R julia is also flawed because it takes the best case scenario
for julia (loops mutable datastructures) the worst case scenario for R.
When the same R program is rewritten in vectorised style it beat julia see
https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyone-else-wanna-challenge-r/
.

So my interest in julia isn't because it is the fastest scientific high
level language (because clearly at this stage you can't really claim that)
but because it's a clean interesting language (still needs work for some
rough edges of course) with clean(er) clear(er) libraries and that gives
reasonable performance out of the box without much tweaking.

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

Don't apologize; instead, tell us more about what Go does, and how you think 
things can be better. Those of us who don't know Go will thank you for it.

Best,
--Tim

On Thursday, April 30, 2015 09:42:47 PM Harry B wrote:
 Sorry my comment wasn't well thought out and a bit off topic. On
 exceptions/errors my issue is this
 https://github.com/JuliaLang/julia/issues/7026
 On profiling, I was comparing to Go, but again off topic and I take my
 comment back. I don't have any intelligent remarks to add (yet!) :)
 Thank you for the all the work you are doing.
 
 On Thursday, April 30, 2015 at 7:00:01 PM UTC-7, Tim Holy wrote:
  Harry, I'm curious about 2 of your 3 last points:
  
  On Thursday, April 30, 2015 05:50:15 PM Harry B wrote:
   (exceptions?, debugging, profiling tools)
  
  We have exceptions. What aspect are you referring to?
  Debugger: yes, that's missing, and it's a huge gap.
  Profiling tools: in my view we're doing OK (better than Matlab, in my
  opinion),
  but what do you see as missing?
  
  --Tim
  
   Thanks
   
It seemed to me tuples where slow because of Any used. I understand
  
  tuples
  
have been fixed, I'm not sure how.

I do not remember the post/all the details. Yes, tuples where slow/er
  
  than
  
Python. Maybe it was Dict, isn't that kind of a tuple? Now we have
  
  Pair in
  
0.4. I do not have 0.4, maybe I should bite the bullet and install..
  
  I'm
  
not doing anything production related and trying things out and using
0.3[.5] to avoid stability problems.. Then I can't judge the speed..

Another potential issue I saw with tuples (maybe that is not a problem
  
  in
  
general, and I do not know that languages do this) is that they can
  
  take a
  
lot of memory (to copy around). I was thinking, maybe they should do
similar to databases, only use a fixed amount of memory (a page)
  
  with a
  
pointer to overflow data..

2015-04-30 22:13 GMT+00:00 Ali Rezaee arv@gmail.com
  
  javascript::
They were interesting questions.
I would also like to know why poorly written Julia code
sometimes performs worse than similar python code, especially when
  
  tuples
  
are involved. Did you say it was fixed?

On Thursday, April 30, 2015 at 9:58:35 PM UTC+2, Páll Haraldsson
  
  wrote:
Hi,

[As a best language is subjective, I'll put that aside for a
  
  moment.]
  
Part I.

The goal, as I understand, for Julia is at least within a factor of
  
  two
  
of C and already matching it mostly and long term beating that (and
C++).
[What other goals are there? How about 0.4 now or even 1.0..?]

While that is the goal as a language, you can write slow code in any
language and Julia makes that easier. :) [If I recall, Bezanson
mentioned
it (the global problem) as a feature, any change there?]


I've been following this forum for months and newbies hit the same
issues. But almost always without fail, Julia can be speed up
  
  (easily as
  
Tim Holy says). I'm thinking about the exceptions to that - are
  
  there
  
any
left? And about the first code slowness (see Part II).

Just recently the last two flaws of Julia that I could see where
  
  fixed:
Decimal floating point is in (I'll look into the 100x slowness, that
  
  is
  
probably to be expected of any language, still I think may be a
misunderstanding and/or I can do much better). And I understand the
tuple
slowness has been fixed (that was really the only core language
defect).
The former wasn't a performance problem (mostly a non existence
  
  problem
  
and
correctness one (where needed)..).


Still we see threads like this one recent one:

https://groups.google.com/forum/#!topic/julia-users/-bx9xIfsHHw
It seems changing the order of nested loops also helps

Obviously Julia can't beat assembly but really C/Fortran is already
close enough (within a small factor). The above row vs. column major
(caching effects in general) can kill performance in all languages.
Putting
that newbie mistake aside, is there any reason Julia can be within a
small
factor of assembly (or C) in all cases already?


Part II.

Except for caching issues, I still want the most newbie code or
intentionally brain-damaged code to run faster than at least
Python/scripting/interpreted languages.

Potential problems (that I think are solved or at least not problems
  
  in
  
theory):

1. I know Any kills performance. Still, isn't that the default in
  
  Python
  
(and Ruby, Perl?)? Is there a good reason Julia can't be faster than
  
  at
  
least all the so-called scripting languages in all cases (excluding
small
startup overhead, see below)?

2. The global issue, not sure if that slows other languages down,
  
  say

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

I just read through all of that very interesting thread on exceptions... it 
seems that Stefan was trying to reinvent the wheel, without knowing it.

Everybody interested in exception handling should go look up CLU... Julia 
seems to have gotten a lot of ideas from CLU (possibly rather indirectly,
through C++, Java, Lua...).
CLU had this well handled 40 years ago ;-)

Scott

On Friday, May 1, 2015 at 12:42:47 AM UTC-4, Harry B wrote:

 Sorry my comment wasn't well thought out and a bit off topic. On 
 exceptions/errors my issue is this 
 https://github.com/JuliaLang/julia/issues/7026
 On profiling, I was comparing to Go, but again off topic and I take my 
 comment back. I don't have any intelligent remarks to add (yet!) :)
 Thank you for the all the work you are doing. 

 On Thursday, April 30, 2015 at 7:00:01 PM UTC-7, Tim Holy wrote:

 Harry, I'm curious about 2 of your 3 last points: 

 On Thursday, April 30, 2015 05:50:15 PM Harry B wrote: 
  (exceptions?, debugging, profiling tools) 

 We have exceptions. What aspect are you referring to? 
 Debugger: yes, that's missing, and it's a huge gap. 
 Profiling tools: in my view we're doing OK (better than Matlab, in my 
 opinion), 
 but what do you see as missing? 

 --Tim 

  
  Thanks 
  -- 
  Harry 
  
  On Thursday, April 30, 2015 at 3:43:36 PM UTC-7, Páll Haraldsson wrote: 
   It seemed to me tuples where slow because of Any used. I understand 
 tuples 
   have been fixed, I'm not sure how. 
   
   I do not remember the post/all the details. Yes, tuples where slow/er 
 than 
   Python. Maybe it was Dict, isn't that kind of a tuple? Now we have 
 Pair in 
   0.4. I do not have 0.4, maybe I should bite the bullet and install.. 
 I'm 
   not doing anything production related and trying things out and using 
   0.3[.5] to avoid stability problems.. Then I can't judge the speed.. 
   
   Another potential issue I saw with tuples (maybe that is not a 
 problem in 
   general, and I do not know that languages do this) is that they can 
 take a 
   lot of memory (to copy around). I was thinking, maybe they should do 
   similar to databases, only use a fixed amount of memory (a page) 
 with a 
   pointer to overflow data.. 
   
   2015-04-30 22:13 GMT+00:00 Ali Rezaee arv@gmail.com 
 javascript:: 
   They were interesting questions. 
   I would also like to know why poorly written Julia code 
   sometimes performs worse than similar python code, especially when 
 tuples 
   are involved. Did you say it was fixed? 
   
   On Thursday, April 30, 2015 at 9:58:35 PM UTC+2, Páll Haraldsson 
 wrote: 
   Hi, 
   
   [As a best language is subjective, I'll put that aside for a 
 moment.] 
   
   Part I. 
   
   The goal, as I understand, for Julia is at least within a factor of 
 two 
   of C and already matching it mostly and long term beating that (and 
   C++). 
   [What other goals are there? How about 0.4 now or even 1.0..?] 
   
   While that is the goal as a language, you can write slow code in 
 any 
   language and Julia makes that easier. :) [If I recall, Bezanson 
   mentioned 
   it (the global problem) as a feature, any change there?] 
   
   
   I've been following this forum for months and newbies hit the same 
   issues. But almost always without fail, Julia can be speed up 
 (easily as 
   Tim Holy says). I'm thinking about the exceptions to that - are 
 there 
   any 
   left? And about the first code slowness (see Part II). 
   
   Just recently the last two flaws of Julia that I could see where 
 fixed: 
   Decimal floating point is in (I'll look into the 100x slowness, 
 that is 
   probably to be expected of any language, still I think may be a 
   misunderstanding and/or I can do much better). And I understand the 
   tuple 
   slowness has been fixed (that was really the only core language 
   defect). 
   The former wasn't a performance problem (mostly a non existence 
 problem 
   and 
   correctness one (where needed)..). 
   
   
   Still we see threads like this one recent one: 
   
   https://groups.google.com/forum/#!topic/julia-users/-bx9xIfsHHw 
   It seems changing the order of nested loops also helps 
   
   Obviously Julia can't beat assembly but really C/Fortran is already 
   close enough (within a small factor). The above row vs. column 
 major 
   (caching effects in general) can kill performance in all languages. 
   Putting 
   that newbie mistake aside, is there any reason Julia can be within 
 a 
   small 
   factor of assembly (or C) in all cases already? 
   
   
   Part II. 
   
   Except for caching issues, I still want the most newbie code or 
   intentionally brain-damaged code to run faster than at least 
   Python/scripting/interpreted languages. 
   
   Potential problems (that I think are solved or at least not 
 problems in 
   theory): 
   
   1. I know Any kills performance. Still, isn't that the default in 
 Python 
   (and Ruby, Perl?)? Is there a good reason Julia can't be faster 
 than at 
   least all

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

On Friday, May 1, 2015 at 12:26:54 PM UTC+2, Scott Jones wrote:

On Friday, May 1, 2015 at 4:25:50 AM UTC-4, Steven Sagaert wrote:

I think the performance comparisons between Julia Python are flawed.
They seem to be between standard Python Julia but since Julia is all
about scientific programming it really should be between SciPi Julia.
Since SciPi uses much of the same underlying libs in Fortran/C the
performance gap will be much smaller and to be really fair it should be
between numba compiled SciPi code julia. I suspect the performance will
be very close then (and close to C performance).

Why should Julia be limited to scientific programming?
I think it can be a great language for general programming,

I agree but for now the short time future I think the core domain of
julia is scientific computing/data science and so to have fair comparisons
one should not just compare julia to vanilla Python but especially scipi
numba.

for the most part, I think it already is (it can use some changes for
string handling [I'd like to work on that ;-)], decimal floating point
support [that is currently being addressed, kudos to Steven G. Johnson],
maybe some better language constructs to allow better software engineering
practices [that is being hotly debated!], and definitely a real debugger [I
think keno is working on that]).

Comparing Julia to Python for general computing is totally valid and
interesting.
Comparing Julia to SciPy for scientific computing is also totally valid
and interesting.

Similarly the standard benchmark (on the opening page of julia website)
between R julia is also flawed because it takes the best case scenario
for julia (loops mutable datastructures) the worst case scenario for R.
When the same R program is rewritten in vectorised style it beat julia see
https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyone-else-wanna-challenge-r/
.

So my interest in julia isn't because it is the fastest scientific high
level language (because clearly at this stage you can't really claim that)
but because it's a clean interesting language (still needs work for some
rough edges of course) with clean(er) clear(er) libraries and that gives
reasonable performance out of the box without much tweaking.

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

2015-05-01 Thread Patrick O'Leary

On Friday, May 1, 2015 at 3:25:50 AM UTC-5, Steven Sagaert wrote:

 I think the performance comparisons between Julia  Python are flawed. 
 They seem to be between standard Python  Julia but since Julia is all 
 about scientific programming it really should be between SciPi  Julia. 
 Since SciPi uses much of the same underlying libs in Fortran/C the 
 performance gap will be much smaller and to be really fair it should be 
 between numba compiled SciPi code  julia. I suspect the performance will 
 be very close then (and close to C performance).

 Similarly the standard benchmark (on the opening page of julia website) 
 between R  julia is also flawed because it takes the best case scenario 
 for julia (loops  mutable datastructures)  the worst case scenario for R. 
 When the same R program is rewritten in vectorised style it beat julia see 
 https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyone-else-wanna-challenge-r/
 .


All benchmarks are flawed in that sense--a single benchmark can't tell you 
everything. The Julia performance benchmarks are testing algorithms 
expressed in the langauges themselves. It is not a test of foreign-function 
interfaces and BLAS implementations, so the benchmarks don't test that. 
This has been discussed at length--as one example, see 
https://github.com/JuliaLang/julia/issues/2412.

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?

Hi Steven,

I understand your point---you're saying you'd be unlikely to write those
algorithms in that manner, if your goal were to do those particular
computations. But the important point to keep in mind is that those benchmarks
are simply toys for the purpose of testing performance of various language
constructs. If you think it's irrelevant to benchmark loops for scientific
code, then you do very, very different stuff than me. Not all algorithms reduce
to BLAS calls. I use julia to write all kinds of algorithms that I used to
write MEX functions for, back in my Matlab days. If all you need is A*b, then
of course basically any scientific language will be just fine, with minimal
differences in performance.

Moreover, that R benchmark on cumsum is simply not credible. I'm not sure what
was happening (and that article doesn't post its code or procedures used to
test), but julia's cumsum reduces to efficient machine code (basically, a bunch
of addition operations). If they were computing cumsum across a specific
dimension, then this PR:
https://github.com/JuliaLang/julia/pull/7359
changed things. But more likely, someone forgot to run the code twice (so it
got JIT-compiled), had a type-instability in the code they were testing, or
some other mistake. It's too bad one can make mistakes, of course, but then it
becomes a comparison of different programmers rather than different programming
languages.

Indeed, if you read the comments in that post, Stefan already rebutted that
benchmark, with a 4x advantage for Julia:
https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyone-else-wanna-challenge-r/comment-page-1/#comment-89

--Tim

On Friday, May 01, 2015 01:25:50 AM Steven Sagaert wrote:
I think the performance comparisons between Julia Python are flawed. They
seem to be between standard Python Julia but since Julia is all about
scientific programming it really should be between SciPi Julia. Since
SciPi uses much of the same underlying libs in Fortran/C the performance
gap will be much smaller and to be really fair it should be between numba
compiled SciPi code julia. I suspect the performance will be very close
then (and close to C performance).

Similarly the standard benchmark (on the opening page of julia website)
between R julia is also flawed because it takes the best case scenario
for julia (loops mutable datastructures) the worst case scenario for R.
When the same R program is rewritten in vectorised style it beat julia
see
https://matloff.wordpress.com/2014/05/21/r-beats-python-r-beats-julia-anyon
e-else-wanna-challenge-r/.

So my interest in julia isn't because it is the fastest scientific high
level language (because clearly at this stage you can't really claim that)
but because it's a clean interesting language (still needs work for some
rough edges of course) with clean(er) clear(er) libraries and that gives
reasonable performance out of the box without much tweaking.

Scott

On Thursday, April 30, 2015 at 6:01:45 PM UTC-4, Páll Haraldsson wrote:
I wouldn't expect a difference in Julia for code like that (didn't
check). But I guess what we are often seeing is someone comparing a tuned
Python code to newbie Julia code. I still want it faster than that code..
(assuming same algorithm, note row vs. column major caveat).

The main point of mine, *should* Python at any time win?

2015-04-30 21:36 GMT+00:00 Sisyphuss zhengw...@gmail.com:
This post interests me. I'll write something here to follow this post.

The performance gap between normal code in Python and badly-written code
in Julia is something I'd like to know too.
As far as I know, Python interpret does some mysterious optimizations.
For example `(x**2)**2` is 100x faster than `x**4`.

On Thursday, April 30, 2015 at 9:58:35 PM UTC+2, Páll Haraldsson wrote:
Hi,

[As a best language is subjective, I'll put that aside for a moment.]

Part I.

The goal, as I understand, for Julia is at least within a factor of two
of C and already matching it mostly and long term beating that (and
C++).
[What other goals are there? How about 0.4 now or even 1.0..?]

While that is the goal as a language, you can write slow code in any
language and Julia makes that easier. :) [If I recall, Bezanson
mentioned
it (the global problem) as a feature, any change there?]

I've been following this forum for months and newbies hit the same
issues. But almost always without fail, Julia can be speed up (easily
as
Tim Holy says). I'm thinking about the exceptions to that - are there
any
left? And about the first code slowness (see Part II).

Just recently the last two flaws of Julia

Re: [julia-users] Re: Performance variability - can we expect Julia to be the fastest (best) language?