Re: [julia-users] Re: TinySegmenter benchmark

2015-11-06 Thread Michiaki ARIGA
Finally, I compared with C++ and Go version using TinySegmenterMaker. https://github.com/shogo82148/TinySegmenterMaker/pull/10 The resulting times(in seconds for 100 loops a text file) is following: RubyC++PerlNode.jsGoPythonJulia132.9848134105.3110.50111.8511.70 After my blog post, ikawaha

Re: [julia-users] Re: TinySegmenter benchmark

2015-11-06 Thread Stefan Karpinski
I'm kind of surprised that C++ is so slow. I would imagine that anything you can do performance-wise in Go or Julia, you ought to be able to do in C++. Any idea what's going on there? On Fri, Nov 6, 2015 at 12:00 PM, Michiaki ARIGA wrote: > Finally, I compared with C++ and Go

Re: [julia-users] Re: TinySegmenter benchmark

2015-11-06 Thread Steven G. Johnson
Note that the Go version went one step further and packed the tuples into 64-bit integers where possible. We could do the same thing, though at this point we seem to be hitting the point of diminishing returns.

Re: [julia-users] Re: TinySegmenter benchmark

2015-11-06 Thread Steven G. Johnson
The C++ version is basically transcribed from the JavaScript version and constructs tons of temporary strings. The key improvement we made (and Go subsequently adapted) is to use tuples of Char instead for the hash tables. I'm pretty pleased that Julia is within 10% of the Go version, and

Re: [julia-users] Re: TinySegmenter benchmark

2015-10-22 Thread Michiaki ARIGA
Masahiro Nakagawa a.k.a. repeatedly told me my mistakes of the benchmark, I re-benchmarked. Node.jsPython2Python3JuliaRuby9.6293.0823.941.4619.44 - loop number of Python was 10 times smaller than other languages - repeatedly optimized Ruby implementation - changed loop size from 100 to 10

[julia-users] Re: TinySegmenter benchmark

2015-10-21 Thread Michiaki Ariga
Thanks for Steven's great help, I learned many things to optimize string operation of Julia. Finally, I wrote this episode on my blog (in Japanese only, sorry). http://chezou.hatenablog.com/entry/2015/10/21/234317 -- chezou 2015年10月21日水曜日 2時49分56秒 UTC+9 Steven G. Johnson: > > I thought people

Re: [julia-users] Re: TinySegmenter benchmark

2015-10-21 Thread Stefan Karpinski
That's an excellent performance comparison case study! Nice work, Chezou and nice blog post (the Google translation is pretty readable). On Wed, Oct 21, 2015 at 10:58 AM, Michiaki Ariga wrote: > Thanks for Steven's great help, I learned many things to optimize string >

Re: [julia-users] Re: TinySegmenter benchmark

2015-10-21 Thread Pontus Stenetorp
On 21 October 2015 at 17:49, Stefan Karpinski wrote: > > That's an excellent performance comparison case study! Nice work, Chezou and > nice blog post (the Google translation is pretty readable). Very readable indeed and I am always happy to see more NLP code in Julia!