Finally, I compared with C++ and Go version using TinySegmenterMaker.
https://github.com/shogo82148/TinySegmenterMaker/pull/10
The resulting times(in seconds for 100 loops a text file) is following:
RubyC++PerlNode.jsGoPythonJulia132.9848134105.3110.50111.8511.70
After my blog post, ikawaha
I'm kind of surprised that C++ is so slow. I would imagine that anything
you can do performance-wise in Go or Julia, you ought to be able to do in
C++. Any idea what's going on there?
On Fri, Nov 6, 2015 at 12:00 PM, Michiaki ARIGA wrote:
> Finally, I compared with C++ and Go
Note that the Go version went one step further and packed the tuples into
64-bit integers where possible. We could do the same thing, though at this
point we seem to be hitting the point of diminishing returns.
The C++ version is basically transcribed from the JavaScript version and
constructs tons of temporary strings. The key improvement we made (and Go
subsequently adapted) is to use tuples of Char instead for the hash tables.
I'm pretty pleased that Julia is within 10% of the Go version, and
Masahiro Nakagawa a.k.a. repeatedly told me my mistakes of the benchmark, I
re-benchmarked.
Node.jsPython2Python3JuliaRuby9.6293.0823.941.4619.44
- loop number of Python was 10 times smaller than other languages
- repeatedly optimized Ruby implementation
- changed loop size from 100 to 10
Thanks for Steven's great help, I learned many things to optimize string
operation of Julia.
Finally, I wrote this episode on my blog (in Japanese only, sorry).
http://chezou.hatenablog.com/entry/2015/10/21/234317
-- chezou
2015年10月21日水曜日 2時49分56秒 UTC+9 Steven G. Johnson:
>
> I thought people
That's an excellent performance comparison case study! Nice work, Chezou
and nice blog post (the Google translation is pretty readable).
On Wed, Oct 21, 2015 at 10:58 AM, Michiaki Ariga wrote:
> Thanks for Steven's great help, I learned many things to optimize string
>
On 21 October 2015 at 17:49, Stefan Karpinski wrote:
>
> That's an excellent performance comparison case study! Nice work, Chezou and
> nice blog post (the Google translation is pretty readable).
Very readable indeed and I am always happy to see more NLP code in
Julia!