[julia-users] Re: String performance in Julia

Keith Campbell Tue, 12 May 2015 12:33:02 -0700

Yes, I care about faster conversions between string types.


On Monday, May 11, 2015 at 10:52:28 PM UTC-4, Scott Jones wrote:
>
> I would like to get the performance of string handling in Julia improved 
> (and also correct a number of flaws in handling Unicode).
> Currently, many of the operations are very slow (because of the way 
> strings are represented, and also simply
> because the algorithms used were not as fast as they could be).
> For communicating with Java where the String type is always UTF-16, C/C++, 
> where wide strings are either UTF-16 or UTF-32,
> databases (which frequently use UTF-16 in Asia, because UTF-8 can take 
> more space, depending on the data), or using Unicode
> APIs in Windows or the ICU libraries which use UTF-16, the performance of 
> the conversions between the ASCIIString/UTF8String
> types that Julia normally uses for strings and UTF-16 can be critical for 
> an applications performance.
>
> I have a PR (#11004) to fix the performance issue #10959 I raised (in the 
> GitHub repository ScottPJones/julia branch spj/fast_utf),
> if anybody would like to try this out.
> My own testing shows that it gives about a 2-10x improvement (most of the 
> time, 10x), you can get my latest benchmark results
> along with the code I used to benchmark it at my gist, 
> https://gist.github.com/ScottPJones/bb712f7b85d1d8d91a9a.
>
> I am curious if these sorts of performance improvements (along with better 
> error handling and Unicode input validation)
> would be important to anybody else, as I am trying to convince people to 
> merge this PR into the Julia base...
>
> Thanks, Scott
>
>
>

[julia-users] Re: String performance in Julia

Reply via email to