[ 
https://issues.apache.org/jira/browse/ARROW-12987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17358608#comment-17358608
 ] 

Anthony Louis Gotlib Ferreira commented on ARROW-12987:
-------------------------------------------------------

[~jorisvandenbossche] I found the error. I think it is a bug in the *utf8proc* 
library. 

The library is not converting the [https://www.compart.com/en/unicode/U+00DF] 
character (ß) to its uppercase representation, which is 
[https://www.compart.com/en/unicode/U+1E9E] (ẞ).
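
To relate the two code points to the bytes in the failure message, here is a minimal sketch of UTF-8 encoding. It is not taken from Arrow or utf8proc; the helper names EncodeUtf8 and ToHex are hypothetical and exist only for this illustration.

```cpp
#include <cstdint>
#include <string>

// Hypothetical helper: encode one Unicode code point as UTF-8.
// Input validation (surrogates, values above U+10FFFF) is omitted for brevity.
std::string EncodeUtf8(std::uint32_t cp) {
  std::string out;
  if (cp < 0x80) {
    // 1 byte: 0xxxxxxx
    out += static_cast<char>(cp);
  } else if (cp < 0x800) {
    // 2 bytes: 110xxxxx 10xxxxxx
    out += static_cast<char>(0xC0 | (cp >> 6));
    out += static_cast<char>(0x80 | (cp & 0x3F));
  } else if (cp < 0x10000) {
    // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
    out += static_cast<char>(0xE0 | (cp >> 12));
    out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
    out += static_cast<char>(0x80 | (cp & 0x3F));
  } else {
    // 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    out += static_cast<char>(0xF0 | (cp >> 18));
    out += static_cast<char>(0x80 | ((cp >> 12) & 0x3F));
    out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
    out += static_cast<char>(0x80 | (cp & 0x3F));
  }
  return out;
}

// Hypothetical helper: render bytes as space-separated uppercase hex.
std::string ToHex(const std::string& bytes) {
  static const char kDigits[] = "0123456789ABCDEF";
  std::string out;
  for (unsigned char b : bytes) {
    if (!out.empty()) out += ' ';
    out += kDigits[b >> 4];
    out += kDigits[b & 0x0F];
  }
  return out;
}
```

With these helpers, ToHex(EncodeUtf8(0x00DF)) gives "C3 9F" and ToHex(EncodeUtf8(0x1E9E)) gives "E1 BA 9E". These are the trailing bytes of the actual and expected strings in the test output, so the only difference between the two strings is that ß was left unchanged.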

 

I took a look at the *utf8proc* repository and saw that the error was fixed 
by Pull Request [https://github.com/JuliaStrings/utf8proc/pull/134], which 
is part of the 2.2 release: 
[https://github.com/JuliaStrings/utf8proc/releases/tag/v2.2.0]

So I think that the *utf8proc* library version is outdated in the nightly build 
environment. I will check with the members how to update it.

> [CI] test-ubuntu-18.04 nightly builds are failing due to Gandiva "TestUpper" 
> test failure
> -----------------------------------------------------------------------------------------
>
>                 Key: ARROW-12987
>                 URL: https://issues.apache.org/jira/browse/ARROW-12987
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++ - Gandiva, Continuous Integration
>            Reporter: Joris Van den Bossche
>            Assignee: Anthony Louis Gotlib Ferreira
>            Priority: Major
>
> Didn't yet see another JIRA for this, see eg 
> https://github.com/ursacomputing/crossbow/runs/2756066459. It's failing on 
> the "test-ubuntu-18.04-cpp", "test-ubuntu-18.04-cpp-release" and 
> "test-ubuntu-18.04-cpp-static" builds.
>  
> Failure message:
> {code}
>  [ RUN      ] TestGdvFnStubs.TestUpper
> /arrow/cpp/src/gandiva/gdv_function_stubs_test.cc:327: Failure
> Expected equality of these values:
>   std::string(out_str, out_len)
>     Which is: "M\xC3\x96" "BELTR\xC3\x84GERF\xC3\x9C\xC3\x9F" "E"
>     As Text: "MÖBELTRÄGERFÜßE"
>   "MÖBELTRÄGERFÜẞE"
>     Which is: "M\xC3\x96" "BELTR\xC3\x84GERF\xC3\x9C\xE1\xBA\x9E" "E"
>     As Text: "MÖBELTRÄGERFÜẞE"
> [  FAILED  ] TestGdvFnStubs.TestUpper (1 ms)
> {code}
> I _think_ this is failing since https://github.com/apache/arrow/pull/10040 / 
> ARROW-11565 (cc [~praveenbingo] [~anthonylouis] [~projjal])



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
