[ 
https://issues.apache.org/jira/browse/ARROW-14686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kazuaki Ishizaki updated ARROW-14686:
-------------------------------------
    Description: 
Although the current NumPy converter code tries to detect the endianness of a 
given object, the detection does not work for a built-in object. This is 
because a built-in object may have {{native |}} byte order.

This issue causes the following test failures:
{code:java}
FAILED pyarrow/tests/test_array.py::test_array_from_numpy_unicode - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
FAILED pyarrow/tests/test_array.py::test_array_from_strided - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
FAILED pyarrow/tests/test_array.py::test_array_from_numpy_str_utf8 - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
{code}
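
The failure mode can be reproduced outside pyarrow. The following is a minimal 
sketch in plain Python and NumPy, not the Arrow converter code: 
{{resolve_byte_order}} and {{decode_utf32}} are hypothetical helpers written 
only for illustration. It assumes that a dtype reporting a native byte order 
has to be resolved against the platform ({{sys.byteorder}}) before a UTF-32 
codec is chosen, and it shows that decoding big-endian UTF-32 bytes with the 
'utf-32-le' codec raises the same kind of UnicodeDecodeError as in the log 
above.

```python
import sys

import numpy as np


def resolve_byte_order(dtype):
    """Hypothetical helper: map a NumPy dtype to 'little', 'big', or None."""
    code = np.dtype(dtype).byteorder
    if code == "<":
        return "little"
    if code == ">":
        return "big"
    if code == "=":
        # Native order: the dtype does not name it, so ask the platform.
        return sys.byteorder
    # '|' means byte order is not applicable (e.g. single-byte types).
    return None


def decode_utf32(raw, dtype):
    """Hypothetical helper: decode UTF-32 bytes using the resolved order.

    Falls back to little-endian when the order is not applicable.
    """
    order = resolve_byte_order(dtype)
    codec = "utf-32-be" if order == "big" else "utf-32-le"
    return raw.decode(codec)


# Big-endian UTF-32 bytes for "a" are 00 00 00 61. Read as little-endian,
# they form 0x61000000, which is above the largest code point (0x10FFFF).
raw = "a".encode("utf-32-be")
try:
    raw.decode("utf-32-le")
    wrong_codec_failed = False
except UnicodeDecodeError:
    wrong_codec_failed = True

print(wrong_codec_failed)
print(decode_utf32(raw, ">U1"))
```

The sketch only illustrates why a fixed little-endian assumption breaks on 
big-endian data; how the actual converter should resolve the byte order is 
left to the fix for this issue.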
 

  was: Although the current NumPy converter code tries to detect the endianness 
of a given object, the detection does not work for a built-in object. This is 
because a built-in object may have `native |` byte order.


> [Python] Current endian decision for numpy does not work for built-in data 
> type.
> --------------------------------------------------------------------------------
>
>                 Key: ARROW-14686
>                 URL: https://issues.apache.org/jira/browse/ARROW-14686
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>    Affects Versions: 7.0.0
>            Reporter: Kazuaki Ishizaki
>            Assignee: Kazuaki Ishizaki
>            Priority: Major
>             Fix For: 7.0.0
>
>
> Although the current NumPy converter code tries to detect the endianness of a 
> given object, the detection does not work for a built-in object. This is 
> because a built-in object may have {{native |}} byte order.
> This issue causes the following test failures:
> {code:java}
> FAILED pyarrow/tests/test_array.py::test_array_from_numpy_unicode - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
> FAILED pyarrow/tests/test_array.py::test_array_from_strided - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
> FAILED pyarrow/tests/test_array.py::test_array_from_numpy_str_utf8 - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)