[ https://issues.apache.org/jira/browse/ARROW-14686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kazuaki Ishizaki updated ARROW-14686: ------------------------------------- Description: Although the current NumPy converter code tries to detect endian against a given object, it does not work for a built-in object. This is because a build-in object may have {{native |}} byte order. This issue causes test failure as follows: {code:java} FAILED pyarrow/tests/test_array.py::test_array_from_numpy_unicode - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000) FAILED pyarrow/tests/test_array.py::test_array_from_strided - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000) FAILED pyarrow/tests/test_array.py::test_array_from_numpy_str_utf8 - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000) {code} was:Although the current NumPy converter code tries to detect endian against a given object, it does not work for a built-in object. This is because a build-in object may have `native |` byte order. > [Python] Current endian decision for numpy does not work for built-in data > type. > -------------------------------------------------------------------------------- > > Key: ARROW-14686 > URL: https://issues.apache.org/jira/browse/ARROW-14686 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Affects Versions: 7.0.0 > Reporter: Kazuaki Ishizaki > Assignee: Kazuaki Ishizaki > Priority: Major > Fix For: 7.0.0 > > > Although the current NumPy converter code tries to detect endian against a > given object, it does not work for a built-in object. This is because a > build-in object may have {{native |}} byte order. > This issue causes test failure as follows: > {code:java} > FAILED pyarrow/tests/test_array.py::test_array_from_numpy_unicode - > UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: > code point not in range(0x110000) > FAILED pyarrow/tests/test_array.py::test_array_from_strided - > UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: > code point not in range(0x110000) > FAILED pyarrow/tests/test_array.py::test_array_from_numpy_str_utf8 - > UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: > code point not in range(0x110000) {code} > -- This message was sent by Atlassian Jira (v8.20.1#820001)