Victor Jimenez created ARROW-1999: ------------------------------------- Summary: [Python] from_numpy_dtype returns wrong types Key: ARROW-1999 URL: https://issues.apache.org/jira/browse/ARROW-1999 Project: Apache Arrow Issue Type: Bug Components: Python Affects Versions: 0.8.0 Environment: Windows 10 Build 15063.850 Numpy: 1.14.0 Reporter: Victor Jimenez
The following code shows multiple issues when using {{from_numpy_dtype}}: {code} import numpy as np import pyarrow as pa pa.from_numpy_dtype(np.unicode) # returns DataType(bool) pa.from_numpy_dtype(np.int) # returns DataType(bool) pa.from_numpy_dtype(np.int64) # Fails with the following message: # # ArrowNotImplementedError Traceback (most recent call last) # <ipython-input-14-ca0855a7dda8> in <module>() # ----> 1 pa.from_numpy_dtype(np.int64) # 2 # # types.pxi in pyarrow.lib.from_numpy_dtype() # # error.pxi in pyarrow.lib.check_status() # # ArrowNotImplementedError: Unsupported numpy type 32760 {code} Additionally, a potentially related issue is also seen when using {{to_pandas_dtype}}: {code} pa.DataType.to_pandas_dtype(pa.string()) # Returns numpy.object_ # (shouldn't it be numpy.unicode?) {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)