[ https://issues.apache.org/jira/browse/ARROW-3428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17660451#comment-17660451 ]
Rok Mihevc commented on ARROW-3428:
-----------------------------------
This issue has been migrated to [issue
#19753|https://github.com/apache/arrow/issues/19753] on GitHub. Please see the
[migration documentation|https://github.com/apache/arrow/issues/14542] for
further details.
> [Python] from_pandas gives incorrect results when converting floating point
> to bool
> -----------------------------------------------------------------------------------
>
> Key: ARROW-3428
> URL: https://issues.apache.org/jira/browse/ARROW-3428
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Reporter: Bryan Cutler
> Assignee: Bryan Cutler
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.12.0
>
> Time Spent: 5.5h
> Remaining Estimate: 0h
>
> When converting pandas data that contains floating point values to boolean,
> Array.from_pandas gives incorrect results: every element comes back as False,
> including the nonzero values.
> {noformat}
> In [2]: import pyarrow as pa
> ...: import pandas as pd
> ...: a = [0.0, 1.0, 2.0, None, float('NaN')]
> ...:
> In [3]: s = pd.Series(a)
> In [4]: pa.Array.from_pandas(s, type=pa.bool_())
> Out[4]:
> <pyarrow.lib.BooleanArray object at 0x7f1bfd099e68>
> [
> False,
> False,
> False,
> False,
> False
> ]
> {noformat}
> The expected output is True wherever the value != 0.
> This originated from SPARK-25461.
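
For reference, the expected truth values for the input above can be sketched in plain Python. This is only an illustration of the "True when value != 0" rule stated in the report, not the pyarrow implementation; the helper name expected_bool is hypothetical, and mapping None and NaN to a null result is an assumption (the report does not say how missing values should convert).

```python
import math


def expected_bool(values):
    """Hypothetical helper: truth value each float should map to."""
    result = []
    for v in values:
        if v is None or math.isnan(v):
            # Assumption: missing values (None / NaN) stay null.
            result.append(None)
        else:
            # Rule from the report: True when value != 0.
            result.append(v != 0)
    return result


a = [0.0, 1.0, 2.0, None, float('NaN')]
expected = expected_bool(a)
print(expected)  # [False, True, True, None, None]
```

Under these assumptions, only the first element should be False, whereas the output reported above is False for every element.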