[ https://issues.apache.org/jira/browse/ARROW-6325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joris Van den Bossche updated ARROW-6325: ----------------------------------------- Description: >From https://github.com/pandas-dev/pandas/issues/28090 {code} In [19]: df = pd.DataFrame(np.ones((3, 2), dtype=bool), columns=['a', 'b']) In [20]: df Out[20]: a b 0 True True 1 True True 2 True True In [21]: table = pa.table(df) In [23]: table.column(0) Out[23]: <pyarrow.lib.ChunkedArray object at 0x7fd08a96e090> [ [ true, false, false, ] ] {code} The resulting table has False values while the original DataFrame had only true values. It seems this has to do with the fact that it are multiple columns, as with a single column it converts correctly. was: >From https://github.com/pandas-dev/pandas/issues/28090 {code} In [19]: df = pd.DataFrame(np.ones((5, 2), dtype=bool), columns=['a', 'b']) In [20]: df Out[20]: a b 0 True True 1 True True 2 True True 3 True True 4 True True In [21]: table = pa.table(df) In [23]: table.column(0) Out[23]: <pyarrow.lib.ChunkedArray object at 0x7fd08a96e090> [ [ true, false, false, false, false ] ] {code} The resulting table has False values while the original DataFrame had only true values. It seems this has to do with the fact that it are multiple columns, as with a single column it converts correctly. > [Python] wrong conversion of DataFrame with boolean values > ---------------------------------------------------------- > > Key: ARROW-6325 > URL: https://issues.apache.org/jira/browse/ARROW-6325 > Project: Apache Arrow > Issue Type: Bug > Components: Python > Affects Versions: 0.14.1 > Reporter: Joris Van den Bossche > Priority: Major > Fix For: 0.15.0 > > > From https://github.com/pandas-dev/pandas/issues/28090 > {code} > In [19]: df = pd.DataFrame(np.ones((3, 2), dtype=bool), columns=['a', 'b']) > In [20]: df > Out[20]: > a b > 0 True True > 1 True True > 2 True True > In [21]: table = pa.table(df) > In [23]: table.column(0) > Out[23]: > <pyarrow.lib.ChunkedArray object at 0x7fd08a96e090> > [ > [ > true, > false, > false, > ] > ] > {code} > The resulting table has False values while the original DataFrame had only > true values. > It seems this has to do with the fact that it are multiple columns, as with a > single column it converts correctly. -- This message was sent by Atlassian Jira (v8.3.2#803003)