Joris Van den Bossche created ARROW-8314:
--------------------------------------------
Summary: [Python] Provide a method to select a subset of columns
of a Table
Key: ARROW-8314
URL: https://issues.apache.org/jira/browse/ARROW-8314
Project: Apache Arrow
Issue Type: New Feature
Components: Python
Reporter: Joris Van den Bossche
I looked through the open issues and in our API, but didn't directly find
something about selecting a subset of columns of a table.
Assume you have a table like:
{code}
table = pa.table({'a': [1, 2], 'b': [.1, .2], 'c': ['a', 'b']})
{code}
You can select a single column with {{table.column('a')}} or {{table['a']}} to
get a chunked array. You can add, append, remove and replace columns (with
{{add_column}}, {{append_column}}, {{remove_column}}, {{set_column}}).
But an easy way to get a subset of the columns (without the manuall removing
the ones you don't want one by one) doesn't seem possible.
I would propose something like:
{code}
table.select(['a', 'c'])
{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)