[jira] [Updated] (ARROW-18001) [Python] Provide a way to specify the type of a subset of columns for from_pandas

Alenka Frim (Jira) Wed, 12 Oct 2022 06:28:04 -0700


     [ 
https://issues.apache.org/jira/browse/ARROW-18001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Alenka Frim updated ARROW-18001:
--------------------------------
    Summary: [Python] Provide a way to specify the type of a subset of columns 
for from_pandas  (was: [Python] parquet.write_table/parquet.ParquetWriter 
should except a subset of columns)

> [Python] Provide a way to specify the type of a subset of columns for 
> from_pandas
> ---------------------------------------------------------------------------------
>
>                 Key: ARROW-18001
>                 URL: https://issues.apache.org/jira/browse/ARROW-18001
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Alenka Frim
>            Priority: Major
>
> This question came up in the GitHub issue: 
> [https://github.com/apache/arrow/issues/14025] and it would be a good 
> improvement to the Parquet part of PyArrow. Haven't found any existing issue 
> and so created a new one.
> h6. Description:
> If a user wants to change a type of one single column when using 
> {{{}parquet.write_table{}}}/{{{}parquet.ParquetWriter{}}} they currently need 
> to specify the schema with all columns included. If a column is not specified 
> in the schema, it will not be included in the parquet file.
> h6. Proposal
> There should be a possibility for {{parquet.ParquetWriter}} excepting a 
> subset of columns in a Schema and infer everything else.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (ARROW-18001) [Python] Provide a way to specify the type of a subset of columns for from_pandas

Reply via email to