[ 
https://issues.apache.org/jira/browse/ARROW-4311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17661334#comment-17661334
 ] 

Rok Mihevc commented on ARROW-4311:
-----------------------------------

This issue has been migrated to [issue 
#15944|https://github.com/apache/arrow/issues/15944] on GitHub. Please see the 
[migration documentation|https://github.com/apache/arrow/issues/14542] for 
further details.

> [Python] Regression on pq.ParquetWriter incorrectly handling source string
> --------------------------------------------------------------------------
>
>                 Key: ARROW-4311
>                 URL: https://issues.apache.org/jira/browse/ARROW-4311
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 0.12.0
>            Reporter: Francisco Sanchez
>            Assignee: Antoine Pitrou
>            Priority: Major
>             Fix For: 0.13.0
>
>
> In the latest changes to filesystem.py some new functions have been added to 
> check the source string when calling pq.ParquetWriter. With the current 
> implementation some assumptions are done about the format of the string which 
> means that if the string is provided following some of these patterns it will 
> be automatically split/formatted and changed to something else.
> To give you a specific example, if I provide a string like 
> {{directory/level1#level2.parquet}} it will be written to disk as 
> {{directory/level1}}. The behaviour has changed on 0.12.0 from 0.11.1 and 
> nothing is stated in the documentation.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to