[ https://issues.apache.org/jira/browse/ARROW-13074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17375893#comment-17375893 ]
Joris Van den Bossche commented on ARROW-13074: ----------------------------------------------- [~ldacey] the current PR is only about the {{ParquetDataset}} class, so for reading parquet, not for writing parquet Let's keep discussion about {{partition_filename_cb}} for ARROW-12365 (the issue [~ldacey] created specifically for that topic) > [Python] Start with deprecating ParquetDataset custom attributes > ---------------------------------------------------------------- > > Key: ARROW-13074 > URL: https://issues.apache.org/jira/browse/ARROW-13074 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Reporter: Joris Van den Bossche > Assignee: Joris Van den Bossche > Priority: Major > Labels: pull-request-available > Fix For: 5.0.0 > > Time Spent: 40m > Remaining Estimate: 0h > > As a first step for ARROW-9720, we should start with deprecating > attributes/methods of {{pq.ParquetDataset}} that we would definitely not keep > / are conflicting with the "dataset API". > I am thinking of the {{pieces}} attribute (and the {{ParquetDatasetPiece}} > class), the {{partitions}} attribute (and the {{ParquetPartitions}} class). > In addition, some of the keywords are also exposed as properties (memory_map, > read_dictionary, buffer_size, fs), and could be deprecated. -- This message was sent by Atlassian Jira (v8.3.4#803005)