[
https://issues.apache.org/jira/browse/PARQUET-226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14394773#comment-14394773
]
Boris Peltekov commented on PARQUET-226:
----------------------------------------
I decided to do it by exposing ParquetProperties to the user. The class was
already public, so I only had to change one static method and its
corresponding invocations.
It also turned out that ParquetWriter has far too many constructors, so I
decided to add a builder and mark the existing constructors for
deprecation.
Here is the pull request:
https://github.com/apache/incubator-parquet-mr/pull/157
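For readers who have not opened the pull request, the general shape of the
approach (a builder that accepts a properties object from outside, next to a
deprecated constructor) might look roughly like the sketch below. This is not
the code from the pull request. SketchWriter and SketchProperties are
hypothetical stand-ins for ParquetWriter and ParquetProperties, and the method
names, the default block size, and the encoding strings are assumptions made
only for illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a properties object that decides the encoding
// per column. It is NOT the real ParquetProperties API.
class SketchProperties {
    private final Map<String, String> encodingByColumn = new HashMap<>();
    private final String defaultEncoding;

    SketchProperties(String defaultEncoding) {
        this.defaultEncoding = defaultEncoding;
    }

    // Registers an explicit encoding for one column and returns this for chaining.
    SketchProperties withEncoding(String column, String encoding) {
        encodingByColumn.put(column, encoding);
        return this;
    }

    // Non-final on purpose: a subclass may override the per-column choice.
    String encodingFor(String column) {
        return encodingByColumn.getOrDefault(column, defaultEncoding);
    }
}

// Hypothetical stand-in for a writer with a builder and a deprecated constructor.
class SketchWriter {
    private final String path;
    private final int blockSize;
    private final SketchProperties properties;

    /** @deprecated use {@link #builder(String)} instead. */
    @Deprecated
    SketchWriter(String path, int blockSize) {
        this(path, blockSize, new SketchProperties("PLAIN"));
    }

    // The writer never creates its own policy here; it accepts the instance from outside.
    private SketchWriter(String path, int blockSize, SketchProperties properties) {
        this.path = path;
        this.blockSize = blockSize;
        this.properties = properties;
    }

    static Builder builder(String path) {
        return new Builder(path);
    }

    String path() { return path; }

    int blockSize() { return blockSize; }

    String encodingFor(String column) { return properties.encodingFor(column); }

    static final class Builder {
        private final String path;
        private int blockSize = 128 * 1024 * 1024; // assumed default, illustration only
        private SketchProperties properties = new SketchProperties("PLAIN");

        private Builder(String path) {
            this.path = path;
        }

        Builder withBlockSize(int blockSize) {
            this.blockSize = blockSize;
            return this;
        }

        Builder withProperties(SketchProperties properties) {
            this.properties = properties;
            return this;
        }

        SketchWriter build() {
            return new SketchWriter(path, blockSize, properties);
        }
    }
}
```

With a shape like this, a caller would pick encodings by passing a configured
(or subclassed) properties object to the builder, for example
SketchWriter.builder("out.parquet").withProperties(props).build(), instead of
resorting to reflection.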
> Introduce an interface for controlling the encoding per column
> --------------------------------------------------------------
>
> Key: PARQUET-226
> URL: https://issues.apache.org/jira/browse/PARQUET-226
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-mr
> Reporter: Boris Peltekov
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> There are use-cases in which it is advantageous to have control over the type
> of encoding used for a given column. To achieve this, the ParquetProperties
> class must be refactored to facilitate its subclassing. The constructors of
> ParquetWriter and (probably) InternalParquetRecordWriter have to be modified
> as well. All the internal parquet classes have to refrain from directly
> instantiating ParquetProperties and should accept the instance from outside.
> Currently my team and I rely on a good amount of reflection and
> unenforceable assumptions in order to gain control of this behavior. I'd like
> to implement the interface change. Are you interested in this feature?