[ 
https://issues.apache.org/jira/browse/PARQUET-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Buck updated PARQUET-1020:
-------------------------------
    Description: 
Hello. We would like to pass in a DynamicMessage rather than use the 
generated protobuf classes, so that our job can be fully generic. 

I think this could be achieved by setting the descriptor upfront, similar to 
how ProtoParquetOutputFormat is configured today.

The init method of ProtoWriteSupport could then have ProtoSchemaConverter 
generate the Parquet schema from the passed-in descriptor, rather than from 
the generated proto class.
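
To illustrate the flow described above, here is a toy model that uses only the 
Java standard library. FieldSpec, MessageSpec and SketchWriteSupport are 
hypothetical stand-ins for a protobuf descriptor and for ProtoWriteSupport, and 
the type mapping is a simplification, not what ProtoSchemaConverter actually 
emits.

```java
import java.util.Arrays;
import java.util.List;

// Toy model only: none of these types are protobuf or parquet-mr classes.
class DynamicSchemaSketch {

    /** Hypothetical stand-in for one field of a runtime descriptor. */
    static final class FieldSpec {
        final String name;
        final String protoType; // e.g. "int64", "string"
        final boolean repeated;

        FieldSpec(String name, String protoType, boolean repeated) {
            this.name = name;
            this.protoType = protoType;
            this.repeated = repeated;
        }
    }

    /** Hypothetical stand-in for a message descriptor known only at runtime. */
    static final class MessageSpec {
        final String name;
        final List<FieldSpec> fields;

        MessageSpec(String name, List<FieldSpec> fields) {
            this.name = name;
            this.fields = fields;
        }
    }

    /** Simplified type mapping; an assumption made for this sketch. */
    static String parquetType(String protoType) {
        switch (protoType) {
            case "int32":  return "int32";
            case "int64":  return "int64";
            case "double": return "double";
            case "bool":   return "boolean";
            case "string": return "binary";
            default:
                throw new IllegalArgumentException("unsupported type: " + protoType);
        }
    }

    /** Builds a Parquet-style schema string from the descriptor alone. */
    static String toParquetSchema(MessageSpec spec) {
        StringBuilder sb = new StringBuilder("message " + spec.name + " {\n");
        for (FieldSpec f : spec.fields) {
            sb.append("  ")
              .append(f.repeated ? "repeated" : "optional").append(' ')
              .append(parquetType(f.protoType)).append(' ')
              .append(f.name).append(";\n");
        }
        return sb.append("}").toString();
    }

    /** Stand-in for a write support that receives its descriptor upfront. */
    static final class SketchWriteSupport {
        private final MessageSpec descriptor;

        SketchWriteSupport(MessageSpec descriptor) {
            this.descriptor = descriptor;
        }

        /** Derives the schema from the descriptor; no generated class is involved. */
        String init() {
            return toParquetSchema(descriptor);
        }
    }

    public static void main(String[] args) {
        MessageSpec event = new MessageSpec("Event", Arrays.asList(
                new FieldSpec("id", "int64", false),
                new FieldSpec("tags", "string", true)));
        System.out.println(new SketchWriteSupport(event).init());
    }
}
```

The point of the sketch is that init() needs only the descriptor, so the same 
job could write any message type whose descriptor is supplied at runtime.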

Would there be interest in incorporating this change? If so, does the approach 
above sound sensible? I am happy to open a pull request.

Initial PR here: https://github.com/apache/parquet-mr/pull/414


> Add support for Dynamic Messages in parquet-protobuf
> ----------------------------------------------------
>
>                 Key: PARQUET-1020
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1020
>             Project: Parquet
>          Issue Type: New Feature
>            Reporter: Alex Buck


