[jira] [Commented] (ARROW-10297) [Rust] Parameter for parquet-read to output data in json format
[ https://issues.apache.org/jira/browse/ARROW-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17275969#comment-17275969 ] Chao Sun commented on ARROW-10297: -- [~andygrove] could you add [~manojkarthick] as Arrow contributor and assign this to him? Thanks! > [Rust] Parameter for parquet-read to output data in json format > --- > > Key: ARROW-10297 > URL: https://issues.apache.org/jira/browse/ARROW-10297 > Project: Apache Arrow > Issue Type: New Feature > Components: Rust >Reporter: Jörn Horstmann >Priority: Minor > Labels: pull-request-available > Fix For: 4.0.0 > > Time Spent: 6h 10m > Remaining Estimate: 0h > > When analyzing data related issues I found it really helpful to filter or > portprocess the contents of parquet files on the command line using jq > (https://stedolan.github.io/jq/manual/). > Currently the output of parquet-read is in a custom json-like format, I > propose to add an optional flag that outputs the contents as json using the > serde_json library. This should probably be behind a feature gate to avoid > adding the dependency for everyone. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Commented] (ARROW-10297) [Rust] Parameter for parquet-read to output data in json format
[ https://issues.apache.org/jira/browse/ARROW-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17270909#comment-17270909 ] Manoj Karthick commented on ARROW-10297: Thank you! > [Rust] Parameter for parquet-read to output data in json format > --- > > Key: ARROW-10297 > URL: https://issues.apache.org/jira/browse/ARROW-10297 > Project: Apache Arrow > Issue Type: New Feature > Components: Rust >Reporter: Jörn Horstmann >Priority: Minor > Labels: pull-request-available > Time Spent: 3h > Remaining Estimate: 0h > > When analyzing data related issues I found it really helpful to filter or > portprocess the contents of parquet files on the command line using jq > (https://stedolan.github.io/jq/manual/). > Currently the output of parquet-read is in a custom json-like format, I > propose to add an optional flag that outputs the contents as json using the > serde_json library. This should probably be behind a feature gate to avoid > adding the dependency for everyone. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Commented] (ARROW-10297) [Rust] Parameter for parquet-read to output data in json format
[ https://issues.apache.org/jira/browse/ARROW-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17270906#comment-17270906 ] Jörn Horstmann commented on ARROW-10297: [~manojkarthick] Just saw you already opened a PR. I'm totally fine with you taking over as I didn't have any time working on this topic. Thanks! > [Rust] Parameter for parquet-read to output data in json format > --- > > Key: ARROW-10297 > URL: https://issues.apache.org/jira/browse/ARROW-10297 > Project: Apache Arrow > Issue Type: New Feature > Components: Rust >Reporter: Jörn Horstmann >Assignee: Jörn Horstmann >Priority: Minor > Labels: pull-request-available > Time Spent: 2h 50m > Remaining Estimate: 0h > > When analyzing data related issues I found it really helpful to filter or > portprocess the contents of parquet files on the command line using jq > (https://stedolan.github.io/jq/manual/). > Currently the output of parquet-read is in a custom json-like format, I > propose to add an optional flag that outputs the contents as json using the > serde_json library. This should probably be behind a feature gate to avoid > adding the dependency for everyone. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Commented] (ARROW-10297) [Rust] Parameter for parquet-read to output data in json format
[ https://issues.apache.org/jira/browse/ARROW-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17270671#comment-17270671 ] Manoj Karthick commented on ARROW-10297: [~jhorstmann] - This feature would be very much useful for me as well. Would it be okay if I worked on this based off your changes on [PR #8686|https://github.com/apache/arrow/pull/8686] and incorporated the comments in that PR? > [Rust] Parameter for parquet-read to output data in json format > --- > > Key: ARROW-10297 > URL: https://issues.apache.org/jira/browse/ARROW-10297 > Project: Apache Arrow > Issue Type: New Feature > Components: Rust >Reporter: Jörn Horstmann >Assignee: Jörn Horstmann >Priority: Minor > Labels: pull-request-available > Time Spent: 2h 20m > Remaining Estimate: 0h > > When analyzing data related issues I found it really helpful to filter or > portprocess the contents of parquet files on the command line using jq > (https://stedolan.github.io/jq/manual/). > Currently the output of parquet-read is in a custom json-like format, I > propose to add an optional flag that outputs the contents as json using the > serde_json library. This should probably be behind a feature gate to avoid > adding the dependency for everyone. -- This message was sent by Atlassian Jira (v8.3.4#803005)