[jira] [Commented] (ARROW-10297) [Rust] Parameter for parquet-read to output data in json format

2021-01-31 Thread Chao Sun (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17275969#comment-17275969
 ] 

Chao Sun commented on ARROW-10297:
--

[~andygrove] could you add [~manojkarthick] as an Arrow contributor and assign 
this to him? Thanks!

> [Rust] Parameter for parquet-read to output data in json format
> ---
>
> Key: ARROW-10297
> URL: https://issues.apache.org/jira/browse/ARROW-10297
> Project: Apache Arrow
> Issue Type: New Feature
> Components: Rust
> Reporter: Jörn Horstmann
> Priority: Minor
> Labels: pull-request-available
> Fix For: 4.0.0
>
> Time Spent: 6h 10m
> Remaining Estimate: 0h
>
> When analyzing data-related issues, I found it really helpful to filter or 
> postprocess the contents of Parquet files on the command line using jq 
> (https://stedolan.github.io/jq/manual/).
> Currently the output of parquet-read is in a custom JSON-like format. I 
> propose adding an optional flag that outputs the contents as JSON using the 
> serde_json library. This should probably be behind a feature gate to avoid 
> adding the dependency for everyone.
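
For illustration only, here is a minimal sketch of the kind of output the issue asks for: one JSON object per row, which jq can consume line by line. It uses only the Rust standard library so that it stands alone; the issue itself proposes serde_json, which would replace the hand-written escaping below. The Value enum and the row_to_json_line function are hypothetical names made up for this sketch and are not part of the parquet crate.

```rust
use std::fmt::Write;

/// Hypothetical stand-in for a field value in a row (not the parquet crate's type).
#[derive(Debug, Clone, PartialEq)]
pub enum Value {
    Null,
    Bool(bool),
    Int(i64),
    Str(String),
}

/// Append `s` to `out` as a JSON string literal, escaping quotes,
/// backslashes, and control characters.
fn escape_json(s: &str, out: &mut String) {
    out.push('"');
    for c in s.chars() {
        match c {
            '"' => out.push_str("\\\""),
            '\\' => out.push_str("\\\\"),
            '\n' => out.push_str("\\n"),
            '\r' => out.push_str("\\r"),
            '\t' => out.push_str("\\t"),
            c if (c as u32) < 0x20 => {
                // Remaining control characters use the \uXXXX form.
                write!(out, "\\u{:04x}", c as u32).unwrap();
            }
            c => out.push(c),
        }
    }
    out.push('"');
}

/// Serialize one row, given as (column name, value) pairs, into a single
/// JSON object on one line.
pub fn row_to_json_line(row: &[(&str, Value)]) -> String {
    let mut out = String::from("{");
    for (i, (name, value)) in row.iter().enumerate() {
        if i > 0 {
            out.push(',');
        }
        escape_json(name, &mut out);
        out.push(':');
        match value {
            Value::Null => out.push_str("null"),
            Value::Bool(b) => out.push_str(if *b { "true" } else { "false" }),
            Value::Int(n) => out.push_str(&n.to_string()),
            Value::Str(s) => escape_json(s, &mut out),
        }
    }
    out.push('}');
    out
}
```

Printing the result of row_to_json_line for each row would give newline-delimited JSON that can be piped straight into jq. In a real implementation the serde_json dependency would presumably be declared optional in Cargo.toml and the JSON code path compiled only when the corresponding feature is enabled, as the description suggests.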



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-10297) [Rust] Parameter for parquet-read to output data in json format

2021-01-24 Thread Manoj Karthick (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17270909#comment-17270909
 ] 

Manoj Karthick commented on ARROW-10297:


Thank you!

> [Rust] Parameter for parquet-read to output data in json format
> ---
>
> Key: ARROW-10297
> URL: https://issues.apache.org/jira/browse/ARROW-10297
> Project: Apache Arrow
> Issue Type: New Feature
> Components: Rust
> Reporter: Jörn Horstmann
> Priority: Minor
> Labels: pull-request-available
> Time Spent: 3h
> Remaining Estimate: 0h





[jira] [Commented] (ARROW-10297) [Rust] Parameter for parquet-read to output data in json format

2021-01-24 Thread Jörn Horstmann (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17270906#comment-17270906
 ] 

Jörn Horstmann commented on ARROW-10297:


[~manojkarthick] Just saw you already opened a PR. I'm totally fine with you 
taking over, as I didn't have any time to work on this topic. Thanks!

> [Rust] Parameter for parquet-read to output data in json format
> ---
>
> Key: ARROW-10297
> URL: https://issues.apache.org/jira/browse/ARROW-10297
> Project: Apache Arrow
> Issue Type: New Feature
> Components: Rust
> Reporter: Jörn Horstmann
> Assignee: Jörn Horstmann
> Priority: Minor
> Labels: pull-request-available
> Time Spent: 2h 50m
> Remaining Estimate: 0h





[jira] [Commented] (ARROW-10297) [Rust] Parameter for parquet-read to output data in json format

2021-01-23 Thread Manoj Karthick (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17270671#comment-17270671
 ] 

Manoj Karthick commented on ARROW-10297:


[~jhorstmann] - This feature would be very useful to me as well. Would it 
be okay if I worked on this, building on your changes in [PR 
#8686|https://github.com/apache/arrow/pull/8686] and incorporating the review 
comments from that PR?

> [Rust] Parameter for parquet-read to output data in json format
> ---
>
> Key: ARROW-10297
> URL: https://issues.apache.org/jira/browse/ARROW-10297
> Project: Apache Arrow
> Issue Type: New Feature
> Components: Rust
> Reporter: Jörn Horstmann
> Assignee: Jörn Horstmann
> Priority: Minor
> Labels: pull-request-available
> Time Spent: 2h 20m
> Remaining Estimate: 0h


