[jira] [Updated] (ARROW-16777) [R] printing data in Table/RecordBatch print method

2022-06-30 Thread Jonathan Keane (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-16777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Keane updated ARROW-16777:
---
Fix Version/s: (was: 9.0.0)

> [R] printing data in Table/RecordBatch print method
> ---
>
> Key: ARROW-16777
> URL: https://issues.apache.org/jira/browse/ARROW-16777
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python, R
>Reporter: Thomas Mock
>Priority: Minor
>
> Related to ARROW-16776 but after a brief discussion with Neal Richardson, he 
> requested that I split the improvement request into separate issues.
> When working with Arrow datasets/tables, I often find myself wanting to 
> interactively print or "see" the results of a query or the first few rows of 
> the data without having to fully collect into memory. 
> It would be ideal to lazily print some data with Table/RecordBatch print 
> methods, however, currently, the print methods return schema without data. 
> IE:
> ``` r
> library(dplyr)
> library(arrow)
> mtcars %>% arrow::write_parquet("mtcars.parquet")
> car_ds <- arrow::open_dataset("mtcars.parquet")
> car_ds
> #> FileSystemDataset with 1 Parquet file
> #> mpg: double
> #> cyl: double
> #> disp: double
> #> hp: double
> #> drat: double
> #> wt: double
> #> qsec: double
> #> vs: double
> #> am: double
> #> gear: double
> #> carb: double
> #> 
> #> See $metadata for additional Schema metadata
> car_ds %>%
>   compute()
> #> Table
> #> 32 rows x 11 columns
> #> $mpg 
> #> $cyl 
> #> $disp 
> #> $hp 
> #> $drat 
> #> $wt 
> #> $qsec 
> #> $vs 
> #> $am 
> #> $gear 
> #> $carb 
> #> 
> #> See $metadata for additional Schema metadata
> ```



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (ARROW-16777) [R] printing data in Table/RecordBatch print method

2022-06-07 Thread Neal Richardson (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-16777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Neal Richardson updated ARROW-16777:

Summary: [R] printing data in Table/RecordBatch print method  (was: 
printing data in Table/RecordBatch print method)

> [R] printing data in Table/RecordBatch print method
> ---
>
> Key: ARROW-16777
> URL: https://issues.apache.org/jira/browse/ARROW-16777
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python, R
>Reporter: Thomas Mock
>Priority: Minor
> Fix For: 9.0.0
>
>
> Related to ARROW-16776 but after a brief discussion with Neal Richardson, he 
> requested that I split the improvement request into separate issues.
> When working with Arrow datasets/tables, I often find myself wanting to 
> interactively print or "see" the results of a query or the first few rows of 
> the data without having to fully collect into memory. 
> It would be ideal to lazily print some data with Table/RecordBatch print 
> methods, however, currently, the print methods return schema without data. 
> IE:
> ``` r
> library(dplyr)
> library(arrow)
> mtcars %>% arrow::write_parquet("mtcars.parquet")
> car_ds <- arrow::open_dataset("mtcars.parquet")
> car_ds
> #> FileSystemDataset with 1 Parquet file
> #> mpg: double
> #> cyl: double
> #> disp: double
> #> hp: double
> #> drat: double
> #> wt: double
> #> qsec: double
> #> vs: double
> #> am: double
> #> gear: double
> #> carb: double
> #> 
> #> See $metadata for additional Schema metadata
> car_ds %>%
>   compute()
> #> Table
> #> 32 rows x 11 columns
> #> $mpg 
> #> $cyl 
> #> $disp 
> #> $hp 
> #> $drat 
> #> $wt 
> #> $qsec 
> #> $vs 
> #> $am 
> #> $gear 
> #> $carb 
> #> 
> #> See $metadata for additional Schema metadata
> ```



--
This message was sent by Atlassian Jira
(v8.20.7#820007)