[ 
https://issues.apache.org/jira/browse/ARROW-6687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andy Grove updated ARROW-6687:
------------------------------
    Reporter: Adam Lippai  (was: Andy Grove)

> [Rust] [DataFusion] Query returns incorrect row count
> -----------------------------------------------------
>
>                 Key: ARROW-6687
>                 URL: https://issues.apache.org/jira/browse/ARROW-6687
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Rust, Rust - DataFusion
>    Affects Versions: 0.15.0
>            Reporter: Adam Lippai
>            Priority: Major
>
> I received this bug report directly via email:
>  
> Hi,
>  
> I've just tried out the SQL interface on the master branch of the Arrow 
> library with a Parquet file generated by pyarrow 0.14.1 and pandas 0.25.1.
>  
> It returns an incorrect num_rows for my file (~3000 columns x 2456 rows): 
> the reported value is the batch size, 1024*1024, instead of 2456. The query 
> is a simple SELECT col FROM data, run with the sample code you've created, 
> which works for the test file in the arrow testing repo.
>  
> Sorry for reporting the issue via mail; it was faster and easier this way. 
>  
> I'm super happy and grateful that you decided to add parquet support. This is 
> an awesome project, keep up the good work!
>  
> Best regards,
> Adam Lippai


