[ 
https://issues.apache.org/jira/browse/ARROW-11324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17332349#comment-17332349
 ] 

Andrew Lamb commented on ARROW-11324:
-------------------------------------

Migrated to github: https://github.com/apache/arrow-datafusion/issues/153

> [Rust] Querying datetime data in DataFusion with an embedded timezone always 
> fails
> ----------------------------------------------------------------------------------
>
>                 Key: ARROW-11324
>                 URL: https://issues.apache.org/jira/browse/ARROW-11324
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Rust - DataFusion
>            Reporter: Max Burke
>            Priority: Blocker
>         Attachments: 0100c909-2537-c4dc-ce1d-1b7a75d613e8.parquet
>
>
> We have a number (~ hundreds of thousands) of Parquet files that have 
> embedded Arrow schemas in them that have time-valued columns with the type 
> DateTime(TimeUnit::Nanosecond, Some("UTC")).
>  
> One of the changes in the Arrow 2 -> 3 working window was to make the Parquet 
> loader prefer the Arrow schema compared to the one generated from the 
> columns. 
>  
> But because DataFusion has the timezone field of the DateTime variant 
> hardcoded as None, we can't load any of our data after this upgrade; we get 
> errors like:
> {{SELECT * FROM parquet_table WHERE ("timestamp" >= 
> to_timestamp('2010-03-24T13:00:00.000000Z') AND "timestamp" <= 
> to_timestamp('2010-03-25T00:00:00.000000Z')) ORDER BY timestamp ASC NULLS 
> LAST;}}
> {{Plan("\'Timestamp(Nanosecond, Some(\"UTC\")) >= Timestamp(Nanosecond, 
> None)\' can\'t be evaluated because there isn\'t a common type to coerce the 
> types to")}}
>  
> Any ideas/thoughts? 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to