alamb opened a new issue, #6653: URL: https://github.com/apache/arrow-datafusion/issues/6653
### Is your feature request related to a problem or challenge?

Prior to https://github.com/apache/arrow-datafusion/pull/6632 and https://github.com/apache/arrow-datafusion/pull/6638, the `date_trunc` function was inconsistent between its reported types and the actual types it produced for arrays/scalars. It is now consistent in the sense that

```
date_trunc(Timestamp(Second, ..))      --> Timestamp(Nanosecond, None)
date_trunc(Timestamp(Millisecond, ..)) --> Timestamp(Nanosecond, None)
date_trunc(Timestamp(Microsecond, ..)) --> Timestamp(Nanosecond, None)
date_trunc(Timestamp(Nanosecond, ..))  --> Timestamp(Nanosecond, None)
```

However, using `Timestamp(Nanosecond, None)` everywhere restricts the range over which `date_trunc` works: if a value can be represented as a `Timestamp(Second)` but does not fit into a `Timestamp(Nanosecond)` -- for example a value before `1677-09-21 00:12:43.145224193` or after `2262-04-11 23:47:16.854775807` -- `date_trunc` will error.

### Describe the solution you'd like

We could potentially change the function to keep the same `TimeUnit` as its input, namely

```
date_trunc(Timestamp(Second, ..))      --> Timestamp(Second, None)
date_trunc(Timestamp(Millisecond, ..)) --> Timestamp(Millisecond, None)
date_trunc(Timestamp(Microsecond, ..)) --> Timestamp(Microsecond, None)
date_trunc(Timestamp(Nanosecond, ..))  --> Timestamp(Nanosecond, None)
```

### Describe alternatives you've considered

We could also just leave the current behavior.

### Additional context

See the comments in https://github.com/apache/arrow-datafusion/pull/6632#issuecomment-1587335750
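To illustrate the proposed behavior, here is a minimal sketch of a return-type mapping that preserves the input `TimeUnit` and drops the timezone. The function name `date_trunc_return_type` is hypothetical and not DataFusion's actual signature; this only shows the intended `TimeUnit` mapping using the arrow `DataType` enum.

```rust
use arrow::datatypes::{DataType, TimeUnit};

/// Hypothetical helper (not DataFusion's actual API): compute the return type
/// of `date_trunc` so the output keeps the same `TimeUnit` as the input.
fn date_trunc_return_type(input: &DataType) -> Result<DataType, String> {
    match input {
        // Preserve the input resolution; drop the timezone, matching the
        // `None` timezone in the mapping above.
        DataType::Timestamp(unit, _tz) => Ok(DataType::Timestamp(*unit, None)),
        other => Err(format!("date_trunc does not support type {other:?}")),
    }
}

fn main() {
    // date_trunc(Timestamp(Second, ..)) --> Timestamp(Second, None), etc.
    for unit in [
        TimeUnit::Second,
        TimeUnit::Millisecond,
        TimeUnit::Microsecond,
        TimeUnit::Nanosecond,
    ] {
        let input = DataType::Timestamp(unit, None);
        println!("{:?} -> {:?}", input, date_trunc_return_type(&input));
    }
}
```

With this mapping, a second-resolution timestamp outside the nanosecond range would no longer need to be cast to `Timestamp(Nanosecond, None)` and so would not error.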
