Lucas Mation created ARROW-18242:
------------------------------------

             Summary: [R] arrow implementation of lubridate::dmy parses invalid 
date "00001976" as date
                 Key: ARROW-18242
                 URL: https://issues.apache.org/jira/browse/ARROW-18242
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: Lucas Mation


Sorry for so many issues, but I think this is another bug.

Wrong behavior of the arrow implementation of the  `lubridate::dmy`.

An invalid date such as '00001976' is being parsed as a valid (and completely 
unrelated) date.

#in R
'00001976' %>% dmy
[1] NA
Warning message:
  All formats failed to parse. No formats found. 

#In arrow
q <- data.table(x=c('00001976','30111976','01011976'))
q %>% write_dataset(paste0(p2,'/q'))
q2 <- paste0(p2,'/q') %>% open_dataset %>% mutate(x2=dmy(x)) %>% collect
q2
x
1: 1975-11-30
2: 1976-11-30
3: 1976-01-01
#notice '00001976' is an invalid date. First row of x2 should be NA!!!

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to