nealrichardson commented on pull request #11405:
URL: https://github.com/apache/arrow/pull/11405#issuecomment-942736149


   > Looks good to me. I don't think it covers pathological cases like 
`airquality %>% arrange(Temp) %>% head(10) %>% arrange(Wind)` or `airquality 
%>% head(10) %>% tail(5)` but I'm pretty sure that's a given for our current 
dplyr implementation anyways.
   
   Those do work, though internally they can't be done in a single ExecPlan:
   
   ```
   Table$create(airquality) %>% 
     arrange(Temp) %>% 
     head(10) %>% 
     arrange(Wind) %>%
     collect()
     
      Ozone Solar.R Wind Temp Month Day
   1     NA      NA  8.0   57     5  27
   2      1       8  9.7   59     5  21
   3      4      25  9.7   61     5  23
   4     32      92 12.0   61     5  24
   5     18      65 13.2   58     5  15
   6     19      99 13.8   59     5   8
   7     NA      NA 14.3   56     5   5
   8     NA     266 14.9   58     5  26
   9     NA      66 16.6   57     5  25
   10     6      78 18.4   57     5  18
   
   
   Table$create(airquality) %>% 
     head(10) %>% 
     tail(5) %>%
     collect()
   
     Ozone Solar.R Wind Temp Month Day
   1    28      NA 14.9   66     5   6
   2    23     299  8.6   65     5   7
   3    19      99 13.8   59     5   8
   4     8      19 20.1   61     5   9
   5    NA     194  8.6   69     5  10
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to