paleolimbot commented on PR #13635: URL: https://github.com/apache/arrow/pull/13635#issuecomment-1187641603
I don't think this works yet, or at least it doesn't reduce the amount of time it takes to read a big CSV. I'm using this as my reprex to test. I don't understand a lot about the content of this PR, so feel free to change everything about it!

``` r
library(arrow, warn.conflicts = FALSE)
#> Some features are not enabled in this build of Arrow. Run `arrow_info()` for more information.

tf <- tempfile()
readr::write_csv(vctrs::vec_rep(mtcars, 5e5), tf)

# try to slow down CSV reading
set_cpu_count(1)
set_io_thread_count(2)

# hit Control-C while this line runs!
system.time(read_csv_arrow(tf))
#>    user  system elapsed 
#>   2.785   0.337   3.220
```

<sup>Created on 2022-07-18 by the [reprex package](https://reprex.tidyverse.org) (v2.0.1)</sup>

I think the overriding calling handler is being called, or at least I can do stuff there that causes R to crash (but breakpoints and printing don't seem to work there, maybe because of the interrupt?).
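
Since breakpoints and printing are unreliable inside the handler, one way to check whether an interrupt calling handler runs at all is to have it set a flag and inspect the flag afterwards. The following is only a minimal base-R sketch of that idea, not the mechanism this PR implements; `with_interrupt_flag()` is a hypothetical helper and is not part of arrow.

``` r
# Hypothetical helper (not part of arrow): evaluates `expr` and reports
# whether a calling handler saw an interrupt condition while it ran.
with_interrupt_flag <- function(expr) {
  seen <- FALSE
  result <- tryCatch(
    withCallingHandlers(
      expr,
      # Calling handler: runs at the point the interrupt is signalled,
      # records it, then returns so outer handlers can run.
      interrupt = function(cnd) {
        seen <<- TRUE
      }
    ),
    # Exiting handler: unwinds the stack so the interrupt does not
    # propagate to top level; the result is NULL in that case.
    interrupt = function(cnd) NULL
  )
  list(result = result, interrupted = seen)
}
```

With the reprex above, this could be called as `with_interrupt_flag(read_csv_arrow(tf))` while pressing Control-C; `interrupted` being `TRUE` would suggest the R-level handler ran, although it says nothing about whether the underlying C++ read was actually cancelled.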
