tdhock commented on PR #36304:
URL: https://github.com/apache/arrow/pull/36304#issuecomment-1608564686

   hi @paleolimbot thanks writing a PR that partially addresses the issue I 
created. 
   To fully address that issue, can you please add documentation to (1) clarify 
the difference between IO threads and CPU threads, and (2) to explain which 
function should be called to control the CSV reading operation?
   For (1) a link to the C++ doc web page 
https://arrow.apache.org/docs/cpp/threading.html would be very helpful.
   Could a link to that page be added on the R man pages for arrow::cpu_count 
and arrow::io_thread_count?
   For (2) I would have expected some mention of how to control number of 
threads used for CSV reading on the man page for read_csv_arrow, but there is 
no mention of threads on that man page. Something like "use 
arrow::set_cpu_count(N_CPUS) to tell arrow to use N_CPUS for reading the CSV 
file" on that man page would be useful.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to