adamreeve commented on issue #46930:
URL: https://github.com/apache/arrow/issues/46930#issuecomment-3017651383

   Hi, I just wanted to add that I'm also interested in this and a while ago I 
made a branch that implemented this in order to do some benchmarking, but 
haven't made any further progress: 
https://github.com/apache/arrow/compare/apache-arrow-19.0.0...adamreeve:arrow:s3-crt
   
   Unfortunately it's not super straightforward to add this feature because it 
uses the separate "CRT" S3 library, which has nearly the exact same API as the 
standard S3 library but has different configuration options, and doesn't share 
any base classes to allow easily abstracting over the different libraries. It 
might be possible for Arrow to completely switch to the CRT library, but from 
what I understand this has very different performance characteristics and can 
cause performance regressions in some cases, so we might want to make use of 
the CRT library optional.
   
   In my branch I just copy-pasted most of the S3 file system implementation to 
get something working, but if this is something we want to add to Arrow without 
switching completely to the CRT library, we'd want to come up with a better 
approach.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to