amoeba opened a new issue, #40986: URL: https://github.com/apache/arrow/issues/40986
### Describe the enhancement requested A user who packages Arrow for nixpkgs reached out because they had built the arrow R package for aarch64, saw `arrow_info()` report: ``` ... Runtime: SIMD Level none Detected SIMD Level none ... ``` They thought the above implied the package wasn't compiled with ASIMD/NEON support (which isn't true). Part of the confusion here is that this section is for runtime dispatch and doesn't reflect the compiler flags used. I originally filed https://github.com/apache/arrow/issues/40806 but this will get reverted since the approach affected the runtime info which was wrong. PyArrow and arrow R both provide users and developers with helpers for printing info about the package. The current output of each on an aarch64 machine is, <details> <summary>PyArrow</summary> ``` In [1]: pa.show_info() pyarrow version info -------------------- Package kind : python-wheel-macos Arrow C++ library version : 14.0.2 Arrow C++ compiler : AppleClang 15.0.0.15000040 Arrow C++ compiler flags : -Qunused-arguments -fcolor-diagnostics Arrow C++ git revision : 740889f413af9b1ae1d81eb1e5a4a9fb4ce9cf97 Arrow C++ git description : Arrow C++ build type : release Platform: OS / Arch : Darwin arm64 SIMD Level : none Detected SIMD Level : none Memory: Default backend : mimalloc Bytes allocated : 0 bytes Max memory : 0 bytes Supported Backends : mimalloc, system Optional modules: csv : Enabled cuda : - dataset : Enabled feather : Enabled flight : Enabled fs : Enabled gandiva : - json : Enabled orc : Enabled parquet : Enabled Filesystems: GcsFileSystem : Enabled HadoopFileSystem : Enabled S3FileSystem : Enabled Compression Codecs: brotli : Enabled bz2 : Enabled gzip : Enabled lz4_frame : Enabled lz4 : Enabled snappy : Enabled zstd : Enabled ``` </details> <details> <summary>R</summary> ``` > arrow_info() Arrow package version: 15.0.2 Capabilities: acero TRUE dataset TRUE substrait FALSE parquet TRUE json TRUE s3 TRUE gcs FALSE utf8proc TRUE re2 TRUE snappy TRUE gzip TRUE brotli TRUE zstd TRUE lz4 TRUE lz4_frame TRUE lzo FALSE bz2 TRUE jemalloc TRUE mimalloc FALSE Memory: Allocator jemalloc Current 0 bytes Max 0 bytes Runtime: SIMD Level none Detected SIMD Level none Build: C++ Library Version 15.0.2 C++ Compiler AppleClang C++ Compiler Version 15.0.0.15000309 ``` </details> Would it be reasonable and possible to expose build-time SIMD info in this output, perhaps under the "Build" section? This request might be a bit niche so I'm curious to hear if others think this would be generally useful. ### Component(s) C++, Python, R -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
