amoeba opened a new issue, #40986:
URL: https://github.com/apache/arrow/issues/40986

   ### Describe the enhancement requested
   
   A user who packages Arrow for nixpkgs reached out because they had built the 
arrow R package for aarch64, saw `arrow_info()` report:
   
   ```
   ...
   Runtime:
   
   SIMD Level          none
   Detected SIMD Level none
   ...
   ```
   
   They thought the above implied the package wasn't compiled with ASIMD/NEON 
support (which isn't true). Part of the confusion here is that this section is 
for runtime dispatch and doesn't reflect the compiler flags used. I originally 
filed https://github.com/apache/arrow/issues/40806 but this will get reverted 
since the approach affected the runtime info which was wrong.
   
   PyArrow and arrow R both provide users and developers with helpers for 
printing info about the package. The current output of each on an aarch64 
machine is,
   
   <details>
   <summary>PyArrow</summary>
   
   ```
   In [1]: pa.show_info()
   pyarrow version info
   --------------------
   Package kind              : python-wheel-macos
   Arrow C++ library version : 14.0.2
   Arrow C++ compiler        : AppleClang 15.0.0.15000040
   Arrow C++ compiler flags  :  -Qunused-arguments -fcolor-diagnostics
   Arrow C++ git revision    : 740889f413af9b1ae1d81eb1e5a4a9fb4ce9cf97
   Arrow C++ git description :
   Arrow C++ build type      : release
   
   Platform:
     OS / Arch           : Darwin arm64
     SIMD Level          : none
     Detected SIMD Level : none
   
   Memory:
     Default backend     : mimalloc
     Bytes allocated     : 0 bytes
     Max memory          : 0 bytes
     Supported Backends  : mimalloc, system
   
   Optional modules:
     csv                 : Enabled
     cuda                : -
     dataset             : Enabled
     feather             : Enabled
     flight              : Enabled
     fs                  : Enabled
     gandiva             : -
     json                : Enabled
     orc                 : Enabled
     parquet             : Enabled
   
   Filesystems:
     GcsFileSystem       : Enabled
     HadoopFileSystem    : Enabled
     S3FileSystem        : Enabled
   
   Compression Codecs:
     brotli              : Enabled
     bz2                 : Enabled
     gzip                : Enabled
     lz4_frame           : Enabled
     lz4                 : Enabled
     snappy              : Enabled
     zstd                : Enabled
   ```
   
   </details>
   
   <details>
   <summary>R</summary>
   
   ```
   > arrow_info()
   Arrow package version: 15.0.2
   
   Capabilities:
   
   acero      TRUE
   dataset    TRUE
   substrait FALSE
   parquet    TRUE
   json       TRUE
   s3         TRUE
   gcs       FALSE
   utf8proc   TRUE
   re2        TRUE
   snappy     TRUE
   gzip       TRUE
   brotli     TRUE
   zstd       TRUE
   lz4        TRUE
   lz4_frame  TRUE
   lzo       FALSE
   bz2        TRUE
   jemalloc   TRUE
   mimalloc  FALSE
   
   Memory:
   
   Allocator jemalloc
   Current    0 bytes
   Max        0 bytes
   
   Runtime:
   
   SIMD Level          none
   Detected SIMD Level none
   
   Build:
   
   C++ Library Version           15.0.2
   C++ Compiler              AppleClang
   C++ Compiler Version 15.0.0.15000309
   ```
   </details>
   
   Would it be reasonable and possible to expose build-time SIMD info in this 
output, perhaps under the "Build" section? This request might be a bit niche so 
I'm curious to hear if others think this would be generally useful.
   
   ### Component(s)
   
   C++, Python, R


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to