XiangpengHao commented on issue #7350:
URL: https://github.com/apache/arrow-rs/issues/7350#issuecomment-2766461872

   > we add new new ByteView to support 8bytes prefix
   
   I think Arrow spec says we need to do 4 bytes prefix: 
https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout
   
   As you have pointed out, StringViewArray is not always better than 
StringArray, especially when the prefixes are the same.
   
   But I do believe there are micro-architecture level optimizations we can do 
to improve performance, like better compiler hint, prefetching, gc tuning etc.
   
   Another direction is probably to rewrite the FilterExec/CoalesenceExec to 
emit StringArray rather than StringViewArray, the idea is to use StringView in 
lower levels of the plan and use String in higher levels of the plan 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to