sherman commented on issue #599: URL: https://github.com/apache/arrow-java/issues/599#issuecomment-2647807829
> In this case I think you would be better served by the new [string view vector](https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout), which is unfortunately not fully implemented in Java (contributions are welcome). In this case the view buffer could be easily preallocated and memset up front (setting it all to 0 is perfectly fine) and then only slots with string data need to be touched. > > (BTW, I am assuming the data is a bit sparse? Otherwise I think fillHoles should not be doing much work. It's just trying to ensure that if you set slot `N` that all the slots before `N` are valid.) Yeah, I’ve seen and tried this implementation in data fusion (string views on top of data pages), and it outperforms Java quite well. That was my second question (is there string view impl. in java) ;-) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
