alamb commented on issue #16903:
URL: https://github.com/apache/datafusion/issues/16903#issuecomment-3114729829

   I agree we should have a better plan about this
   
   Another thing that I would like to consider is allowing users to pick which  
of the several string representations to generate code for
   
   Right now, many of our functions have implementations for `StringArray`, 
`LargeStringArray`, `StringViewArray`. I suspect some will end up with special 
implementations for `DictionaryArray` and `REEArray` as well
   
   This means that we have 3x the code (and compile time, etc). This leads to 
large wasm sizes, among other problems
   - #16554 
   
   The same thing applies to group by values, sorting, etc that have 
specialized versions for String and StringView
   
   I wonder if we could/should add config flags for each of the string types 🤔 
   
   `features=utf8`, `features=utf8view`, etc 
   
   However, that sounds like a large testing exercise so maybe it isn't worth 
the complexity 
   
   I would love to hear other ideas


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to