[GitHub] spark issue #23171: [SPARK-26205][SQL] Optimize In for bytes, shorts, ints

mgaido91 Thu, 29 Nov 2018 12:01:06 -0800

Github user mgaido91 commented on the issue:

    https://github.com/apache/spark/pull/23171
  
    @dbtsai I see, it would be great, though, to check which is this threshold. 
My understanding is that the current solution has better performance even for 
several hundreds of items. If this number is some thousands and since this 
depends on the datatype (so it is hard to control by the users with a single 
config), it is arguable which is the best solution: I don't think it is very 
common to have thousands of elements, while for lower numbers (more common) we 
would use the less efficient solution.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #23171: [SPARK-26205][SQL] Optimize In for bytes, shorts, ints

Reply via email to