Github user kiszk commented on the issue:

    https://github.com/apache/spark/pull/19518
  
    Thank you for creating a PR for the latest Spark.
    
    I think that it is great to reduce the number of constant pool entries. I have one 
high-level comment.  
    IIUC, this PR **always** performs mutable state compaction. In other words, 
mutable states are always stored in arrays.  
    I am concerned about possible performance degradation from the increased access 
cost of putting states in arrays.
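    
    To make the concern concrete, here is a minimal sketch of the two layouts. The 
class and field names are hypothetical, and this is not the code the PR actually generates.

```java
// Hypothetical stand-ins for generated code; all names are illustrative only.
final class FieldStates {
    // One field per mutable state. Each declared field adds entries to the
    // class's constant pool, which is the cost compaction tries to avoid.
    private int count0;
    private int count1;
    private int count2;

    void update(int v) {
        count0 += v;        // direct field read and write
        count1 += 2 * v;
        count2 += 3 * v;
    }

    int total() {
        return count0 + count1 + count2;
    }
}

final class ArrayStates {
    // Compacted layout: a single array field holds every int state.
    private final int[] counts = new int[3];

    void update(int v) {
        counts[0] += v;     // read the array field, then a bounds-checked indexed access
        counts[1] += 2 * v;
        counts[2] += 3 * v;
    }

    int total() {
        return counts[0] + counts[1] + counts[2];
    }
}
```

    Both classes compute the same result. Whether the JIT compiler hides the extra 
indirection in the array version is exactly what a benchmark would need to show.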
    
    What do you think about putting mutable states into arrays (i.e. performing 
mutable state compaction) only when there are many mutable states or only for 
certain mutable states that are rarely accessed?  
    Or, can we say there is no performance degradation due to mutable state 
compaction?
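    
    As a rough sketch of the "only when there are many mutable states" option, an 
allocator could hand out plain fields up to a threshold and array slots after that. 
`MutableStateAllocator`, `threshold`, and `intStates` are hypothetical names, not 
existing Spark APIs.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: decides how each new int state is stored in generated code.
final class MutableStateAllocator {
    private final int threshold;                       // assumed tuning knob
    private final List<String> fieldDecls = new ArrayList<>();
    private int arraySlots = 0;

    MutableStateAllocator(int threshold) {
        this.threshold = threshold;
    }

    /** Returns the expression that generated code would use to access the new state. */
    String addIntState() {
        if (fieldDecls.size() < threshold) {
            // Few states so far: keep a plain field for cheap access.
            String name = "state" + fieldDecls.size();
            fieldDecls.add("private int " + name + ";");
            return name;
        }
        // Many states: compact the rest into one shared array.
        return "intStates[" + arraySlots++ + "]";
    }

    /** Returns the class-level declarations to emit. */
    List<String> declarations() {
        List<String> out = new ArrayList<>(fieldDecls);
        if (arraySlots > 0) {
            out.add("private int[] intStates = new int[" + arraySlots + "];");
        }
        return out;
    }
}
```

    With a threshold of 2, the first two states become the fields `state0` and `state1`, 
and later ones become `intStates[0]`, `intStates[1]`, and so on. The "rarely accessed" 
variant could use the same shape with a per-state hint instead of a count.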
    
    What do you think?



---
