j-esse opened a new pull request #23556: [SPARK-26626] Maximum size for 
repeatedly substituted aliases in SQL expressions
URL: https://github.com/apache/spark/pull/23556
 
 
   ## What changes were proposed in this pull request?
   
   This adds a `spark.sql.maxRepeatedAliasSize` config option, which specifies 
the maximum size of an aliased expression that may be substituted more than 
once (in CollapseProject and PhysicalOperation). This prevents large aliased 
expressions from being substituted multiple times, which would otherwise 
explode the size of the expression tree and eventually OOM the driver.
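   
   To illustrate the blow-up, here is a minimal, Spark-free sketch in Python. It is not the code from this patch: the `Ref`/`Add` node types, the `should_inline` and `collapse_chain` helpers, and the choice to measure "size" as a node count are all assumptions made for illustration. It models a chain of projections where each level references the previous alias twice, and compares unconditional collapsing against collapsing guarded by a size limit.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Ref:
    """Reference to a column or alias by name (hypothetical node type)."""
    name: str


@dataclass(frozen=True)
class Add:
    """Binary expression node (hypothetical node type)."""
    left: "Expr"
    right: "Expr"


Expr = Union[Ref, Add]


def size(e: Expr) -> int:
    """Number of nodes in the expression tree (assumed size metric)."""
    if isinstance(e, Ref):
        return 1
    return 1 + size(e.left) + size(e.right)


def count_refs(e: Expr, name: str) -> int:
    """How many times the alias `name` is referenced inside `e`."""
    if isinstance(e, Ref):
        return 1 if e.name == name else 0
    return count_refs(e.left, name) + count_refs(e.right, name)


def substitute(e: Expr, name: str, replacement: Expr) -> Expr:
    """Replace every reference to `name` in `e` with `replacement`."""
    if isinstance(e, Ref):
        return replacement if e.name == name else e
    return Add(substitute(e.left, name, replacement),
               substitute(e.right, name, replacement))


def should_inline(upper: Expr, name: str, aliased: Expr, limit: float) -> bool:
    """Assumed guard: refuse to inline an alias that is referenced more than
    once when its definition is larger than `limit` nodes."""
    repeated = count_refs(upper, name) > 1
    return not (repeated and size(aliased) > limit)


def collapse_chain(levels: int, limit: float) -> int:
    """Collapse the chain c_i = c_{i-1} + c_{i-1} and return the largest
    expression size seen along the way."""
    current: Expr = Ref("c0")
    largest = size(current)
    for i in range(1, levels + 1):
        prev = f"c{i - 1}"
        upper = Add(Ref(prev), Ref(prev))
        if should_inline(upper, prev, current, limit):
            current = substitute(upper, prev, current)
        else:
            # Keep the lower projection separate; start again from `upper`.
            current = upper
        largest = max(largest, size(current))
    return largest


print(collapse_chain(10, float("inf")))  # no limit: size doubles per level
print(collapse_chain(10, 100))           # guarded by a limit of 100 nodes
```

   With no limit, the collapsed expression roughly doubles at every level (2047 nodes after 10 levels in this model). With a limit of 100, the largest expression stays at 127 nodes, because inlining stops once the aliased expression exceeds the limit.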
   
   The default value of 100 was chosen through testing to find the 
best-performing setting:
   
   
![image](https://user-images.githubusercontent.com/17480705/51204201-dd285300-18b7-11e9-8781-dd698df00389.png)
   
   ## How was this patch tested?
   
   Added unit tests and did manual testing.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services
