[ https://issues.apache.org/jira/browse/SPARK-19443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Liang-Chi Hsieh closed SPARK-19443.
-----------------------------------
    Resolution: Won't Fix

> The function to generate constraints takes too long when the query plan grows continuously
> ------------------------------------------------------------------------------------------
>
>                 Key: SPARK-19443
>                 URL: https://issues.apache.org/jira/browse/SPARK-19443
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.1.0
>            Reporter: Liang-Chi Hsieh
>
> This issue was originally reported and discussed at
> http://apache-spark-developers-list.1001551.n3.nabble.com/SQL-ML-Pipeline-performance-regression-between-1-6-and-2-x-tc20803.html
>
> When running an ML `Pipeline` with many stages, the `Dataset` is updated
> iteratively, and fit and transform are observed to take longer to finish
> as the query plan grows continuously.
>
> Specifically, the time spent preparing the optimized plan in the current branch
> (74294 ms) is much higher than in 1.6 (292 ms). Most of that time is spent
> generating the query plan's constraints during a few optimization rules.
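
The growth described in the issue can be pictured with a toy model. The sketch below is not Spark code: `PlanNode`, `count_visits`, and `simulate_pipeline` are hypothetical names, and it assumes only that every pipeline stage stacks one more operator on the plan and that a constraint-generation pass walks the whole plan each time the plan is optimized. Under those assumptions the per-stage cost rises with plan depth, so the total over the pipeline grows roughly quadratically. Spark's real constraint inference may grow faster or slower than this simple walk.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class PlanNode:
    """Hypothetical stand-in for one logical-plan operator (not a Spark class)."""
    name: str
    child: Optional["PlanNode"] = None


def count_visits(plan: PlanNode) -> int:
    """Walk the plan from root to leaf, as a full-plan pass would, and count the nodes visited."""
    visits = 0
    node: Optional[PlanNode] = plan
    while node is not None:
        visits += 1
        node = node.child
    return visits


def simulate_pipeline(num_stages: int) -> List[int]:
    """Stack one operator per stage and re-walk the whole plan after each stage."""
    plan = PlanNode("source")
    per_stage_cost = []
    for i in range(num_stages):
        plan = PlanNode(f"stage_{i}", child=plan)  # the plan keeps growing
        per_stage_cost.append(count_visits(plan))  # the pass restarts from the root
    return per_stage_cost


if __name__ == "__main__":
    costs = simulate_pipeline(5)
    print("per-stage visits:", costs)
    print("total visits:", sum(costs))
```

In this model, truncating the plan between stages (for example by materializing the intermediate result) would reset the per-stage cost, which is one way to reason about workarounds for a continuously growing plan.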