[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-07-20 Thread Hyukjin Kwon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-27815:
-
Issue Type: Improvement  (was: Bug)

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Wenchen Fan
>Priority: Major
>
> The current catalyst optimizer's predicate pushdown is divided into two 
> separate rules: PushDownPredicate and PushThroughJoin. This is not efficient 
> for optimizing cascading joins such as TPC-DS q64, where a whole default 
> batch is re-executed just due to this. We need a more efficient approach to 
> pushdown predicate as much as possible in a single pass.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-07-20 Thread Hyukjin Kwon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-27815:
-
Reporter: Yesheng Ma  (was: Wenchen Fan)

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Yesheng Ma
>Assignee: Yesheng Ma
>Priority: Major
> Fix For: 3.0.0
>
>
> The current catalyst optimizer's predicate pushdown is divided into two 
> separate rules: PushDownPredicate and PushThroughJoin. This is not efficient 
> for optimizing cascading joins such as TPC-DS q64, where a whole default 
> batch is re-executed just due to this. We need a more efficient approach to 
> pushdown predicate as much as possible in a single pass.






[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-07-20 Thread Hyukjin Kwon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-27815:
-
Priority: Major  (was: Blocker)

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Wenchen Fan
>Priority: Major
>
> The current catalyst optimizer's predicate pushdown is divided into two 
> separate rules: PushDownPredicate and PushThroughJoin. This is not efficient 
> for optimizing cascading joins such as TPC-DS q64, where a whole default 
> batch is re-executed just due to this. We need a more efficient approach to 
> pushdown predicate as much as possible in a single pass.






[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-07-20 Thread Hyukjin Kwon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-27815:
-
Target Version/s:   (was: 3.0.0)

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Wenchen Fan
>Priority: Major
> Fix For: 3.0.0
>
>
> The current catalyst optimizer's predicate pushdown is divided into two 
> separate rules: PushDownPredicate and PushThroughJoin. This is not efficient 
> for optimizing cascading joins such as TPC-DS q64, where a whole default 
> batch is re-executed just due to this. We need a more efficient approach to 
> pushdown predicate as much as possible in a single pass.






[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-07-20 Thread Hyukjin Kwon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-27815:
-
Description: The current catalyst optimizer's predicate pushdown is divided 
into two separate rules: PushDownPredicate and PushThroughJoin. This is not 
efficient for optimizing cascading joins such as TPC-DS q64, where a whole 
default batch is re-executed just due to this. We need a more efficient 
approach to pushdown predicate as much as possible in a single pass.  (was: 
Currently there is a hack in `DataFrameWriter`, which passes `SaveMode` to file 
source v2. This should be removed and file source v2 should not accept 
SaveMode.)

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Wenchen Fan
>Priority: Blocker
>
> The current catalyst optimizer's predicate pushdown is divided into two 
> separate rules: PushDownPredicate and PushThroughJoin. This is not efficient 
> for optimizing cascading joins such as TPC-DS q64, where a whole default 
> batch is re-executed just due to this. We need a more efficient approach to 
> pushdown predicate as much as possible in a single pass.






[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-07-20 Thread Hyukjin Kwon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-27815:
-
Fix Version/s: 3.0.0

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Wenchen Fan
>Priority: Major
> Fix For: 3.0.0
>
>
> The current catalyst optimizer's predicate pushdown is divided into two 
> separate rules: PushDownPredicate and PushThroughJoin. This is not efficient 
> for optimizing cascading joins such as TPC-DS q64, where a whole default 
> batch is re-executed just due to this. We need a more efficient approach to 
> pushdown predicate as much as possible in a single pass.






[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-05-23 Thread Wenchen Fan (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wenchen Fan updated SPARK-27815:

Priority: Blocker  (was: Major)

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Wenchen Fan
>Priority: Blocker
>
> Currently there is a hack in `DataFrameWriter`, which passes `SaveMode` to 
> file source v2. This should be removed and file source v2 should not accept 
> SaveMode.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)




[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-05-23 Thread Wenchen Fan (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wenchen Fan updated SPARK-27815:

Target Version/s: 3.0.0

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Wenchen Fan
>Priority: Blocker
>
> Currently there is a hack in `DataFrameWriter`, which passes `SaveMode` to 
> file source v2. This should be removed and file source v2 should not accept 
> SaveMode.






[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-05-23 Thread Wenchen Fan (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wenchen Fan updated SPARK-27815:

Description: Currently there is a hack in `DataFrameWriter`, which passes 
`SaveMode` to file source v2. This should be removed and file source v2 should 
not accept SaveMode.

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Wenchen Fan
>Priority: Major
>
> Currently there is a hack in `DataFrameWriter`, which passes `SaveMode` to 
> file source v2. This should be removed and file source v2 should not accept 
> SaveMode.






[jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2

2019-05-23 Thread Wenchen Fan (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-27815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wenchen Fan updated SPARK-27815:

Summary: do not leak SaveMode to file source v2  (was: do not support file 
source v2 in DataFrameWriter)

> do not leak SaveMode to file source v2
> --
>
> Key: SPARK-27815
> URL: https://issues.apache.org/jira/browse/SPARK-27815
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Affects Versions: 3.0.0
>Reporter: Wenchen Fan
>Priority: Major
>



