GitHub user sameeragarwal opened a pull request:

    https://github.com/apache/spark/pull/13209

    [SPARK-15425][SQL] Disallow cartesian joins by default

    ## What changes were proposed in this pull request?
    
    In order to prevent users from inadvertently writing queries with cartesian 
joins, this patch introduces a new conf `spark.sql.join.cartesian.enabled` (set 
to `false` by default) that if not set, results in an `AnalysisException` if 
the query contains one or more cartesian products.
    
    ## How was this patch tested?
    
    Added a test to verify the new behavior in `JoinSuite`. Additionally, 
`SQLQuerySuite` and `SQLMetricsSuite` were modified to explicitly enable 
cartesian products.
    


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/sameeragarwal/spark disallow-cartesian

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/13209.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #13209
    
----
commit 2c4d6e29e1e7921b23a0ca45b8a9882a6a4c53a3
Author: Sameer Agarwal <sam...@databricks.com>
Date:   2016-05-20T02:04:04Z

    Disallow cartesian joins by default

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to