Dima Zhiyanov created SPARK-5919:
------------------------------------

             Summary: Enable broadcast joins for Parquet files
                 Key: SPARK-5919
                 URL: https://issues.apache.org/jira/browse/SPARK-5919
             Project: Spark
          Issue Type: Improvement
          Components: DataFrame
    Affects Versions: 1.2.1
            Reporter: Dima Zhiyanov


Unable to perform broadcast join of Schema RDDs created from Parquet files. 
Computing statistics is only available for real Hive tables, and it is not 
always convenient to create a Hive table for every Parquet file

The issue is discussed here

http://apache-spark-user-list.1001560.n3.nabble.com/How-to-do-broadcast-join-in-SparkSQL-td15298.html



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to