Dima Zhiyanov created SPARK-5919: ------------------------------------ Summary: Enable broadcast joins for Parquet files Key: SPARK-5919 URL: https://issues.apache.org/jira/browse/SPARK-5919 Project: Spark Issue Type: Improvement Components: DataFrame Affects Versions: 1.2.1 Reporter: Dima Zhiyanov
Unable to perform broadcast join of Schema RDDs created from Parquet files. Computing statistics is only available for real Hive tables, and it is not always convenient to create a Hive table for every Parquet file The issue is discussed here http://apache-spark-user-list.1001560.n3.nabble.com/How-to-do-broadcast-join-in-SparkSQL-td15298.html -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org