Re: Spark 1.0 & embedded Hive libraries

2014-06-06 Thread Silvio Fiorito
Great, thanks for the info and pointer to the repo! From: Patrick Wendell Sent: Friday, June 6, 2014 5:11 PM To: user@spark.apache.org They are forked and slightly modified for two reasons: (a) Hive embeds a bunch of other depe

Re: Spark 1.0 & embedded Hive libraries

2014-06-06 Thread Patrick Wendell
They are forked and slightly modified for two reasons: (a) Hive embeds a bunch of other dependencies in its published jars, which makes it really hard for other projects to depend on them. If you look at the hive-exec jar, they copy a bunch of other dependencies directly into it. We m
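
One way to see the bundling described above is to count the class files under each package prefix inside a jar. What follows is only a minimal sketch, assuming you have a local copy of the jar. The helper name `bundled_packages` and its `depth` parameter are made up for illustration and are not part of Hive or Spark.

```python
import zipfile
from collections import Counter


def bundled_packages(jar_path, depth=2):
    """Count .class files per package prefix in a jar.

    Hypothetical helper: `depth` is the number of leading path
    segments used as the prefix (2 gives e.g. "org.apache").
    """
    counts = Counter()
    with zipfile.ZipFile(jar_path) as jar:
        for name in jar.namelist():
            # Skip manifests, resources, and directory entries.
            if not name.endswith(".class"):
                continue
            parts = name.split("/")[:-1]  # drop the class file name
            prefix = ".".join(parts[:depth]) if parts else "(default package)"
            counts[prefix] += 1
    return counts


# Example use (the path is a placeholder for a jar you have locally):
# for prefix, n in bundled_packages("path/to/some.jar").most_common():
#     print(n, prefix)
```

If a jar that is supposed to hold one project's classes reports many prefixes belonging to other projects, those dependencies were probably copied into it.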

Spark 1.0 & embedded Hive libraries

2014-06-06 Thread Silvio Fiorito
Is there a repo somewhere with the code for the Hive dependencies (hive-exec, hive-serde, & hive-metastore) used in Spark SQL? Are they forked with Spark-specific customizations, like Shark, or simply relabeled with a new package name ("org.spark-project.hive")? I couldn't find any repos on Githu