Hi,

I am wondering about the differences between two ways of accessing Hive tables:
- with JDBC access
- with SparkContext
I would say that JDBC is better, since it goes through Hive, which is based on MapReduce / Tez and therefore works on disk. Using Spark RDDs can lead to memory errors on very large datasets.

Does anybody know, or can anyone point me to relevant documentation?