Re: Hive From Spark: Jdbc VS sparkContext

郭鹏飞 Tue, 10 Oct 2017 03:23:47 -0700

> 在 2017年10月4日，上午2:08，Nicolas Paris <nipari...@gmail.com> 写道：
> 
> Hi
> 
> I wonder the differences accessing HIVE tables in two different ways:
> - with jdbc access
> - with sparkContext
> 
> I would say that jdbc is better since it uses HIVE that is based on
> map-reduce / TEZ and then works on disk. 
> Using spark rdd can lead to memory errors on very huge datasets.
> 
> 
> Anybody knows or can point me to relevant documentation ?
> 
> ---------------------------------------------------------------------
> To unsubscribe e-mail: user-unsubscr...@spark.apache.org



The jdbc will load data into the driver node, this may slow down the speed,and 
may OOM.


---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org

Re: Hive From Spark: Jdbc VS sparkContext

Reply via email to