Dear Yeikel
I checked my code and it uses getOrCreate to create a SparkSession.
Therefore, I should be retrieving the same SparkSession instance everytime I
call that method.
Thanks for your reminding.
Best regard
--
Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/
Dear all
I already had a python function which is used to query data from HBase and
HDFS with given parameters. This function returns a pyspark dataframe and
the SparkContext it used.
With client's increasing demands, I need to merge data from multiple query.
I tested using "union" function to