PySpark row_number Question

infa elance Fri, 14 Apr 2017 13:28:05 -0700

Hi All,
I trying to understand how row_number is applied In the below code, does spark 
store data in a dataframe and then perform row_number function or does it apply 
while reading from hive ?


from pyspark.sql import HiveContext
hiveContext = HiveContext(sc)
hiveContext.sql("
( SELECT colunm1 ,column2,column3, ROW_NUMBER() OVER (ORDER BY columnname) AS 
RowNum FROM tablename )

Appreciate any guidance.
---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org

PySpark row_number Question

Reply via email to