Hi,

I am looking for approaches to compare a row with the next one to
determine, e.g., differences in event-times/timestamps. I just found a
couple of solutions that use Window class, but that does not seem to
work on streaming data, such as
https://stackoverflow.com/questions/38229659/pyspark-compare-two-rows-in-dataframe
https://stackoverflow.com/questions/32148208/how-do-i-compare-multiple-rows-of-a-table-using-spark-sql-data-frames-udf

I got: "org.apache.spark.sql.AnalysisException: Non-time-based windows
are not supported on streaming DataFrames/Datasets;;..."

It would be great if anybody could give me a hint how to determine row
differences in python on streams comming from kafka.

Thanks and best regards,
on

---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org

Reply via email to