Esa Heikkinen Thu, 28 Dec 2017 04:34:44 -0800
Hi, I would like to build a PySpark application that searches for sequential items or events in time series read from CSV files.
What are the best data structures for this purpose: a PySpark DataFrame, a pandas DataFrame, an RDD, SQL, or something else? --- Esa