Hi all, I've just viewed some Zeppenlin's videos. The intergration between Zeppenlin and Spark is really amazing and i want to use it for my application.
In my app, i will have a Spark streaming app to do some basic realtime aggregation ( intermediate data). Then i want to use Zeppenlin to do some realtime analytics on the intermediate data. My question is what's the most efficient storage engine to store realtime intermediate data? Is parquet file somewhere is suitable?