Re: Current state of dataset api

2021-10-05 Thread Koert Kuipers
> I've forgotten. Ultimately we chose the DataFrame API as the sweet spot.
>
> Does anyone know of a good overview of the current state of the Dataset API, pros/cons as of Spark 3?
>
> Is it fully usable, do you get the advantages of a strongly typed dataframe? Any kn

Current state of dataset api

2021-10-04 Thread Magnus Nilsson
Hi, I tried using the (typed) Dataset API about three years ago. Back then there were limitations with predicate pushdown, serialization overhead, and maybe more things I've forgotten. Ultimately we chose the DataFrame API as the sweet spot. Does anyone know of a good overview of the current state