Re: [DISCUSS] Hudi is the data lake platform

2021-07-21 Thread Vinoth Chandar
Expanding to users@ as well. Hi all, Since this discussion, I started to pen down a coherent strategy and convey these ideas via a blog post. I have also done my own research, talked to (ex)colleagues I respect to get their take and refine it. Here's a blog that hopefully explains this vision.

Re: [DISCUSS] Create Spark and Flink utilities module

2021-07-21 Thread Vinoth Chandar
Hi Vinay, I am not sure why we are bundling Parquet with Flink. If we are, we could try to resolve that? That's the route we can take first, IMO. Our bundles don't bundle Spark, Flink, Hadoop, or Parquet, so I think a single bundle is doable. Happy to help with specific issues as they come up. On Tue

Re: [DISCUSS] Hudi is the data lake platform

2021-07-21 Thread vino yang
Thanks vc. Very good blog, in-depth and forward-looking. Learned! Best, Vino On Thu, Jul 22, 2021 at 3:58 AM, Vinoth Chandar wrote: > Expanding to users@ as well. > > Hi all, > > Since this discussion, I started to pen down a coherent strategy and convey > these ideas via a blog post. > I have also done my own