Re: DevX, Test infra Rgdn

2020-08-31 Thread Balaji Varadarajan
+1. This would be a great contribution as all developers will benefit from this work. On Monday, August 31, 2020, 08:07:08 AM PDT, Vinoth Chandar wrote: +1 this is a great way to also ramp on the code base On Sun, Aug 30, 2020 at 8:00 AM Sivabalan wrote: > As Hudi matures as a

Re: Hudi Writer vs Spark Parquet Writer - Sync

2020-08-31 Thread Balaji Varadarajan
Hi Felix, For read side performance, we are focussed on adding clustering support (https://cwiki.apache.org/confluence/display/HUDI/RFC+-+19+Clustering+data+for+speed+and+query+performance) and consolidated metadata

Re: DevX, Test infra Rgdn

2020-08-31 Thread Vinoth Chandar
+1 this is a great way to also ramp on the code base On Sun, Aug 30, 2020 at 8:00 AM Sivabalan wrote: > As Hudi matures as a project, we need to get our devX and test infra rock > solid. Availability of test utils and base classes for ease of writing more > tests, stable integration tests, ease

Re: [DISCUSS] Introduce incremental processing API in Hudi

2020-08-31 Thread wangxianghu
+1 This will give Hudi more capabilities besides data ingestion and writing, and make Hudi-based data processing more timely! Best, wangxianghu From: Abhishek Modi Sent: August 31, 2020 15:01 To: dev@hudi.apache.org Subject: Re: [DISCUSS] Introduce incremental processing API in Hudi +1 This sounds

Re: [DISCUSS] Introduce incremental processing API in Hudi

2020-08-31 Thread Abhishek Modi
+1 This sounds really interesting! I like that this implicitly gives Hudi the ability to do transformations on ingested data :) On Sun, Aug 30, 2020 at 10:59 PM vino yang wrote: > Hi everyone, > > > For a long time, in the field of big data, people hope that the tools they > use can give