https://cwiki.apache.org/confluence/display/HUDI/20200915+Weekly+Sync+Minutes
Thanks
Vinoth
Hmm but our use case has multiple schemas one for each dataset as each
dataset is unique in our case and hence the need to validate the schema for
each dataset while writing.
On Tue, 15 Sep 2020 at 2:53 AM, Vinoth Chandar wrote:
> Hi,
>
>
>
> Typically writing people use a single schema, thats
The current time suits well for me personally as well. But I'm fine with
8-9 pm if that helps accommodate other folks.
Thanks,
Nishith
On Tue, Sep 15, 2020 at 7:29 AM Bhavani Sudha
wrote:
> The current time suited well for me personally.
> Moving that to 1 hour earlier should be okay mostly. I
The current time suited well for me personally.
Moving that to 1 hour earlier should be okay mostly. I might be little late
depending on kid care duties some days. We can go ahead with the change if
timing is fine with everyone.
Thanks,
Sudha
On Tue, Sep 15, 2020 at 7:08 AM Vinoth Chandar
Folks,
Please chime in with your opinions. I still can see some regulars (e.g
Nishith, Sudha, Gary) who have not chimed in
On Tue, Sep 15, 2020 at 12:22 AM Pratyaksh Sharma
wrote:
> Hi,
>
> Just wanted to confirm the time for this week's sync up. @Vinoth Chandar
>
>
> On Thu, Sep 10, 2020 at
> So, you are trying to avoid reading the again from an incremental query?
If
so, I don't know how we can achieve this in Hudi.
Let's say we
a) read the 20 mins of data from Kafka or DFS, into a Spark Dataframe,
b) issue an upsert into a hudi table (at this point the dataframe is lost)
if you
Hi,
Just wanted to confirm the time for this week's sync up. @Vinoth Chandar
On Thu, Sep 10, 2020 at 1:58 AM Pratyaksh Sharma
wrote:
> Great. I request others to also please chime in so that we can finalise
> the time for sync up.
>
> On Wed, Sep 9, 2020 at 9:00 AM Balaji Varadarajan
>