Re: Could Hudi Data lake support low latency, high throughput random reads?

2021-06-27 Thread Jialun Liu
upposed to be used by analysts running offline queries, and it is not designed to be used as an OLTP database. https://prestodb.io/docs/current/overview/use-cases.html

Re: Could Hudi Data lake support low latency, high throughput random reads?

2021-06-26 Thread Vinoth Chandar
hnically possible to use data lake to support milliseconds latency, high throughput random reads at all today? Am I just not thinking in the right direction? Maybe it is just not sane to

Re: Could Hudi Data lake support low latency, high throughput random reads?

2021-06-26 Thread Jialun Liu
just not sane to serve online request-response service using Data lake as backend? Best regards, Bill On Sat, Jun 5, 2021 at 1:33 PM Kizhakkel Jose, Felix

Re: Could Hudi Data lake support low latency, high throughput random reads?

2021-06-23 Thread Vinoth Chandar
wrote: Hi Bill, Did you try using Presto (from EMR) to query HUDI tables on S3, and it could support real time queries. And you have to partition your data properly to m

Re: Could Hudi Data lake support low latency, high throughput random reads?

2021-06-07 Thread Jialun Liu
Subject: Could Hudi Data lake support low latency, high throughput random reads? Caution: This e-mail originated from outside of Philips, be careful for phishing. Hey guys, I

Re: Could Hudi Data lake support low latency, high throughput random reads?

2021-06-06 Thread Gary Li
data properly to minimize the amount of data each query has to scan/process. Regards, Felix K Jose From: Jialun Liu Date: Saturday, June 5, 2021 at 3:53 PM To: dev@hudi.apache.org Subject: Could Hudi Data lake su

Re: Could Hudi Data lake support low latency, high throughput random reads?

2021-06-05 Thread Jialun Liu
, Felix K Jose From: Jialun Liu Date: Saturday, June 5, 2021 at 3:53 PM To: dev@hudi.apache.org Subject: Could Hudi Data lake support low latency, high throughput random reads? Caution: This e-mail originated from outside of Philips, be careful for phishing.

Re: Could Hudi Data lake support low latency, high throughput random reads?

2021-06-05 Thread Kizhakkel Jose, Felix
:53 PM To: dev@hudi.apache.org Subject: Could Hudi Data lake support low latency, high throughput random reads? Caution: This e-mail originated from outside of Philips, be careful for phishing. Hey guys, I am not sure if this is the right forum for this question, if you know where this should

Could Hudi Data lake support low latency, high throughput random reads?

2021-06-05 Thread Jialun Liu
Hey guys, I am not sure if this is the right forum for this question, if you know where this should be directed, appreciated for your help! The question is that "Could Hudi Data lake support low latency, high throughput random reads?". I am considering building a data lake tha