Re: Big Data Meetup @ LinkedIn tomorrow at 5:30pm

2019-03-26 Thread Shraddha Sahay
Yes, the event will be recorded and we will be posting the link on our website 
- http://pinot.apache.org/

Cheers,
Shraddha

On Mar 26, 2019, at 9:40 PM, Seunghyun Lee 
mailto:sn...@apache.org>> wrote:

I think that it will probably be recorded. Let me double check on this
tomorrow and will get back to you.

On Tue, Mar 26, 2019 at 9:16 PM Sai Boorlagadda 
mailto:sai.boorlaga...@gmail.com>>
wrote:

Is this streamed or recorded?

On Tue, Mar 26, 2019 at 9:08 PM Seunghyun Lee 
mailto:sn...@apache.org>> wrote:

Hi all,

I just found that we have advertised on the website but not in the
mailing
list about the meetup happening tomorrow.

LinkedIn is hosting a Big Data Meet Up(focused on Pinot and Presto) on
Mar
27 from 5:30PM - 9:00 PM. Come join us and learn about the different OLAP
systems that are powering companies like LinkedIn, Facebook, Uber and
Slack.

Please RSVP using the link below:

https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fsuperchargedolap.splashthat.comdata=02%7C01%7Cssahay%40linkedin.com%7Ca4c7d205044141cd30b008d6b26e48c2%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C636892584068692694sdata=QbIFuNpP4PnYvYgYCI%2Bpg3Yb3mioh4A1Do404WZV84A%3Dreserved=0

Best,
Seunghyun




Re: Big Data Meetup @ LinkedIn tomorrow at 5:30pm

2019-03-26 Thread Seunghyun Lee
I think that it will probably be recorded. Let me double check on this
tomorrow and will get back to you.

On Tue, Mar 26, 2019 at 9:16 PM Sai Boorlagadda 
wrote:

> Is this streamed or recorded?
>
> On Tue, Mar 26, 2019 at 9:08 PM Seunghyun Lee  wrote:
>
> > Hi all,
> >
> > I just found that we have advertised on the website but not in the
> mailing
> > list about the meetup happening tomorrow.
> >
> > LinkedIn is hosting a Big Data Meet Up(focused on Pinot and Presto) on
> Mar
> > 27 from 5:30PM - 9:00 PM. Come join us and learn about the different OLAP
> > systems that are powering companies like LinkedIn, Facebook, Uber and
> > Slack.
> >
> > Please RSVP using the link below:
> >
> > https://superchargedolap.splashthat.com
> >
> > Best,
> > Seunghyun
> >
>


Re: Apache Pinot Dev,uSer, Committer mailing lists

2019-03-26 Thread Seunghyun Lee
Thank you for your interest in contributing to Pinot!

I think that you can start to read documentation and get familiarized with
the code base. https://pinot.readthedocs.io/en/latest/dev_guide.html

Also, please join to slack channel using the below link.
https://join.slack.com/t/apache-pinot/shared_invite/enQtNDY4NDczOTYyNjk1LTExODVjY2QxYzBkMzJjNTk0ZGQ3NThiYTU2YzdlNjE0MWI5ZjUwYjI2ZTgxNjNiYWJiNmEzYjkxMTIzMzUxNTQ

In terms of the roadmap, some major features that we are currently working
are:
1. segment merge service
2. HCON based configs
3. distributed controllers

I think that other people can add more information about the roadmap. If
you have some ideas on new features, feel free to suggest and we can
discuss.

Also, if you happen to live in bay area, please join to the meetup
tomorrow. Most of contributors will be there :)
https://superchargedolap.splashthat.com

Best,
Seunghyun


On Tue, Mar 26, 2019 at 8:50 PM Sai Boorlagadda 
wrote:

> I am also interested to get involved and contribute to the community.
> Do we have a roadmap of things that we want to do?
>
> On Tue, Mar 26, 2019 at 5:00 PM Seunghyun Lee  wrote:
>
> > Hi Sujay,
> >
> > Welcome to Pinot community. Please let us know if you have any question
> or
> > you want to discuss on Pinot
> >
> > Best,
> > Seunghyun
> >
> > On Sun, Mar 24, 2019 at 4:31 AM sujay hegde 
> > wrote:
> >
> > > Hi Apache Pinot,
> > >
> > > I am interested in Apache Pinot Dev, user and Committer communities.
> > >
> > >
> > > Thanks and Regards,
> > > Sujay
> > >
> >
>


Re: Big Data Meetup @ LinkedIn tomorrow at 5:30pm

2019-03-26 Thread Sai Boorlagadda
Is this streamed or recorded?

On Tue, Mar 26, 2019 at 9:08 PM Seunghyun Lee  wrote:

> Hi all,
>
> I just found that we have advertised on the website but not in the mailing
> list about the meetup happening tomorrow.
>
> LinkedIn is hosting a Big Data Meet Up(focused on Pinot and Presto) on Mar
> 27 from 5:30PM - 9:00 PM. Come join us and learn about the different OLAP
> systems that are powering companies like LinkedIn, Facebook, Uber and
> Slack.
>
> Please RSVP using the link below:
>
> https://superchargedolap.splashthat.com
>
> Best,
> Seunghyun
>


Big Data Meetup @ LinkedIn tomorrow at 5:30pm

2019-03-26 Thread Seunghyun Lee
Hi all,

I just found that we have advertised on the website but not in the mailing
list about the meetup happening tomorrow.

LinkedIn is hosting a Big Data Meet Up(focused on Pinot and Presto) on Mar
27 from 5:30PM - 9:00 PM. Come join us and learn about the different OLAP
systems that are powering companies like LinkedIn, Facebook, Uber and Slack.

Please RSVP using the link below:

https://superchargedolap.splashthat.com

Best,
Seunghyun


Re: Apache Pinot Dev,uSer, Committer mailing lists

2019-03-26 Thread Sai Boorlagadda
I am also interested to get involved and contribute to the community.
Do we have a roadmap of things that we want to do?

On Tue, Mar 26, 2019 at 5:00 PM Seunghyun Lee  wrote:

> Hi Sujay,
>
> Welcome to Pinot community. Please let us know if you have any question or
> you want to discuss on Pinot
>
> Best,
> Seunghyun
>
> On Sun, Mar 24, 2019 at 4:31 AM sujay hegde 
> wrote:
>
> > Hi Apache Pinot,
> >
> > I am interested in Apache Pinot Dev, user and Committer communities.
> >
> >
> > Thanks and Regards,
> > Sujay
> >
>


Re: Pinot VS Kafka Streams (or Spark SQL)

2019-03-26 Thread Sai Boorlagadda
Thanks, Seunghyun for confirming my understanding.

On Tue, Mar 26, 2019 at 5:06 PM Seunghyun Lee  wrote:

> Hi Sai,
>
> Thank you for the mail. Your understanding is correct.
>
> Systems that you mentioned (Pinot, Kafka, Spark SQL) are built for
> different purposes and they are all critical components when you need to
> build the data analytics data pipeline.
>
> Kafka is a pub-sub message delivering system that is usually used to build
> the streaming data pipeline. Spark SQL is built to add SQL interface layer
> on top of Spark. You can think it as one of offline OLAP query engines like
> Hive or Presto that are used for computing complex ad-hoc queries over
> large data.
>
> Pinot aims to support "interactive" analytics use cases (e.g. dashboard,
> site facing reporting - who's viewed my profile on LInkedIn) where latency
> requirements are more strict than offline reporting use cases.
>
> You mentioned that "I can write a consumer that processes events from a
> stream and computes the needed analytic/metric and store it into a data
> store to serve". This is basically what Pinot does (consuming data from a
> stream, index it for serving, allow query interface)
>
> Below links are some references that we have published to public. Please
> let us know if you have any other question.
>
> Best,
> Seunghyun
>
> cc.
>
> LinkedIn Blog Posts
>
> https://engineering.linkedin.com/analytics/real-time-analytics-massive-scale-pinot
> https://engineering.linkedin.com/blog/2019/03/pinot-joins-apache-incubator
>
> Slides
> https://www.slideshare.net/jeanfrancoisim/intro-to-pinot-20160104
>
> https://www.slideshare.net/seunghyunlee1460/pinot-realtime-olap-for-530-million-users-sigmod-2018-107394584
>
> On Tue, Mar 26, 2019 at 3:46 PM Sai Boorlagadda  >
> wrote:
>
> > Hello Devs,
> >
> > Is there any content or blog post to understand the difference between
> > Pinot vs Kafka Streams (or Spark SQL)? At a high level after reading
> Pinot
> > is actually bringing off-the-shelf components which otherwise data
> > engineers have to write on top of KAFKA streams?
> >
> > I mean I can write a consumer that processes events from a stream and
> > computes the needed analytic/metric and store it into a data store to
> > serve. Though my application layer has to route the requests either to
> the
> > streaming analytics datastore or batch analytics data store.
> >
> > So my understanding is Pinot abstracts these two things from the
> > application. Is that correct?
> > Sai
> >
>


Re: Pinot VS Kafka Streams (or Spark SQL)

2019-03-26 Thread Seunghyun Lee
Hi Sai,

Thank you for the mail. Your understanding is correct.

Systems that you mentioned (Pinot, Kafka, Spark SQL) are built for
different purposes and they are all critical components when you need to
build the data analytics data pipeline.

Kafka is a pub-sub message delivering system that is usually used to build
the streaming data pipeline. Spark SQL is built to add SQL interface layer
on top of Spark. You can think it as one of offline OLAP query engines like
Hive or Presto that are used for computing complex ad-hoc queries over
large data.

Pinot aims to support "interactive" analytics use cases (e.g. dashboard,
site facing reporting - who's viewed my profile on LInkedIn) where latency
requirements are more strict than offline reporting use cases.

You mentioned that "I can write a consumer that processes events from a
stream and computes the needed analytic/metric and store it into a data
store to serve". This is basically what Pinot does (consuming data from a
stream, index it for serving, allow query interface)

Below links are some references that we have published to public. Please
let us know if you have any other question.

Best,
Seunghyun

cc.

LinkedIn Blog Posts
https://engineering.linkedin.com/analytics/real-time-analytics-massive-scale-pinot
https://engineering.linkedin.com/blog/2019/03/pinot-joins-apache-incubator

Slides
https://www.slideshare.net/jeanfrancoisim/intro-to-pinot-20160104
https://www.slideshare.net/seunghyunlee1460/pinot-realtime-olap-for-530-million-users-sigmod-2018-107394584

On Tue, Mar 26, 2019 at 3:46 PM Sai Boorlagadda 
wrote:

> Hello Devs,
>
> Is there any content or blog post to understand the difference between
> Pinot vs Kafka Streams (or Spark SQL)? At a high level after reading Pinot
> is actually bringing off-the-shelf components which otherwise data
> engineers have to write on top of KAFKA streams?
>
> I mean I can write a consumer that processes events from a stream and
> computes the needed analytic/metric and store it into a data store to
> serve. Though my application layer has to route the requests either to the
> streaming analytics datastore or batch analytics data store.
>
> So my understanding is Pinot abstracts these two things from the
> application. Is that correct?
> Sai
>


Re: Apache Pinot Dev,uSer, Committer mailing lists

2019-03-26 Thread Seunghyun Lee
Hi Sujay,

Welcome to Pinot community. Please let us know if you have any question or
you want to discuss on Pinot

Best,
Seunghyun

On Sun, Mar 24, 2019 at 4:31 AM sujay hegde 
wrote:

> Hi Apache Pinot,
>
> I am interested in Apache Pinot Dev, user and Committer communities.
>
>
> Thanks and Regards,
> Sujay
>


Pinot VS Kafka Streams (or Spark SQL)

2019-03-26 Thread Sai Boorlagadda
Hello Devs,

Is there any content or blog post to understand the difference between
Pinot vs Kafka Streams (or Spark SQL)? At a high level after reading Pinot
is actually bringing off-the-shelf components which otherwise data
engineers have to write on top of KAFKA streams?

I mean I can write a consumer that processes events from a stream and
computes the needed analytic/metric and store it into a data store to
serve. Though my application layer has to route the requests either to the
streaming analytics datastore or batch analytics data store.

So my understanding is Pinot abstracts these two things from the
application. Is that correct?
Sai