Hello. I guess the time is fixed at 20 minutes now. My talk needs a fair amount of background, so it won't fit easily (depending on whether I can assume people have seen my previous meetup/CoC presentations).
Here is my abstract: Streaming, incremental, finite-memory computations in SQL over unbounded streams SQL is the standard language for expressing computations on collections. Using modern incremental view maintenance techniques, SQL can also be adopted as the standard language for computing on *changes* to collections. In previous presentations we have shown how to automatically convert any SQL program that defines views into an *incremental* program: the inputs of an incremental program are insertions, deletions, and updates to data tables, and the outputs of the incremental program are insertions, deletions, and updates of the maintained views. Whereas SQL queries are stateless systems, the incremental programs are stateful streaming systems that maintain complex *indexes* for performing efficient updates. The indexes enable computing all updates in time proportional to the size of the changes. In this presentation we discuss the problem of computing over data that grows unbounded (e.g., event streams), leading to potentially unbounded indexes. We present the design and implementation of a fully automatic mechanism which enables many such computations to use only finite memory by garbage-collecting the indexes at runtime. The mechanism requires users to specify bounds on the amount of "out-of-orderness" of the input data, using annotations on input tables. Thank you for organizing this! Mihai ________________________________ From: Stamatis Zampetakis <[email protected]> Sent: Wednesday, January 15, 2025 7:29 AM To: [email protected] <[email protected]> Subject: Re: [DISCUSS] Apache Calcite Meetup February 2025 I can also give a talk based on some recent work that we did in Apache Hive around CTEs. With Walaa's and mine we now have five talks for the meetup. 1. Improving testing of JDBC adapter's dialects - Julian Hyde (Google) 2. Federated Query Planning w/ Substrait - Victor Barua (Datadog) 3. Combining streaming and incremental computation in SQL - Mihai Budiu (Feldera) 4. Revolutionizing Data Lakes: A Dive into Coral, the SQL Translation, Analysis, and Rewrite Engine - Walaa Eldin Moustafa (LinkedIn) 5. Optimizing Common Table Expressions in Apache Hive with Calcite - Stamatis Zampetakis (Cloudera) It would be nice to keep the duration of each talk around 20 minutes so that we finish with the presentations in ~2hrs. If more or less time is needed for some talks we can adapt. I now created the event on meetup [1] based on the information that I have so far. It would be nice to fill in the description/abstract part for each talk so please share the necessary details when you get a chance. Best, Stamatis [1] https://www.meetup.com/apache-calcite/events/305627349 On Fri, Jan 10, 2025 at 8:16 PM Walaa Eldin Moustafa <[email protected]> wrote: > > That is a great idea. I would love to join in person as well. I can talk > about Coral [1]. > > Revolutionizing Data Lakes: A Dive into Coral, the SQL Translation, > Analysis, and Rewrite Engine. > > [1] https://github.com/linkedin/coral > > Thanks, > Walaa. > > > On Fri, Jan 10, 2025 at 11:05 AM Stamatis Zampetakis <[email protected]> > wrote: > > > I now have some more clarity regarding the physical venue of the > > meetup. It will be held in Cloudera's offices in Santa Clara, > > California. More details will come over the next few weeks. > > > > Cloudera is also going to help with promoting the event to various > > channels and other meetup groups. > > > > Now with Mihai's proposal we have three topics for the meetup: > > * Improving testing of our JDBC adapter's dialects > > * Federated Query Planning w/ Substrait > > * Combining streaming and incremental computation in SQL > > > > In order to put up the agenda and publish the event to a wider > > audience we need the title, abstract, and expected duration for each > > talk. The sooner we publish the event the better it will be for people > > who would like to join. > > > > Best, > > Stamatis > > > > > > > > On Fri, Jan 3, 2025 at 9:02 PM Mihai Budiu <[email protected]> wrote: > > > > > > I can give a talk about combining streaming and incremental computation > > in SQL. > > > > > > Mihai > > > > > > ________________________________ > > > From: Stamatis Zampetakis <[email protected]> > > > Sent: Thursday, January 2, 2025 2:22 AM > > > To: [email protected] <[email protected]> > > > Subject: Re: [DISCUSS] Apache Calcite Meetup February 2025 > > > > > > So far we have two topics: > > > * Improving testing of our JDBC adapter's dialects > > > * Federated Query Planning w/ Substrait > > > > > > Both of them would be quite interesting and relevant for the meetup. > > > > > > For people who are willing to give a talk, please share the title, > > > abstract, and expected duration as soon as possible. Given that > > > February is not too far away, I would like to put up a tentative > > > agenda on meetup [1] so that people who don't follow the dev list can > > > arrange their schedules for the event. > > > > > > Best, > > > Stamatis > > > > > > [1] https://www.meetup.com/Apache-Calcite/ > > > > > > On Tue, Dec 31, 2024 at 9:00 PM Victor Barua > > > <[email protected]> wrote: > > > > > > > > If folks are interested, I could give a talk along the lines of > > "Federated > > > > Query Planning w/ Substrait" which would be about how we're using > > Isthmus > > > > <https://github.com/substrait-io/substrait-java/tree/main/isthmus>, > > the > > > > Substrait <https://substrait.io/> <-> Calcite bridge, to be able to > > use > > > > Calcite as a query planner for our distributed execution system. > > > > > > > > On Tue, 24 Dec 2024 at 03:38, Ruben Q L <[email protected]> wrote: > > > > > > > > > Great idea, Stamatis! > > > > > I'll do my best to try to participate remotely. > > > > > > > > > > > > > > > > > > > > > > > > > On Tue, Dec 24, 2024 at 10:21 AM Stamatis Zampetakis < > > [email protected]> > > > > > wrote: > > > > > > > > > > > Regarding the time it will be difficult to find a slot that is > > > > > > suitable for everyone worldwide. I proposed 5pm PST to facilitate > > > > > > those that will participate in person. > > > > > > > > > > > > If there are people interested in giving a talk in other regions > > and > > > > > > the timing does not work for them then we can try to see if there > > is > > > > > > another option that could accommodate those. Please let me know if > > > > > > that's the case. > > > > > > > > > > > > Best, > > > > > > Stamatis > > > > > > > > > > > > On Mon, Dec 23, 2024 at 7:58 PM Julian Hyde <[email protected]> > > wrote: > > > > > > > > > > > > > > Excellent idea. I'm always happy to give a talk about Calcite, > > and > > > > > > > could travel to Santa Clara in person. One possible topic is > > > > > > > "Improving testing of our JDBC adapter's dialects". > > > > > > > > > > > > > > On Mon, Dec 23, 2024 at 9:46 AM Mihai Budiu <[email protected]> > > wrote: > > > > > > > > > > > > > > > > Great idea, I would participate in person or in the hybrid > > format. > > > > > > > > > > > > > > > > Mihai > > > > > > > > > > > > > > > > ________________________________ > > > > > > > > From: Stamatis Zampetakis <[email protected]> > > > > > > > > Sent: Monday, December 23, 2024 12:47 AM > > > > > > > > To: [email protected] <[email protected]> > > > > > > > > Subject: [DISCUSS] Apache Calcite Meetup February 2025 > > > > > > > > > > > > > > > > Hi all, > > > > > > > > > > > > > > > > It's been quite a while since our last meetup [1, 2] so I was > > > > > thinking > > > > > > > > it may be a good idea to organize one around the beginning of > > 2025. > > > > > > > > > > > > > > > > I am considering the possibility of a hybrid event with few > > > > > > > > presentations followed by open discussion and socializing. In > > terms > > > > > of > > > > > > > > location, I am discussing something around Santa Clara, > > California. > > > > > > > > For those who cannot attend physically, we will use zoom or > > another > > > > > > > > app so that people can participate remotely. > > > > > > > > > > > > > > > > The tentative date that I have in mind is Thursday, 20 > > February, > > > > > > > > 5:00pm PST but nothing is fixed and we can adapt to > > accommodate more > > > > > > > > people. > > > > > > > > > > > > > > > > I created a small anonymous google form [3] to see if there is > > enough > > > > > > > > interest to hold an event on the proposed date and if it is > > worth > > > > > > > > organising a hybrid event instead of a fully virtual one. > > Please take > > > > > > > > a few seconds to submit your responses. > > > > > > > > > > > > > > > > Are there people willing to give a talk around Calcite? > > > > > > > > > > > > > > > > Best, > > > > > > > > Stamatis > > > > > > > > > > > > > > > > [1] https://www.meetup.com/Apache-Calcite/ > > > > > > > > [2] > > https://lists.apache.org/thread/8vbdtbjxt9zx4k93dof085yj3v5z820s > > > > > > > > [3] https://forms.gle/iuAB6hoDHuLdy3BC7 > > > > > > > > > > > > >
