+1 for the merge. Apart from code contributions, helping users run and troubleshoot SQE is equally important and it seems that quite a number of folks are ready to help.
On Fri, Sep 2, 2016 at 8:46 PM, Kamil Sindi <ka...@jwplayer.com> wrote: > Our data science efforts rely on SQE to power our recommendations engine. I > am also excited to contribute to it especially as we continue to implement > predictive models at larger scales. > > On Fri, Sep 2, 2016 at 10:57 AM, Sahil Shah <sa...@jwplayer.com> wrote: > > > I would like to throw my support behind SQE. Having working with it in a > > production environment, I have seen the many benefits in testing new > > topologies and quickly understanding what a topology is doing. As our > data > > needs have grown, we have only increased our reliance on SQE and it > stands > > the test repeatedly. I am excited at the opportunity to contribute to > this > > wonderful open source community. > > > > On Fri, Sep 2, 2016 at 10:31 AM, Alex Halter <a...@jwplayer.com> wrote: > > > > > I too want to voice my support for SQE and our commitment to the > > initiative > > > going forward. We've been working on adapting Storm to our needs for > most > > > of two years. It was thoughtfully designed and supports our production > > > needs. We have a long list of features we want to build out and we'd > love > > > to work with the community. > > > > > > > > > On Fri, Sep 2, 2016 at 10:19 AM, Rohit Garg <rohit.gar...@gmail.com> > > > wrote: > > > > > > > I am one of the developers who has been working on SQE for past 1.5 > > > years. > > > > Over time, we have made it more stable and production ready. > > > > > > > > As of now, one can easily scale SQE for more production data with > easy > > > > config changes and re-deploy, aggregate across different dimensions > by > > > > writing json like sql, write to different state stores and most > > > > importantly, address new feature requirements really quick.(Since > it's > > > just > > > > writing a sql like json file and sqe handles everything for you ! ) > > > > > > > > I think SQE can really help companies who want to setup a production > > > ready > > > > and well tested framework within weeks (instead of months) for large > > > scale > > > > event stream processing and with minimum risks and limited resources. > > We > > > > are actively working on SQE to make it more awesome and are committed > > to > > > > make the experience of developing a highly scalable and fault > tolerant > > > > stream processing framework more seamless and less stressful !!!! > > > > > > > > On Fri, Sep 2, 2016 at 9:49 AM, Lee Morris <l...@jwplayer.com> wrote: > > > > > > > > > Hi, Storm Dev! > > > > > > > > > > I wanted to chime in to show support for SQE and show how committed > > we > > > > are > > > > > to SQE. *StormSQL looks awesome and has some real potential! * > > > > > > > > > > We use SQE in production. It has been tested, code reviewed, load > > > tested, > > > > > maintained, and processing an average of 8 million tuples per > minute > > or > > > > > more for over a year now. The investment into this code base has > been > > > > > significant. > > > > > > > > > > Please take a look at the code itself. The production quality code > is > > > > ready > > > > > to go. Developers with no experience with Storm or even streaming > > > > > successfully launch robust topologies using SQE. Our productivity > in > > > > this > > > > > area went up by orders of magnitude. > > > > > > > > > > Based on this experience we realized the value of querying storm, > and > > > we > > > > > decided to give that value back to the storm community. > > > > > > > > > > Our data pipelines and real-time processing are very important to > the > > > > > success of JW Player. SQE has been a foundation for that. We will > > > > continue > > > > > to invest into this technology for years to come. Unfortunately we > > > > wouldn't > > > > > be able to adopt StormSQL as is until it has been put through the > > > > crucible > > > > > of production level usage and has had the same rigor applied. It > > seems > > > > much > > > > > of the development has been over the last couple of weeks. > > > > > > > > > > *Quick Gap Analysis (Not Exhaustive)* > > > > > *States* > > > > > - SQE supports Redis and MongoDB as states in addition to Kafka. > > > (Soon > > > > > adding a Test/Monitor State) > > > > > - SQE supports non-static field names for Redis state > > > > > - Storm SQL supports Kafka > > > > > - SQE supports replay filtering for Kafka > > > > > > > > > > *Aggregations* > > > > > - SQE supports stateful, exactly-once aggregations for states > that > > > > > support it > > > > > - Storm SQL supports aggregations within each micro batch > > > > > > > > > > *SQL* > > > > > - StormSQL supports SQL > > > > > - SQE supports SQL "like" JSON > > > > > > > > > > *Scaling* > > > > > - SQE has a mechanism for controlling parallelism or scaling > > > > > - Could not find parallelism or scaling controls within StormSQL > > (May > > > > > need to look harder) > > > > > > > > > > *Support for SQE* > > > > > So far the SQE / JW Player developers have been watching this > thread > > > > > without knowing if we should chime in. I call upon the devs at JW > to > > > > chime > > > > > in because we are dedicated to the success of this SQL in Storm. > > > > > > > > > > (Noticed I said "chime" three times in this email... well now four > > > times) > > > > > > > > > > Thanks for reading, > > > > > > > > > > Lee Morris, Sr Principal Engineer, Data | JWPLAYER > > > > > > > > > > O: 212.244.0140 <212.244.0140%20x999> | M: 215.920.1331 > > > > > > > > > > 2 Park Avenue, 10th Floor North, New York NY 10016 > > > > > > > > > > jwplayer.com | @jwplayer <http://twitter.com/jwplayer> > > > > > > > > > > On Tue, Aug 30, 2016 at 5:46 PM, Jungtaek Lim <kabh...@gmail.com> > > > wrote: > > > > > > > > > > > Hi Morrigan, > > > > > > > > > > > > Thanks for joining discussion. I thought we need to hear your > goal > > to > > > > > > donate SQE code, and opinion for how to apply SQE to Storm SQL > and > > > > > working > > > > > > on further improvements. > > > > > > > > > > > > Not sure when you took a look at the feature set of Storm SQL, > but > > if > > > > you > > > > > > haven't recently, you may want to do that. > > > > > > I started working on improving Storm SQL several weeks ago, and > > many > > > > > things > > > > > > are addressed in recent weeks. > > > > > > > > > > > > * STORM-1435 <https://issues.apache.org/jira/browse/STORM-1435>: > > You > > > > can > > > > > > easily launch Storm SQL runner without concerning dependencies > for > > > > Storm > > > > > > SQL core and runtime. It wasn't easy to run before STORM-2016 > > > > > > <http://issues.apache.org/jira/browse/STORM-2016> is introduced. > > > > > > * Refactored Storm SQL code for Trident to fit to Trident > > operations. > > > > > Storm > > > > > > SQL parsed SQL and generated topology code but it was not easy to > > > know > > > > > how > > > > > > topology code is generated, and also hard to determine how > Trident > > > > > > optimizations are applied. > > > > > > * STORM-1434 <https://issues.apache.org/jira/browse/STORM-1434>, > > > > > > STORM-2050 > > > > > > <https://issues.apache.org/jira/browse/STORM-2050>: Addressed > > GROUP > > > BY > > > > > > with > > > > > > UDAF (User Defined Aggregate Function) on Trident mode. Storm SQL > > > > already > > > > > > supported UDF on Trident mode. > > > > > > * STORM-2057 <https://issues.apache.org/jira/browse/STORM-2057>: > > > JOIN > > > > > > (inner, left outer, right outer, full outer) feature is now on > > > > reviewing. > > > > > > Note that only equi-join is supported. > > > > > > > > > > > > The changes are not included to official release yet, but I > expect > > > > Storm > > > > > > 1.1.0 will include them which are worth to try out for early > > > adopters. > > > > > > > > > > > > You can also refer STORM-1433 > > > > > > <https://issues.apache.org/jira/browse/STORM-1433> for current > > phase > > > > of > > > > > > Storm SQL. Might need to have another phases (epics) for > resolving > > > > other > > > > > > issues as well. > > > > > > > > > > > > I only had a look at SQE wiki so don't know the detailed features > > of > > > > SQE, > > > > > > but my feeling is that recent changes fills the gap between SQE > and > > > > Storm > > > > > > SQL, and even addressing some TODOs of SQE. We might need to > cross > > > > check > > > > > > feature set of each project to make clear on pros and cons for > each > > > > > > project. > > > > > > > > > > > > Btw, while Storm SQL has been implemented its missing features, > the > > > > > > difficult part for Storm SQL is SQL optimizations. There seems > lots > > > of > > > > > SQL > > > > > > optimizations (like filter pushdown) but I'm not expert on that > and > > > it > > > > > > apparently needs more deep understanding of Calcite. Other parts > > also > > > > > need > > > > > > contributors but we strongly need contributors in this area. > > > > > > > > > > > > Thanks, > > > > > > Jungtaek Lim (HeartSaVioR) > > > > > > > > > > > > 2016년 8월 31일 (수) 오전 12:47, Morrigan Jones <morri...@jwplayer.com > > >님이 > > > > 작성: > > > > > > > > > > > > > Hi, I'm the original creator and primary developer of SQE. > Sorry > > > for > > > > > > > the radio silence on my part, I was out on vacation the past > two > > > > > > > weeks. > > > > > > > > > > > > > > I'm glad to see the Storm SQL project chugging along. I started > > SQE > > > > > > > because I wanted better tools on top of Storm, particularly the > > > > > > > ability to query streams and build topologies using SQL. Our > > > > > > > philosophy is to quickly iterate on our production systems and > > > > provide > > > > > > > immediate value. We've been able to do this with SQE, which > > powers > > > > our > > > > > > > streaming systems. Work on SQE and adding functions is driven > by > > > our > > > > > > > current use cases. The big near term item on our road map is to > > add > > > > > > > SQL parsing. Calcite is very promising there and brings lots of > > > > > > > additional features, as I'm sure you know. Additionally, we're > > > going > > > > > > > to improve our function, stream, and state support. > > > > > > > > > > > > > > The difficulty I can see for us with Storm SQL is the amount of > > > work > > > > > > > necessary to get from where we are now with SQE to integrating > > any > > > > > > > functionality and making sure Storm SQL can provide the > > > functionality > > > > > > > we have now, assuming that is the path we would all go. We're > > super > > > > > > > excited to see support for Storm grow and mature, and we'd like > > to > > > be > > > > > > > a part of that. But we also have to maintain our ability to > > iterate > > > > > > > quickly and provide immediate value. > > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > > > > > > Morrigan Jones > > > > > > > Principal Engineer > > > > > > > JWPLAYER | Your Way to Play > > > > > > > morri...@jwplayer.com | jwplayer.com > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > *Sahil Shah,* Data Engineer > > *JW*PLAYER | Your Way to Play > > P: 240.595.1169 | jwplayer.com > > > -- Regards, Abhishek Agarwal