Re: [DISCUSS] FLIP-39: Flink ML pipeline and ML libs

2019-06-03 Thread Stavros Kontopoulos
Hi, Some portion of the code could be migrated to the new Table API no? I am saying that because the new API design is based on scikit-learn and the old one was also inspired by it. Best, Stavros On Wed, May 22, 2019 at 1:24 PM Shaoxuan Wang wrote: > Another consensus (from the offline

Re: [DISCUSS] FLIP-23 Model Serving

2017-11-28 Thread Stavros Kontopoulos
approach. > > 5) I'm skeptical about using queryable state to expose metrics. Did you > consider using Flink's metrics system [1]? It is easily configurable and we > provided several reporters that export the metrics. > > What do you think? > Best, Fabian > > [1] > ht

[DISCUSS] FLIP-23 Model Serving

2017-11-23 Thread Stavros Kontopoulos
Hi guys, Let's discuss the new FLIP proposal for model serving over Flink. The idea is to combine previous efforts there and provide a library on top of Flink for serving models. https://cwiki.apache.org/confluence/display/FLINK/FLIP-23+-+Model+Serving Code from previous efforts can be found

Re: [ANNOUNCE] New Flink PMC member: Tzu-Li (Gordon) Tai

2017-07-12 Thread Stavros Kontopoulos
Congrats Gordon! On Wed, Jul 12, 2017 at 5:29 AM, Evans Ye wrote: > Congrats Gordon! > > Tzu-Li (Gordon) Tai 於 2017年7月12日 週三,上午2:28寫道: > > > Thanks everyone :) > > It has always been great working with you all! > > Looking forward to all the future

Re: [DISCUSS] FLIP proposal for Model Serving over Flink

2017-07-06 Thread Stavros Kontopoulos
out the ML efforts to Flink Forward > Berlin > > > this > > > > year? > > > > > > > > On Fri, Jun 30, 2017 at 6:04 PM, Fabian Hueske <fhue...@gmail.com> > > > wrote: > > > >> > > > >> Yes, I know that Theo is

Re: [DISCUSS] FLIP proposal for Model Serving over Flink

2017-06-30 Thread Stavros Kontopoulos
he > new module. > As people are contributing to the model serving module, the number of > committers should hopefully grow after some time. > > Best, Fabian > > 2017-06-30 10:58 GMT+02:00 Stavros Kontopoulos <st.kontopou...@gmail.com>: > > > Hi all, > > > >

[DISCUSS] FLIP proposal for Model Serving over Flink

2017-06-30 Thread Stavros Kontopoulos
Hi all, After coordinating with Theodore Vasiloudis and the guys behind the Flink Model Serving effort (Eron, Radicalbit people, Boris, Bas (ING)), we propose to start working on the model serving over Flink in a more official way. That translates to capturing design details in a FLIP document.

Re: Re: Switch to Scala 2.11 as a default build profile

2017-06-29 Thread Stavros Kontopoulos
+10 I think it makes sense spark is also using 2.11 as the default for quite some time. On Thu, Jun 29, 2017 at 8:14 PM, Bowen Li wrote: > EMR's builtin Flink is always 1 or 2 versions behind Flink latest release. > We choose to install Flink on EMR ourselves. > > On

Re: FlinkML on slack

2017-06-24 Thread Stavros Kontopoulos
gt; > > > We've created an app to automate the invite process, now you can just use > > the following link > > to get an invite to the FlinkML Slack group: > > > > https://flinkml-invites.herokuapp.com/ > > > > Regards, > > Theodore >

Re: FlinkML on slack

2017-06-20 Thread Stavros Kontopoulos
gt; > > > Hi Stavros, > > > Can I get an invitation for the slack channel. > > > > > > Thanks, > > > Shaoxuan > > > > > > > > > On Thu, Jun 8, 2017 at 3:56 AM, Stavros Kontopoulos < > > > st.kontopou...@gmail.com>

Re: FlinkML on slack

2017-06-17 Thread Stavros Kontopoulos
h Amarnath < > lokesh.amarn...@gmail.com> > wrote: > > > Hi Stravros, > > > > Could you also please add me to the Slack channel? My email id is: > > lokesh.amarn...@gmail.com. > > > > Thanks, > > Lokesh > > > > > > > > On Th

Re: FlinkML on slack

2017-06-15 Thread Stavros Kontopoulos
Ziyad added. Stavros On Sun, Jun 11, 2017 at 4:45 PM, Ziyad Muhammed <mmzi...@gmail.com> wrote: > Hi Stavros > > Could you please send me an invite to the slack channel? > > Best > Ziyad > > > On Sun, Jun 11, 2017 at 1:53 AM, Stavros Kontopoulos < > st.kon

Re: FlinkML on slack

2017-06-10 Thread Stavros Kontopoulos
s hsapu...@apache.org > > Thanks, > > Henry > > > On Thu, Jun 8, 2017 at 2:21 AM, Stavros Kontopoulos < > st.kontopou...@gmail.com> wrote: > > > Hi Aljoscha, > > > > Slack is invite only to the best of my knowledge, I just sent you an > > invitation. &

Re: FlinkML on slack

2017-06-08 Thread Stavros Kontopoulos
@Ted Yu sure. On Thu, Jun 8, 2017 at 5:18 PM, Ted Yu <yuzhih...@gmail.com> wrote: > Hi Stavros, > Can you add me as well ? > > Thanks > > On Wed, Jun 7, 2017 at 12:56 PM, Stavros Kontopoulos < > st.kontopou...@gmail.com> wrote: > > > Hi all

Re: FlinkML on slack

2017-06-08 Thread Stavros Kontopoulos
rote: > > > Thanks! > > > > > On 8. Jun 2017, at 11:21, Stavros Kontopoulos < > st.kontopou...@gmail.com> > > wrote: > > > > > > Hi Aljoscha, > > > > > > Slack is invite only to the best of my knowledge, I just sent you an >

Re: FlinkML on slack

2017-06-08 Thread Stavros Kontopoulos
> Aljoscha > > > On 7. Jun 2017, at 21:56, Stavros Kontopoulos <st.kontopou...@gmail.com> > wrote: > > > > Hi all, > > > > We took the initiative to create the organization for FlinkML on slack > > (thnx Eron). > > Th

FlinkML on slack

2017-06-07 Thread Stavros Kontopoulos
Hi all, We took the initiative to create the organization for FlinkML on slack (thnx Eron). There is now a channel for model-serving . Another is coming for flink-jpmml. You are invited to join the channels and

Re: interested in volunteering at Flink ML/Stream ML Project

2017-05-24 Thread Stavros Kontopoulos
Hi, If you are aware of the following you can skip it... There was a previous discussion about Flink ML recently and several people are working on it on different directions. Discussion:

Re: Machine Learning on Flink - Next steps

2017-03-22 Thread Stavros Kontopoulos
u please also create some table in google doc, that is > >>> representing > >>> the selected directions and persons, who would like to drive or > >>> participate > >>> in the particular topic, in order to make this process transparent for > >&

Re: Machine Learning on Flink - Next steps

2017-03-20 Thread Stavros Kontopoulos
e who expressed interest in the project. > > Would you be willing to lead that effort for the model serving project? > > Regards, > Theodore > > -- > Sent from a mobile device. May contain autocorrect errors. > > On Mar 19, 2017 3:49 AM, "Stavros Kontopoulos" <

Re: Machine Learning on Flink - Next steps

2017-03-18 Thread Stavros Kontopoulos
Hi all... I agree about the tensorflow integration it seems to be important from what I hear. Should we sign up somewhere for the working groups (gdcos)? I would like to start helping with the model serving feature. Best Regards, Stavros On Fri, Mar 17, 2017 at 10:34 PM, Gábor Hermann

Re: Machine Learning on Flink - Next steps

2017-03-10 Thread Stavros Kontopoulos
Thanks Theodore, I'd vote for - Offline learning with Streaming API - Low-latency prediction serving Some comments... Online learning Good to have but my feeling is that it is not a strong requirement (if a requirement at all) across the industry right now. May become hot in the future.

Re: [DISCUSS] Flink ML roadmap

2017-03-03 Thread Stavros Kontopoulos
ed here, so I'll try > to > > >>> put there all the arguments mentioned in this thread. Feel free to > put > > >>> there more :) > > >>> > > >>> @Stavros: I agree we should take action fast. What about collecting > our > > >&

Re: [DISCUSS] Code style / checkstyle

2017-02-27 Thread Stavros Kontopoulos
+1 to provide and enforcing a unified code style for both java and scala. Unification should apply when it makes sense like comments though. Eventually code base should be re-factored. I would vote for the one at a time module fix apporoach. Style guide should be part of any PR review. We could

Re: [DISCUSS] Flink ML roadmap

2017-02-23 Thread Stavros Kontopoulos
t; to think. >> During analysis something will finally arise. >> May be we can ask partners of Flink for cases? Data Artisans got results >> of customers survey: [1], ML better support is wanted, so we could ask >> what >> exactly is necessary. >> >> [1] http:

Re: [DISCUSS] Flink ML roadmap

2017-02-23 Thread Stavros Kontopoulos
+100 for a design doc. Could we also set a roadmap after some time-boxed investigation captured in that document? We need action. Looking forward to work on this (whatever that might be) ;) Also are there any data supporting one direction or the other from a customer perspective? It would help

Re: [DISCUSS] Flink ML roadmap

2017-02-21 Thread Stavros Kontopoulos
Ok I see. Suppose we solve all the critical issues. And suppose we dont go with the pure online model (although online ML has a potential)... should we move on with the current ML implementation which is for batch processing (to the best of my knowledge)? The parameter server problem is a long

Re: [DISCUSS] Flink ML roadmap

2017-02-20 Thread Stavros Kontopoulos
contributing to Flink ML lately. I > > > > believe we should rethink our goals, to put the contribution efforts > in > > > > making a usable and useful library. Are we trying to implement as > many > > > > useful algorithms as possible to create a scalable ML

[DISCUSS] Flink ML roadmap

2017-02-20 Thread Stavros Kontopoulos
(Resending with the appropriate topic) Hi, I would like to start a discussion about next steps for Flink ML. Currently there is a lot of work going on but needs a push forward. Some topics to discuss: a) How several features should be planned and get aligned with Flink releases. b) Priorities

Flink ML

2017-02-19 Thread Stavros Kontopoulos
Hi, I would like to start a discussion about next steps for Flink ML. Currently there is a lot of work going on but needs a push forward. Some topics to discuss: a) How several features should be planned and get aligned with Flink releases. b) Priorities of what should be done. c) Basic

[jira] [Created] (FLINK-5841) Algorithms for each pipeline stage should handle NaN, infinity like in scikit-learn

2017-02-18 Thread Stavros Kontopoulos (JIRA)
Stavros Kontopoulos created FLINK-5841: -- Summary: Algorithms for each pipeline stage should handle NaN, infinity like in scikit-learn Key: FLINK-5841 URL: https://issues.apache.org/jira/browse/FLINK-5841

[jira] [Created] (FLINK-5785) Add an Imputer for preparing data

2017-02-13 Thread Stavros Kontopoulos (JIRA)
Stavros Kontopoulos created FLINK-5785: -- Summary: Add an Imputer for preparing data Key: FLINK-5785 URL: https://issues.apache.org/jira/browse/FLINK-5785 Project: Flink Issue Type: New

Re: flink-ml test

2017-02-12 Thread Stavros Kontopoulos
Ok I missed the IT part, if it is an integration test class you could use mvn integration-test -DwildcardSuites= the mvn target is different. On Sat, Feb 11, 2017 at 12:20 AM, Stavros Kontopoulos < st.kontopou...@gmail.com> wrote: > Weird for me it works. I can post the output he

Re: Flink ML - NaN Handling

2017-02-12 Thread Stavros Kontopoulos
gt; Till > > On Fri, Feb 10, 2017 at 11:48 PM, Stavros Kontopoulos < > st.kontopou...@gmail.com> wrote: > > > Hello guys, > > > > Is there a story for this (might have been discussed earlier)? I see > > differences between scikit-learn and n

Re: [ANNOUNCE] Welcome Stefan Richter as a new committer

2017-02-10 Thread Stavros Kontopoulos
Congrats! On Fri, Feb 10, 2017 at 9:11 PM, Matthias J. Sax wrote: > -BEGIN PGP SIGNED MESSAGE- > Hash: SHA512 > > Congrats! > > On 2/10/17 2:00 AM, Ufuk Celebi wrote: > > Hey everyone, > > > > I'm very happy to announce that the Flink PMC has accepted Stefan > >

Flink ML - NaN Handling

2017-02-10 Thread Stavros Kontopoulos
Hello guys, Is there a story for this (might have been discussed earlier)? I see differences between scikit-learn and numpy. Do we standardize on scikit-learn? PS. I am working on the preprocessing stuff. Best, Stavros

Re: flink-ml test

2017-01-27 Thread Stavros Kontopoulos
e.flink.ml On Fri, Jan 27, 2017 at 4:07 PM, Stavros Kontopoulos < st.kontopou...@gmail.com> wrote: > typo:remove the second BreezeMathSuite.. > > > On Fri, Jan 27, 2017 at 4:06 PM, Stavros Kontopoulos < > st.kontopou...@gmail.com> wrote: > >> Hi, >> >&g

Re: flink-ml test

2017-01-27 Thread Stavros Kontopoulos
Hi, For running a specific test under flink-ml: mvn test -DwildcardSuites=org.apache.flink.ml.math.BreezeMathSuite BreezeMathSuite Cheers, Stavros On Fri, Jan 27, 2017 at 12:01 PM, Driesprong, Fokko wrote: > Hi Anton, > > I'm curious what tests fail. I run the tests by

Re: flink-ml test

2017-01-27 Thread Stavros Kontopoulos
typo:remove the second BreezeMathSuite.. On Fri, Jan 27, 2017 at 4:06 PM, Stavros Kontopoulos < st.kontopou...@gmail.com> wrote: > Hi, > > For running a specific test under flink-ml: > mvn test -DwildcardSuites=org.apache.flink.ml.math.BreezeMathSuite > BreezeMathSuite

[jira] [Created] (FLINK-5588) Add a unit scaler based on different norms

2017-01-20 Thread Stavros Kontopoulos (JIRA)
Stavros Kontopoulos created FLINK-5588: -- Summary: Add a unit scaler based on different norms Key: FLINK-5588 URL: https://issues.apache.org/jira/browse/FLINK-5588 Project: Flink Issue

[jira] [Created] (FLINK-5525) Streaming Version of a Linear Regression model

2017-01-17 Thread Stavros Kontopoulos (JIRA)
Stavros Kontopoulos created FLINK-5525: -- Summary: Streaming Version of a Linear Regression model Key: FLINK-5525 URL: https://issues.apache.org/jira/browse/FLINK-5525 Project: Flink

Re: buffering in operators, implementing statistics

2016-05-23 Thread Stavros Kontopoulos
tion()) > > > > > > with sketch data types and a fold function that is tailored to the user > > > types. Therefore, I would prefer to not add a special API for this and > > vote > > > to close https://issues.apache.org/jira/browse/FLINK-2147. I already > > &g

withBroadcastSet for a DataStream missing?

2016-03-29 Thread Stavros Kontopoulos
H i am new here... I am trying to implement online k-means as here https://databricks.com/blog/2015/01/28/introducing-streaming-k-means-in-spark-1-2.html with flink. I dont see anywhere a withBroadcastSet call to save intermediate results is this currently supported? Is intermediate results