Re: [VOTE] Accept Stateful Functions into Apache Flink

2019-10-30 Thread Vasiliki Kalavri
+1 (binding) from me. I hope this is not too late :) Thank you for this great contribution! On Wed, 30 Oct 2019 at 14:45, Stephan Ewen wrote: > Thank you all for voting. > > The voting period has passed, but only 13 PMC members have voted so far, > that is less than 2/3rd of the PMCs (17 member

Re: [DISCUSS] Java code style

2015-10-23 Thread Vasiliki Kalavri
Hey, sorry I haven't replied so far. I was enjoying the thread though :P I'm +1 for 120 line length and tabs. I wouldn't voice a -1 for spaces, but it seems to me like an unnecessary change that would touch every single Java file without substantially improving anything. JavaDocs by-module wi

Re: From 0.10 to 1.0

2015-10-23 Thread Vasiliki Kalavri
+1 ^^ On 23 October 2015 at 13:14, Matthias J. Sax wrote: > +1 for 1.0 > it's time to "grow up" :) > > On 10/23/2015 12:52 PM, Kostas Tzoumas wrote: > > +1 for 1.0, it's the right time if not a bit overdue in my opinion > > > > On Fri, Oct 23, 2015 at 12:06 PM, Fabian Hueske > wrote: > > > >> Y

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc1)

2015-10-27 Thread Vasiliki Kalavri
I tested this for rc0, and I confirm. Worked fine for Firefox and Chrome, didn't work for Safari (I left a note in the previous testing doc). -Vasia. On 27 October 2015 at 15:18, Maximilian Michels wrote: > Good catch, Aljoscha. As far as I know the plan visualizer is only broken > for Safari.

[gelly] Spargel model rework

2015-10-27 Thread Vasiliki Kalavri
Hello squirrels, I want to discuss with you a few concerns I have about our current vertex-centric model implementation, Spargel, now fully subsumed by Gelly. Spargel is our implementation of Pregel [1], but it violates some fundamental properties of the model, as described in the paper and as im

Re: [gelly] Spargel model rework

2015-10-27 Thread Vasiliki Kalavri
Your SSSP example speaks for itself. Straightforward, if the reader is > > familiar with Pregel/Giraph/... > > > > Best, > > Martin > > > > > > On 27.10.2015 17:40, Vasiliki Kalavri wrote: > > > >> Hello squirrels, > >> > >

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
Is there a new testing doc for rc2 or are we using the previous one? Thanks! On 27 October 2015 at 22:17, Maximilian Michels wrote: > Please vote on releasing the following candidate as Apache Flink version > 0.10.0: > > The commit to be voted on: > ed75049dfc9748eae81ace9d4d686907dcd7835c > > B

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
Maximilian Michels wrote: > Thanks for testing, Vasia :) > > Here is the new document: > > https://docs.google.com/document/d/1CR3DH4tUJvukxGFQ1ySxfnzO00LjPhSTwkeE7Mf98CY/edit > > I've transferred results which are unaffected by the changes of the new RC. > > On Wed,

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
I see, thank you! +1 for removing before the release :) On 28 October 2015 at 13:06, Sachin Goel wrote: > Those are hard coded values. > What exactly should be there, I'm not sure either. > On Oct 28, 2015 5:25 PM, "Vasiliki Kalavri" > wrote: > > > I have a

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
hile we're at it, we should also remove the dummy log and stdout tabs for > task managers. The work on that hasn't been finished yet. > I'll file a jira for both. > On Oct 28, 2015 5:39 PM, "Vasiliki Kalavri" > wrote: > > > I see, thank you! +1 for

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
columns in the "Plan" tab on Firefox. Which > version are you using? Does resizing the window make any difference? > > @Sachin: Thanks for your pull requests. Will pull them in for the next RC. > > On Wed, Oct 28, 2015 at 2:03 PM, Vasiliki Kalavri > wrote: > > I thi

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
> -- Sachin Goel > > Computer Science, IIT Delhi > > m. +91-9871457685 > > > > On Wed, Oct 28, 2015 at 7:04 PM, Vasiliki Kalavri < > [email protected] > >> wrote: > > > >> It's Firefox 41.0.2. Resizing doesn't work :/ > >

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
h CSS unfortunately. > >> > >> @Vasia, there seems to be different things which are being used to > render > >> those two. For the running jobs page, job.tasks is rendered, while for > the > >> job page, job.status-counts is being used. Looking into it now.

Re: Fast strategy for intersect

2015-10-28 Thread Vasiliki Kalavri
Hi Martin, isn't finding the intersection of edges enough in this case? And assuming there are no duplicate edges, I believe a join should do the trick. Cheers, -Vasia. On 28 October 2015 at 13:15, Martin Junghanns wrote: > Hi all! > > While working on FLINK-2905, I was wondering what a good (

neo4j - Flink connector

2015-10-29 Thread Vasiliki Kalavri
Hello everyone, Martin, Martin, Alex (cc'ed) and I have started discussing the implementation of a neo4j-Flink connector. I've opened a corresponding JIRA (FLINK-2941) containing an initial document [1], but we'd also like to share our ideas here to engage the community and get your feedback. W

Fwd: neo4j - Flink connector

2015-10-29 Thread Vasiliki Kalavri
ts from neo4j with the REST API [1]. We didn't end up > using it, so I can't comment on performance etc., but intuitively it seems > like a better chunking. > > Nice to see another connector for Flink. :) > > Best regards, > > Martin > > [1] http://neo4j.com

Re: [gelly] Spargel model rework

2015-10-30 Thread Vasiliki Kalavri
them to fit in memory, right? I'm also working on a version where the graph is represented as an adjacency list, instead of two separate datasets of vertices and edges. The disadvantage is that the graph has to fit in memory, but I think the advantages are many. We'll be able to s

Re: [gelly] Spargel model rework

2015-11-03 Thread Vasiliki Kalavri
ve > > >> the > > >> performance considerably. > > >> > > >> The good news is that it would come "for free" because the necessary > > >> partitioning and sorting can be reused (given the forwardField > > annotations > >

Re: Question about limitations of iterative algorithms

2015-11-04 Thread Vasiliki Kalavri
Hi Andre, On 4 November 2015 at 16:04, André Petermann < [email protected]> wrote: > Hi Fabian, > > thanks for your fast reply! > > I created a gist to explain the while-not-empty loop in more detail: > https://gist.github.com/p3et/9f6e56cf0b68213e3e2b > > It is an approach to c

Re: Question about limitations of iterative algorithms

2015-11-05 Thread Vasiliki Kalavri
o you think this could be worth to be contributed as an example within > the Flink documentation? The examples I found so far could not help > enlightening me how to use delta iteration for this kind of loop > (ST != WT, start from empty solution set, ...). > > Cheers, > Andre >

Re: [gelly] Spargel model rework

2015-11-05 Thread Vasiliki Kalavri
ed by the join with the in-memory > hash-table of the solution set and can be reused for the coGroup. > > Best, > Fabian > > 2015-10-30 18:38 GMT+01:00 Vasiliki Kalavri : > > > Hi Fabian, > > > > thanks so much for looking into this so quickly :-) > > >

Long cannot be cast to org.apache.flink.types.CopyableValue

2015-11-08 Thread Vasiliki Kalavri
Hello squirrels, I'm writing a few graph algorithms to test the performance of different iteration models and I am quite stuck with an error. While my sssp example works fine, I get the following in my connected components job (local execution inside eclipse): Exception in thread "main" org.apa

Re: Long cannot be cast to org.apache.flink.types.CopyableValue

2015-11-09 Thread Vasiliki Kalavri
field, > but the TypeExtraction thinks you return a NullValue. I would guess that > there are some unsafe generic casts in your code that emit a Long at a > place that declares a NullValue in its signature. > > On Sun, Nov 8, 2015 at 9:20 PM, Vasiliki Kalavri < > vasilikikala..

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc6)

2015-11-09 Thread Vasiliki Kalavri
I also think this is it :-) Apart from the tests in the doc, I ran a bunch of tests and the dataset examples on a 5-node cluster with Hadoop 2.7.1. +1 Cheers, -Vasia. On 9 November 2015 at 13:53, Maximilian Michels wrote: > +1 > > - Checked the source files for binaries > - Ran mvn clean verif

Re: Tagging Flink classes with InterfaceAudience and InterfaceStability

2015-11-10 Thread Vasiliki Kalavri
Hi Robert, thanks for bringing this up! I generally like the idea, but I wouldn't rush to annotate the Gelly classes yet. Gelly hasn't had that many users and I'm quite sure we'll find things to improve as it gets more exposure. TBH, I think it's quite unfair to force Gelly (also e.g. ML, Table)

Re: Tagging Flink classes with InterfaceAudience and InterfaceStability

2015-11-10 Thread Vasiliki Kalavri
nt in declaring certain APIs as frozen when we are not yet > confident they have converged. > > > > On Tue, Nov 10, 2015 at 8:39 PM, Vasiliki Kalavri < > [email protected] > > wrote: > > > Hi Robert, > > > > thanks for bringing this up! > > >

Re: [gelly] Spargel model rework

2015-11-10 Thread Vasiliki Kalavri
on the ratio of messages and solution set > updates). Also it does not require that the initial workset is sorted for > the first groupReduce. > > I would put it at the end. > > Cheers, Fabian > > 2015-11-05 17:19 GMT+01:00 Vasiliki Kalavri : > > > @Fabian >

Re: [gelly] Spargel model rework

2015-11-11 Thread Vasiliki Kalavri
te flag to indicate which of both types > is used. > It might actually make sense to add a generic Either type to the Java API > in general (similar to the Java Tuples, which resemble the Scala Tuples). > > Cheers, Fabian > > 2015-11-10 22:16 GMT+01:00 Vasiliki Kalavri : > >

Re: [VOTE] [RESULT] Release Apache Flink 0.10.0 (release-0.10.0-rc8)

2015-11-12 Thread Vasiliki Kalavri
\o/ \o/ \o/ Thank you Max! On Nov 13, 2015 2:23 AM, "Nick Dimiduk" wrote: > Woo hoo! > > On Thu, Nov 12, 2015 at 3:01 PM, Maximilian Michels > wrote: > > > Thanks for voting! The vote passes. > > > > The following votes have been cast: > > > > +1 votes: 7 > > > > Stephan > > Aljoscha > > Robert

Re: [DISCUSS] Release Flink 0.10.1 soon

2015-11-18 Thread Vasiliki Kalavri
Hey, I would also add FLINK-3012 and FLINK-3036 (both pending PRs). Thanks! -Vasia. On 18 November 2015 at 12:24, Robert Metzger wrote: > Hi, > > I was wondering whether we should release Flink 0.10.1 soon, as there are > some issues we've identified: > > (pending PRs) > - FLINK-3032: Flink do

Re: [DISCUSS] Release Flink 0.10.1 soon

2015-11-18 Thread Vasiliki Kalavri
set committer for > case > >> where checkpointing is deactivated > >> > >> On Wed, Nov 18, 2015 at 12:34 PM, Vasiliki Kalavri < > >> [email protected]> wrote: > >> > >>> Hey, > >>> > >>> I would also ad

Re: Apache Tinkerpop & Geode Integration?

2015-11-23 Thread Vasiliki Kalavri
Hi James, thank you for your e-mail and your interest in Flink :) I've recently taken a _quick_ look into Apache TinkerPop and I think it'd be very interesting to integrate with Flink/Gelly. Are you thinking about something like a Flink GraphComputer, similar to Giraph and Spark GraphComputer's?

Re: Either left() vs left(value)

2015-11-23 Thread Vasiliki Kalavri
Hey Gyula, I don't think dropping the method is a good idea. We need a way to retrieve left and right values, no? How about renaming to getLeft() / getRight()? -V. On 23 November 2015 at 09:55, Gyula Fóra wrote: > Hey guys, > > I know this should have been part of the PR discussion but it kind
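The naming question in this thread can be illustrated with a minimal, self-contained sketch. This is not Flink's actual Either class: the name EitherSketch, the nested Left/Right classes and the isLeft() helper are assumptions made only for illustration. It shows static factories left(value) / right(value) living next to accessors named getLeft() / getRight(), as suggested in the message above, so that the factory and the accessor no longer share a name.

```java
// Hypothetical sketch, not the Flink implementation.
public abstract class EitherSketch<L, R> {

    // Static factories: create a left or a right value.
    public static <L, R> EitherSketch<L, R> left(L value) {
        return new Left<>(value);
    }

    public static <L, R> EitherSketch<L, R> right(R value) {
        return new Right<>(value);
    }

    // Accessors, named getLeft()/getRight() so they cannot be confused
    // with the factories left(value)/right(value).
    public abstract L getLeft();

    public abstract R getRight();

    public boolean isLeft() {
        return this instanceof Left;
    }

    static final class Left<L, R> extends EitherSketch<L, R> {
        private final L value;

        private Left(L value) {
            this.value = value;
        }

        @Override
        public L getLeft() {
            return value;
        }

        @Override
        public R getRight() {
            throw new IllegalStateException("getRight() called on a Left");
        }
    }

    static final class Right<L, R> extends EitherSketch<L, R> {
        private final R value;

        private Right(R value) {
            this.value = value;
        }

        @Override
        public L getLeft() {
            throw new IllegalStateException("getLeft() called on a Right");
        }

        @Override
        public R getRight() {
            return value;
        }
    }

    public static void main(String[] args) {
        EitherSketch<String, Integer> message = EitherSketch.left("hello");
        EitherSketch<String, Integer> update = EitherSketch.right(42);
        System.out.println(message.isLeft() + " " + message.getLeft());
        System.out.println(update.isLeft() + " " + update.getRight());
    }
}
```

Whether the factories stay on the base class or move to the subclasses (as Left.of(...) / Right.of(...), mentioned later in the thread) is a separate design choice that this sketch does not settle.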

Re: Either left() vs left(value)

2015-11-23 Thread Vasiliki Kalavri
> Right classes, maybe directly expose them instead of the static method. (or > rename the static method) > > Gyula > > Vasiliki Kalavri ezt írta (időpont: 2015. nov. > 23., H, 20:14): > > > Hey Gyula, > > > > I don't think dropping the method is a good id

Re: Either left() vs left(value)

2015-11-23 Thread Vasiliki Kalavri
uld be pretty much like in scala) > > or we can add static methods like: Left.of(...), Right.of(...) which would > work exactly as it does now. > > And then we can live without the static methods in Either (Either would > become Abstract). > > Gyula > > Vasiliki Kala

Union a data stream with a product of itself

2015-11-24 Thread Vasiliki Kalavri
Hi squirrels, when porting the gelly streaming code from 0.9 to 0.10 today with Paris, we hit an exception in union: "*A DataStream cannot be unioned with itself*". The code raising this exception looks like this: stream.union(stream.map(...)). Taking a look into the union code, we see that it's

Re: Union a data stream with a product of itself

2015-11-25 Thread Vasiliki Kalavri
> stream.union(stream.map(..)) and things like this are fair operations. > > Also > > >> maybe stream.union(stream) should just give stream instead of an > error. > > >> > > >> Could someone comment on this who knows the reasoning behind the > current >

Re: Union a data stream with a product of itself

2015-11-25 Thread Vasiliki Kalavri
Here's the issue: https://issues.apache.org/jira/browse/FLINK-3080 -V. On 25 November 2015 at 14:38, Gyula Fóra wrote: > Yes, please > > Vasiliki Kalavri ezt írta (időpont: 2015. nov. > 25., Sze, 14:37): > > > So, do we all agree that the current behavior is not cor

Re: Apache Tinkerpop & Geode Integration?

2015-11-25 Thread Vasiliki Kalavri
n the TinkerPop dev list -- it would be great to have you part of the > conversation there too as we work on the integration: > >http://mail-archives.apache.org/mod_mbox/incubator-tinkerpop-dev/ > > Thanks, Vasia. > > - James > > > On Mon, Nov 23, 2015 at 10:28 A

Re: [ANNOUNCE] Flink 0.10.1 released

2015-11-27 Thread Vasiliki Kalavri
Thank you Robert ^^ On 27 November 2015 at 16:23, Till Rohrmann wrote: > Thanks Robert for being the release manager for 0.10.1 > > On Fri, Nov 27, 2015 at 4:21 PM, Maximilian Michels > wrote: > > > Great. We released that one fast. Thanks Robert. > > > > On Fri, Nov 27, 2015 at 3:27 PM, Robert

Either not NotSerializableException and InvalidTypesException

2015-11-28 Thread Vasiliki Kalavri
Hi squirrels, I have 2 problems with the new Either type and I could use your help to understand them. 1. I have a piece of code that looks like this: TypeInformation>> workSetTypeInfo = ... DataSet>> initialWorkSet = initialVertices.map(...).returns(workSetTypeInfo); This gives me the followin

Re: Either not NotSerializableException and InvalidTypesException

2015-11-30 Thread Vasiliki Kalavri
l > > > On Mon, Nov 30, 2015 at 11:03 AM, Aljoscha Krettek > wrote: > > > It seems there is an Either.Left stored somewhere in the Object. Could > > that be? > > > On 28 Nov 2015, at 20:18, Vasiliki Kalavri > > wrote: > > > > > > > > > org.apache.flink.util.InstantiationUtil.serializeObject(InstantiationUtil.java:307) > > > > >

Re: New Blog Post Draft

2015-12-09 Thread Vasiliki Kalavri
Thanks Matthias! This is a very nice blog post and reads easily. On 9 December 2015 at 19:21, Ufuk Celebi wrote: > Great post! Thanks! > > I have also made some comments in the commit. > > – Ufuk > > > On 09 Dec 2015, at 14:19, Maximilian Michels wrote: > > > > Hi Matthias, > > > > Thank you fo

Re: Apache Tinkerpop & Geode Integration?

2015-12-09 Thread Vasiliki Kalavri
-archives.apache.org/mod_mbox/incubator-tinkerpop-dev/201511.mbox/%3ccanva_a390l7g169r8sn+ej1-yfkbudlnd4td6atwnp0uza-...@mail.gmail.com%3E [3]: http://www.datastax.com/dev/blog/the-benefits-of-the-gremlin-graph-traversal-machine On 25 November 2015 at 16:54, Vasiliki Kalavri wrote: > Hi James, >

Re: Apache Tinkerpop & Geode Integration?

2015-12-14 Thread Vasiliki Kalavri
Ping squirrels! Any thoughts/opinions on this? On 9 December 2015 at 20:40, Vasiliki Kalavri wrote: > Hello squirrels, > > I have been discussing with the Apache Tinkerpop [1] community regarding > an integration with Flink/Gelly. > You can read our discussion in [2]. > >

Re: Apache Tinkerpop & Geode Integration?

2015-12-15 Thread Vasiliki Kalavri
ght the conversation over here, so I hear the opinions of the Flink community on this :) > Best, Fabian > -Vasia. > > > 2015-12-14 18:33 GMT+01:00 Vasiliki Kalavri : > > > Ping squirrels! Any thoughts/opinions on this? > > > > On 9 December 2015 at 20

Re: Apache Tinkerpop & Geode Integration?

2015-12-16 Thread Vasiliki Kalavri
collect again some of those ideas. > > Stephan > > > > On Tue, Dec 15, 2015 at 5:35 PM, Vasiliki Kalavri < > [email protected] > > wrote: > > > Hi Dr. Fabian, > > > > thanks a lot for your answer! > > > > > > On 15 December

Re: 2015: A Year in Review for Apache Flink

2015-12-31 Thread Vasiliki Kalavri
Happy new year everyone! Looking forward to all the great things the Apache Flink community will accomplish in 2016 :)) Greetings from snowy Greece! -Vasia. On 31 December 2015 at 04:22, Henry Saputra wrote: > Dear All, > > It is almost end of 2015 and it has been busy and great year for Apache

Re: [gelly] Spargel model rework

2016-01-04 Thread Vasiliki Kalavri
wrote: > > > "Either" and "Optional" types are quite useful. > > > > Let's add them to the core Java API. > > > > On Wed, Nov 11, 2015 at 10:00 AM, Vasiliki Kalavri < > > [email protected]> wrote: > > > >>

Re: [gelly] Spargel model rework

2016-01-06 Thread Vasiliki Kalavri
o > think that it would make sense to do it. > > On 04 Jan 2016, at 19:48, Vasiliki Kalavri > wrote: > > > > Hello squirrels and happy new year! > > > > I'm reviving this thread to share some results and discuss next steps. > > > > Using t

[gelly] partition-centric iterations

2016-01-06 Thread Vasiliki Kalavri
Hi squirrels, here's some more Gelly iteration love for you ;) Together with Kien and Nikola (students at KTH, cc-ed), we looked into partition-centric iterations for Gelly. The idea of the partition-centric model is to expose the graph partition structure to the user (not just a single vertex, b

Re: [gelly] partition-centric iterations

2016-01-07 Thread Vasiliki Kalavri
> > Again, nice work! > > Best, > Martin > Cheers, -Vasia. [1]: https://drive.google.com/file/d/0BzQJrI2eGlyYSHZJTHFObmgxTXM/view?usp=sharing > > On 06.01.2016 23:29, Vasiliki Kalavri wrote: > > Hi squirrels, > > > > here's some more Gelly iter

Re: [DISCUSS] Remove Combinable Annotation from DataSet API

2016-01-13 Thread Vasiliki Kalavri
Hi, +1 for removing the Combinable annotation. Approach 1 sounds like the best option to me. On 13 January 2016 at 14:11, Till Rohrmann wrote: > Hi Fabian, > > thanks for bringing this issue up. I agree with you that it would be nice > to remove the Combinable annotation if it is not really

Re: Old Flink Graph Repository

2016-01-15 Thread Vasiliki Kalavri
Hey, we can delete the content, but I wouldn't remove the repository. In the early Gelly days we shared the link of this repository in several places (e.g. my FOSDEM'14 talk and few other presentations), so people might still land there. I believe it would be nice if we could keep it empty with a

Re: Old Flink Graph Repository

2016-01-15 Thread Vasiliki Kalavri
I believe I do! On 15 January 2016 at 16:20, Stephan Ewen wrote: > Sounds good. Do you have push rights there to do that? > > On Fri, Jan 15, 2016 at 3:51 PM, Vasiliki Kalavri < > [email protected] > > wrote: > > > Hey, > > > > we can del

Re: Old Flink Graph Repository

2016-01-15 Thread Vasiliki Kalavri
Do we want to keep the history or shall I force-push and wipe everything? On 15 January 2016 at 18:02, Vasiliki Kalavri wrote: > I believe I do! > > On 15 January 2016 at 16:20, Stephan Ewen wrote: > >> Sounds good. Do you have push rights there to do that? >> >>

Re: [ANNOUNCE] Chengxiang Li added as committer

2016-01-19 Thread Vasiliki Kalavri
Congratulations! Welcome Chengxiang Li! On 19 January 2016 at 11:02, Fabian Hueske wrote: > Hi everybody, > > I'd like to announce that Chengxiang Li accepted the PMC's offer to become > a committer of the Apache Flink project. > > Please join me in welcoming Chengxiang Li! > > Best, Fabian >

Re: User Feedback

2016-02-09 Thread Vasiliki Kalavri
Hi Martin, thank you for the feedback. Let me try to answer some of your concerns. On 9 February 2016 at 15:35, Martin Neumann wrote: > During this year's FOSDEM Martin Junghanns and I sat together and gathered > some feedback for the Flink project. It is based on our personal experience > as w

Re: Affinity Propagation

2016-02-17 Thread Vasiliki Kalavri
Hi Josep, welcome to the Flink dev list! There exists a JIRA issue for Affinity Propagation that is currently unassigned (FLINK-1707 [1]). You can see in the comments that someone started working on it, but they dropped it. As far as I know, nobody is working on this right now, so let me know

Re: [VOTE] Release Apache Flink 1.0.0 (RC1)

2016-02-25 Thread Vasiliki Kalavri
Hi squirrels, here's my testing outcome so far: - Examples: Ran all examples locally and on a cluster, both from CLI and web submission tool Issues: 1. PageRank example doesn't run without arguments anymore. I have a fix together with some doc fixes. - CLI: tested locally and on cluster Issues:

Re: Inconvenient (unforeseen?) consequences of PR #1683

2016-02-27 Thread Vasiliki Kalavri
Hi squirrels, sorry I've been slow to respond to this, but I'm now testing RC1 and I'm a bit confused by this change. So far, the easiest way to run a Gelly example on a cluster was to package and submit the Gelly jar. Now, since the flink-gelly project doesn't contain the examples anymore, I tr

Re: Inconvenient (unforeseen?) consequences of PR #1683

2016-02-29 Thread Vasiliki Kalavri
gelly dependency. If you just > build flink-gelly-examples, then it won't contain flink-gelly because it is > not a fat jar. You either have to install flink-gelly on your cluster or > package it in the final user jar. > > Cheers, > Till > > On Sat, Feb 27, 2016 at 7:

Re: Inconvenient (unforeseen?) consequences of PR #1683

2016-02-29 Thread Vasiliki Kalavri
: > Good catch :-) I mean we could also change the behaviour to include > flink-gelly in the flink-gelly-examples module. > > On Mon, Feb 29, 2016 at 12:13 PM, Vasiliki Kalavri < > [email protected]> wrote: > > > Thanks Till! Then, we'd better update the doc

Re: Inconvenient (unforeseen?) consequences of PR #1683

2016-02-29 Thread Vasiliki Kalavri
> On Mon, Feb 29, 2016 at 12:24 PM, Vasiliki Kalavri < > [email protected]> wrote: > > > In my opinion, the fat jar solution is easier than having to copy the > Gelly > > jar to all task managers. > > I would be in favor of including flink-gelly in the flink-example

Re: [VOTE] Release Apache Flink 1.0.0 (RC3)

2016-03-01 Thread Vasiliki Kalavri
Thank you Robert! +1 for the google doc. On 1 March 2016 at 09:10, Aljoscha Krettek wrote: > Very good, I’ll test the savepoints again and also try to hammer some of > the recent API fixes. > > Could we also have the usual google doc for keeping track of the basic > checks? > > On 01 Mar 2016, a

Re: Release 1.0 Migration Guide

2016-03-03 Thread Vasiliki Kalavri
Thanks Ufuk! I'll add the Gelly-related breaking changes. -Vasia. On 3 March 2016 at 10:55, Márton Balassi wrote: > Thanks for initiating this Ufuk. Updated the streaming hashing mention - > whether it is api breaking is questionable, so I would place it last in the > list. But definitely good

Re: 答复: Effort to add SQL / StreamSQL to Flink

2016-03-19 Thread Vasiliki Kalavri
Hello everyone, We are happy to announce that the "tableOnCalcite" branch is finally ready to be merged. It essentially provides the existing functionality of the Table API, but now the translation happens through Apache Calcite. You can find the changes rebased on top of the current master in [1]

Re: 答复: Effort to add SQL / StreamSQL to Flink

2016-03-19 Thread Vasiliki Kalavri
ing towards "Task 6". > I think good support for Stream SQL is a very interesting new feature for > Flink. > > [1] > > https://docs.google.com/document/d/1TLayJNOTBle_-m1rQfgA6Ouj1oYsfqRjPcp1h2TVqdI/edit#heading=h.28dvisn56su0 > > On Wed, Mar 16, 2016 at 6:17 PM, Vas

Re: 答复: Effort to add SQL / StreamSQL to Flink

2016-03-19 Thread Vasiliki Kalavri
h and improve the > exceptions for invalid joins before merging. > Will open a PR later today. > > 2016-03-16 21:17 GMT+01:00 Vasiliki Kalavri : > > > Yes, the current state corresponds to Task 1. PR #1770 corresponds to > Task > > 5. Task 6 should come right after :) > &

Re: Next steps: SQL / StreamSQL support

2016-03-21 Thread Vasiliki Kalavri
Thanks for the nice summary and for updating the design documents Fabian! As we proceed with the upcoming tasks, we should also go through existing JIRAs and update them, too. There are some old issues referring to SQL and adding external data sources, but these were created before the decision of

Re: 答复: 答复: Effort to add SQL / StreamSQL to Flink

2016-03-29 Thread Vasiliki Kalavri
> Cool stuff! > > SQL coming up next? ;-) > > > On Tue, Mar 29, 2016 at 1:39 PM, Maximilian Michels > wrote: > > > Yeah! I'm a little late to the party but exciting stuff! :) > > > > On Fri, Mar 18, 2016 at 3:15 PM, Vasiliki Kalavri < > > vasilikika

Re: [DISCUSS] Graph algorithms for vertex and edge degree

2016-04-22 Thread Vasiliki Kalavri
Hi all, I asked Greg to start a discussion here about FLINK-3771 and FLINK-3772, to make sure we're all on the same page about future Gelly development. About a year ago we created the Gelly roadmap [1]. Many of these items have been implemented and others were researched and either developed ext

Re: [DISCUSS] Methods for translating Graphs

2016-04-26 Thread Vasiliki Kalavri
Thanks for the input Fabian! I also think this is a valuable and lightweight addition. I will add specific comments on the PR :) -Vasia. On 25 April 2016 at 14:30, Fabian Hueske wrote: > Hi Greg, > > sorry for the late reply. > I am not super familiar with Gelly, but the use cases you describe

Re: [DISCUSS] Graph algorithms for vertex and edge degree

2016-04-26 Thread Vasiliki Kalavri
> > with the current methods? > > Could the current methods be replaced by the more efficient > implementations > > or would there be two methods which look very similar and behave almost > the > > same? > > > > Best, Fabian > > > > > >

Updated Gelly Roadmap

2016-04-26 Thread Vasiliki Kalavri
Hi all, as promised, I have updated the Gelly roadmap [1]. Below, I am describing and reasoning about the changes I made. Please, let me know whether you agree and if you have any other ideas for further improvements and feature additions. *1. Operators for highly skewed graphs*: I have removed t

Re: Performance and accuracy of Flink iterations

2016-05-17 Thread Vasiliki Kalavri
Hi Greg, I think there is confusion between what delta means in the "delta iteration operator" of Flink and the "delta approximate implementation" of an algorithm, such as in PageRank. Assuming that we have a graph with a set of vertices and an iterative fixpoint algorithm that updates the vertex

Re: ITCases in the Table API

2016-05-18 Thread Vasiliki Kalavri
Hey Stephan, thanks for bringing this up! We discussed this situation with Fabian a while ago and I saw that he has now updated FLINK-3656 regarding this. If nobody picks this up sooner, I can help with reworking the tests next week. Cheers, -V. On 18 May 2016 at 10:23, Stephan Ewen wrote: > H

Re: Preparing Table API & SQL for Flink 1.1.0

2016-05-20 Thread Vasiliki Kalavri
Hey all, @Fabian: thanks for compiling the list of issues and trying out TPC-H. I think it would be nice to include the first 5 from your list in 1.1.0. What about FLINK-3656 (re-working the tests)? Do we want to do this before the release as well? Great to see Timo willing to coordinate and Yijie

Re: Blogpost on Flink's SQL support

2016-05-24 Thread Vasiliki Kalavri
Great post :)) Thanks Fabian! On 24 May 2016 at 17:04, Fabian Hueske wrote: > Thanks Henry! The post is out :-) > > 2016-05-24 16:51 GMT+02:00 Henry Saputra : > > > Awesome! > > Thanks for the great post, Fabian > > > > - Henry > > > > On Tue, May 24, 2016 at 2:36 AM, Fabian Hueske > wrote: > >

Re: Hotfixes on the master

2016-05-28 Thread Vasiliki Kalavri
Hi all, in principle I agree with Max. I personally avoid hotfixes and always open a PR, even for javadoc improvements. I believe the main problem is that we don't have a clear definition of what constitutes a "hotfix". Ideally, even cosmetic changes and documentation should be reviewed; I've see

Re: Iteration Intermediate Output

2016-05-28 Thread Vasiliki Kalavri
Hey, it would be great to add this feature indeed! Thanks for bringing it up Greg :) Would the best way be to extend the iteration operators to support intermediate outputs or revisit the idea of caching intermediate results and thus allow efficient for-loop iterations? -Vasia. On 26 May 2016 at

Re: [PROPOSAL] Structure the Flink Open Source Development

2016-06-01 Thread Vasiliki Kalavri
Hi, we could go for something like "sponsor" or "champion" :) I'm fine with the proposal. Good to see more than 1 person for both Gelly and Table API. cheers, -V. On 1 June 2016 at 05:46, Tzu-Li (Gordon) Tai wrote: > I'd like to be added to the Streaming Connectors component (already edited >

Re: How to run table api in 1.1-SNAPSHOT

2016-06-05 Thread Vasiliki Kalavri
Hi Cody, could it be you're getting this error because you've named a SQL table column "count"? Can you try renaming it to "myCount" or something else? I think the parser recognizes the aggregate function instead :) Cheers, -V. On Jun 2, 2016 1:56 PM, "Cody Innowhere" wrote: > Hi guys, > I'm tr

Re: sampling function

2016-07-11 Thread Vasiliki Kalavri
Hi Do, Paris and Martha worked on sampling techniques for data streams on Flink last year. If you want to implement your own samplers, you might find Martha's master thesis helpful [1]. -Vasia. [1]: http://kth.diva-portal.org/smash/get/diva2:910695/FULLTEXT01.pdf On 11 July 2016 at 11:31, Kosta

[DISCUSS] Gelly iteration abstractions

2015-02-19 Thread Vasiliki Kalavri
Hello beautiful Flink people, during the past few days, Andra and I have been discussing how to extend Gelly's iteration methods. Alexander's course (and his awesome students) has made it obvious that vertex-centric iterations are not the best fit for algorithms which don't follow the commo

Re: [DISCUSS] Should we improve the structure of our Gelly test suite?

2015-02-22 Thread Vasiliki Kalavri
Hi Andra, not every test has to be a MultipleProgramsTestBase. We're just using this for our convenience, it's not a rule :-) If the case you want to test doesn't fit the format of this test, you can just make a separate standalone test. I found some examples for testing exceptions in Flink operat

Re: [DISCUSS] Gelly iteration abstractions

2015-02-22 Thread Vasiliki Kalavri
> (parallel Bellman-Ford) from Section 4.2 in [1]. > > > > However, it seems that the issues encountered by Andra are related to the > > implementation of Parallel Boruvka (Section 3.2 in [2]). Is that correct? > > > > Regards, > > A. > > > > [1] ht

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Vasiliki Kalavri
Sun, Feb 22, 2015 at 4:53 PM, Vasiliki Kalavri < > [email protected] > > wrote: > > > Hi, > > > > yes, I was referring to the parallel Boruvka algorithm. There are several > > ways to implement this one in Flink and I believe that the one described

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Vasiliki Kalavri
may not work well as "closed-loop" iterations. > > Is it possible to express those as for-loop iterations? > > On Mon, Feb 23, 2015 at 1:03 PM, Vasiliki Kalavri < > [email protected] > > wrote: > > > Hi Stephan, > > > > yes, this wo

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Vasiliki Kalavri
on plans. > > On Mon, Feb 23, 2015 at 2:44 PM, Vasiliki Kalavri < > [email protected] > > wrote: > > > for-loop iterations could cover some cases, I guess, when the number of > > iterations is known beforehand. > > Are there currently any restrictions on wh

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Vasiliki Kalavri
b 23, 2015 at 4:15 PM, Vasiliki Kalavri < > [email protected] > > wrote: > > > I see that's cool :-) > > So, what is the advantage of closed-loop versus for-loop iterations? > > Custom convergence criteria / aggregators and more efficient execution

k-means example behavior

2015-02-24 Thread Vasiliki Kalavri
Hello everyone, I'm using the k-means example as a basis for a custom implementation and I noticed the following behavior: If during an iteration no point is assigned to a particular cluster, this cluster will then "disappear". This happens because SelectNearestCenter() outputs tuples, (where centr
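The behavior described in this message can be reproduced with a small plain-Java sketch that does not use the Flink API. The class and method names (KMeansStep, step, nearest) are hypothetical, the points are one-dimensional for brevity, and the sketch assumes that new centroids are computed by averaging only the points assigned to each cluster, which is what makes an empty cluster drop out of the result.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of one k-means iteration on 1-D points.
public class KMeansStep {

    // Index of the centroid closest to point p.
    static int nearest(double p, List<Double> centroids) {
        int best = 0;
        for (int i = 1; i < centroids.size(); i++) {
            if (Math.abs(p - centroids.get(i)) < Math.abs(p - centroids.get(best))) {
                best = i;
            }
        }
        return best;
    }

    // Assign every point to its nearest centroid, then average each group.
    // A centroid that received no points forms no group, so it is missing
    // from the returned list.
    static List<Double> step(List<Double> points, List<Double> centroids) {
        Map<Integer, double[]> sums = new HashMap<>(); // id -> {sum, count}
        for (double p : points) {
            double[] acc = sums.computeIfAbsent(nearest(p, centroids), k -> new double[2]);
            acc[0] += p;
            acc[1] += 1;
        }
        List<Double> next = new ArrayList<>();
        for (int id = 0; id < centroids.size(); id++) {
            double[] acc = sums.get(id);
            if (acc != null) {
                next.add(acc[0] / acc[1]);
            }
        }
        return next;
    }

    public static void main(String[] args) {
        List<Double> points = Arrays.asList(1.0, 2.0, 3.0, 10.0, 11.0);
        List<Double> centroids = Arrays.asList(2.0, 10.0, 100.0);
        // Three centroids go in, but the one at 100.0 attracts no points.
        System.out.println(step(points, centroids)); // prints [2.0, 10.5]
    }
}
```

One way to keep the number of clusters stable, under the same assumptions, would be to carry the old centroid forward whenever its group is empty; whether that is desirable depends on the application.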

Re: k-means example behavior

2015-02-25 Thread Vasiliki Kalavri
t points then it > > has no centroid. If it has no centroid no points could ever be > > assigned to it again in the future since there is no way of > > calculating a distance. > > > > On Tue, Feb 24, 2015 at 6:57 PM, Vasiliki Kalavri > > wrote: > > > Hello e

Re: gelli graph algorithm

2015-02-26 Thread Vasiliki Kalavri
Hi Martin, welcome back :-) I'll try to merge the documentation PR tonight. Gelly is in the flink-staging package and most of the Gelly methods are in the Graph class and have javadocs that describe their functionality. Regarding your specific tasks, you can easily get the degree distribution us

Re: Queries regarding RDFs with Flink

2015-03-03 Thread Vasiliki Kalavri
Hi Flavio, if you want to use Gelly to model your data as a graph, you can load your Tuple3s as Edges. This will result in "http://test/John", "Person", "Frank", etc to be vertices and "type", "name", "knows" to be edge values. In the first case, you can use filterOnEdges() to get the subgraph wi

Re: [MultipleProgramsTestBase][Cluster vs. Collection mode] Inconsistent Behavior

2015-03-04 Thread Vasiliki Kalavri
Hi Andra, judging from the output, it seems that all 3 supersteps are executed in the second case as well, but getSuperstepNumber() is returning the wrong superstep number. I confirmed that this is also the case in VertexCentricConnectedComponentsITCase and SpargelConnectedComponentsITCase, i.e. t

Re: [MultipleProgramsTestBase][Cluster vs. Collection mode] Inconsistent Behavior

2015-03-04 Thread Vasiliki Kalavri
p number accordingly (like it does with the aggregators) or shall we assign the new context in every superstep? Thanks! -Vasia. On 4 March 2015 at 17:46, Vasiliki Kalavri wrote: > Hi Andra, > > judging from the output, it seems that all 3 supersteps are executed in > the second

Re: [MultipleProgramsTestBase][Cluster vs. Collection mode] Inconsistent Behavior

2015-03-04 Thread Vasiliki Kalavri
ext should be the same > across all iterations. > > It only needs to support returning a different superstep number in each > superstep. > > On Wed, Mar 4, 2015 at 7:13 PM, Vasiliki Kalavri < > [email protected]> > wrote: > > > Hi, > > > > I ha

Re: [gelly] Tests fail, but build succeeds

2015-03-10 Thread Vasiliki Kalavri
vironment.createRemoteEnvironment( > > "localhost", cluster.getJobManagerRPCPort()); > > instead of getExecutionEnvironment()? > > > > I get the same error in both cases. > > > > Thank you! > > Andra > > > > On Tue

Re: [DISCUSS] Deprecate Spargel API for 0.9

2015-03-11 Thread Vasiliki Kalavri
ng people to Gelly > > > > On Wed, Mar 11, 2015 at 10:30 AM, Ufuk Celebi wrote: > > > > > > > > On 10 Mar 2015, at 22:02, Vasiliki Kalavri > > > wrote: > > > > > > > Hi all, > > > > > > > > I would like
