[jira] [Created] (FLINK-1744) Change the reference of slaves to workers to match the description of the system

2015-03-18 Thread Henry Saputra (JIRA)
Henry Saputra created FLINK-1744: Summary: Change the reference of slaves to workers to match the description of the system Key: FLINK-1744 URL: https://issues.apache.org/jira/browse/FLINK-1744 Projec

[jira] [Created] (FLINK-1743) Add multinomial logistic regression to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1743: Summary: Add multinomial logistic regression to machine learning library Key: FLINK-1743 URL: https://issues.apache.org/jira/browse/FLINK-1743 Project: Flink

[jira] [Created] (FLINK-1742) Sample data points for MultipleLinearRegression to support proper SGD

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1742: Summary: Sample data points for MultipleLinearRegression to support proper SGD Key: FLINK-1742 URL: https://issues.apache.org/jira/browse/FLINK-1742 Project: Flink

[DISCUSS] Submitting small PRs rather than massive ones

2015-03-18 Thread Henry Saputra
Hi All, Recently there have been some PRs with massive changes which include multiple JIRA tickets. It is getting tougher to review and also to back port changes if needed. To help reviewers to help review the changes lets try to submit small but often PRs to make it easier to review. Not to men

[jira] [Created] (FLINK-1741) Add Jaccard Similarity Metric Example

2015-03-18 Thread Andra Lungu (JIRA)
Andra Lungu created FLINK-1741: -- Summary: Add Jaccard Similarity Metric Example Key: FLINK-1741 URL: https://issues.apache.org/jira/browse/FLINK-1741 Project: Flink Issue Type: Task Co

[jira] [Created] (FLINK-1740) StreamExecutionEnvironment is not respecting the setNumberOfExecutionRetries()

2015-03-18 Thread JIRA
Márton Balassi created FLINK-1740: - Summary: StreamExecutionEnvironment is not respecting the setNumberOfExecutionRetries() Key: FLINK-1740 URL: https://issues.apache.org/jira/browse/FLINK-1740 Projec

[jira] [Created] (FLINK-1739) Fix the bug of JobManager and TaskManager Port Check.

2015-03-18 Thread Sibao Hong (JIRA)
Sibao Hong created FLINK-1739: - Summary: Fix the bug of JobManager and TaskManager Port Check. Key: FLINK-1739 URL: https://issues.apache.org/jira/browse/FLINK-1739 Project: Flink Issue Type: Bug

Re: Marking affected and fixed versions in JIRA

2015-03-18 Thread Henry Saputra
+1 Will do this more diligently On Wed, Mar 18, 2015 at 1:34 AM, Robert Metzger wrote: > I would appreciate if everyone who is merging pull requests is properly > setting the "fix version" in JIRA. > > So in most cases, the "fix version" is the next major release, currently > 0.9. > If we're not

Re: [DISCUSS] Issues with heterogeneity of the code

2015-03-18 Thread Henry Saputra
+1 for keeping the tabs vs space for now. When I start working on Flink I was itching to move to spaces indentation since that makes reader consistent. But that was the choice by preference made early on and I think we could keep it and add as exception in Java code style. Scala, on the other han

Overview of Memory Management in Flink

2015-03-18 Thread Stephan Ewen
Hi all! Here is a first version of the documentation how memory management works in Flink. I hope it sheds some light on the magic we do. Let me know if certain sections are still confusing. https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=53741525 Greetings, Stephan

Re: [DISCUSS] Issues with heterogeneity of the code

2015-03-18 Thread Robert Metzger
I'm against changing the indentation, for the same reasons as Stephan listed. In my opinion, the codebase has grown too large to "just" switch the indentation or the entire code style (to the google style or whatever). We have 235870 LOC of Java and 24173 LOC of Scala. Therefore, I'm proposing to

[jira] [Created] (FLINK-1738) Add mean centering transformation to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1738: Summary: Add mean centering transformation to machine learning library Key: FLINK-1738 URL: https://issues.apache.org/jira/browse/FLINK-1738 Project: Flink

Re: [DISCUSS] Issues with heterogeneity of the code

2015-03-18 Thread Stephan Ewen
I agree, if we set p a new project, we should use space indentation. Should we really refactor 300k lines of code? Would be massive. Also: The history would basically show a single committer for all code. Git blame (for error tracing) would become useless. On Wed, Mar 18, 2015 at 3:49 PM, Alexan

[jira] [Created] (FLINK-1737) Add statistical whitening transformation to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1737: Summary: Add statistical whitening transformation to machine learning library Key: FLINK-1737 URL: https://issues.apache.org/jira/browse/FLINK-1737 Project: Flink

[jira] [Created] (FLINK-1736) Add CountVectorizer to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1736: Summary: Add CountVectorizer to machine learning library Key: FLINK-1736 URL: https://issues.apache.org/jira/browse/FLINK-1736 Project: Flink Issue Type: Imp

[jira] [Created] (FLINK-1735) Add FeatureHasher to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1735: Summary: Add FeatureHasher to machine learning library Key: FLINK-1735 URL: https://issues.apache.org/jira/browse/FLINK-1735 Project: Flink Issue Type: Impro

Re: [DISCUSS] Issues with heterogeneity of the code

2015-03-18 Thread Alexander Alexandrov
Massive +1 for switching to space indention. Makes the code render consistently across various viewers (e.g. Github UI, Apache infrastructure, IDEs). 2015-03-18 1:29 GMT+01:00 Fabian Hueske : > Touching every file of the code would also be a good opportunity to switch > from tab to space indentio

[jira] [Created] (FLINK-1734) Insufficient memory allocation in Gelly

2015-03-18 Thread Robert Waury (JIRA)
Robert Waury created FLINK-1734: --- Summary: Insufficient memory allocation in Gelly Key: FLINK-1734 URL: https://issues.apache.org/jira/browse/FLINK-1734 Project: Flink Issue Type: Bug

[jira] [Created] (FLINK-1733) Add PCA to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1733: Summary: Add PCA to machine learning library Key: FLINK-1733 URL: https://issues.apache.org/jira/browse/FLINK-1733 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-1732) Add Gaussian mixture model to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1732: Summary: Add Gaussian mixture model to machine learning library Key: FLINK-1732 URL: https://issues.apache.org/jira/browse/FLINK-1732 Project: Flink Issue Ty

[jira] [Created] (FLINK-1731) Add kMeans clustering algorithm to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1731: Summary: Add kMeans clustering algorithm to machine learning library Key: FLINK-1731 URL: https://issues.apache.org/jira/browse/FLINK-1731 Project: Flink Is

[jira] [Created] (FLINK-1730) Add a FlinkTools.persist style method to the Data Set.

2015-03-18 Thread Stephan Ewen (JIRA)
Stephan Ewen created FLINK-1730: --- Summary: Add a FlinkTools.persist style method to the Data Set. Key: FLINK-1730 URL: https://issues.apache.org/jira/browse/FLINK-1730 Project: Flink Issue Type

[jira] [Created] (FLINK-1729) Assess performance of classification algorithms

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1729: Summary: Assess performance of classification algorithms Key: FLINK-1729 URL: https://issues.apache.org/jira/browse/FLINK-1729 Project: Flink Issue Type: Imp

Re: Marking affected and fixed versions in JIRA

2015-03-18 Thread Vasiliki Kalavri
Thanks for this Robert! I updated the gelly-related closed issues. BTW, what's the difference between closed and resolved? Any case where we should use one over the other? -Vasia. On 18 March 2015 at 10:34, Robert Metzger wrote: > I would appreciate if everyone who is merging pull requests is p

[jira] [Created] (FLINK-1728) Add random forest ensemble method to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1728: Summary: Add random forest ensemble method to machine learning library Key: FLINK-1728 URL: https://issues.apache.org/jira/browse/FLINK-1728 Project: Flink

[jira] [Created] (FLINK-1727) Add decision tree to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1727: Summary: Add decision tree to machine learning library Key: FLINK-1727 URL: https://issues.apache.org/jira/browse/FLINK-1727 Project: Flink Issue Type: Impro

[jira] [Created] (FLINK-1726) Add Community Detection Library and Example

2015-03-18 Thread Andra Lungu (JIRA)
Andra Lungu created FLINK-1726: -- Summary: Add Community Detection Library and Example Key: FLINK-1726 URL: https://issues.apache.org/jira/browse/FLINK-1726 Project: Flink Issue Type: Task

[jira] [Created] (FLINK-1725) New Partitioner for better load balancing for skewed data

2015-03-18 Thread Anis Nasir (JIRA)
Anis Nasir created FLINK-1725: - Summary: New Partitioner for better load balancing for skewed data Key: FLINK-1725 URL: https://issues.apache.org/jira/browse/FLINK-1725 Project: Flink Issue Type:

Re: Website documentation minor bug

2015-03-18 Thread Hermann Gábor
Great! Thanks Max! On Wed, Mar 18, 2015 at 11:34 AM Maximilian Michels wrote: > Alright, changed it for the master. I think it makes the docs much more > pleasant to read. > > http://ci.apache.org/projects/flink/flink-docs-master/ > > On Wed, Mar 18, 2015 at 9:26 AM, Hermann Gábor > wrote: > >

[jira] [Created] (FLINK-1724) TestingCluster uses local communication with multiple task managers

2015-03-18 Thread Ufuk Celebi (JIRA)
Ufuk Celebi created FLINK-1724: -- Summary: TestingCluster uses local communication with multiple task managers Key: FLINK-1724 URL: https://issues.apache.org/jira/browse/FLINK-1724 Project: Flink

[jira] [Created] (FLINK-1723) Add cross validation for parameter selection and validation

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1723: Summary: Add cross validation for parameter selection and validation Key: FLINK-1723 URL: https://issues.apache.org/jira/browse/FLINK-1723 Project: Flink Is

[jira] [Created] (FLINK-1722) Streaming not respecting FinalizeOnMaster for output formats

2015-03-18 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-1722: - Summary: Streaming not respecting FinalizeOnMaster for output formats Key: FLINK-1722 URL: https://issues.apache.org/jira/browse/FLINK-1722 Project: Flink

Re: How to test including ITCase using maven?

2015-03-18 Thread Chiwan Park
Thanks @Andra, @Stephan. I will try it. Regards. Chiwan Park (Sent with iPhone) > On Mar 18, 2015, at 7:33 PM, Andra Lungu wrote: > > The way I do it is mvn -e test :) > > On Wed, Mar 18, 2015 at 11:21 AM, Chiwan Park wrote: > >> Hello. >> I have a question about test using maven. >> >> I

Re: How to test including ITCase using maven?

2015-03-18 Thread Andra Lungu
The way I do it is mvn -e test :) On Wed, Mar 18, 2015 at 11:21 AM, Chiwan Park wrote: > Hello. > I have a question about test using maven. > > I tested with `mvn -pl flink-tests test` command to test flink-tests > module. I got followed execution logs. (I removed some unnecessary logs.) > > [IN

Re: How to test including ITCase using maven?

2015-03-18 Thread Stephan Ewen
Hi! ITCases (Integration Test cases) are executed in the "verify" phase. call "mvn clean verify", then you will see it. Stephan On Wed, Mar 18, 2015 at 11:21 AM, Chiwan Park wrote: > Hello. > I have a question about test using maven. > > I tested with `mvn -pl flink-tests test` command to te

Re: Website documentation minor bug

2015-03-18 Thread Maximilian Michels
Alright, changed it for the master. I think it makes the docs much more pleasant to read. http://ci.apache.org/projects/flink/flink-docs-master/ On Wed, Mar 18, 2015 at 9:26 AM, Hermann Gábor wrote: > +1 for the proposal! > > On Tue, Mar 17, 2015 at 4:35 PM Aljoscha Krettek > wrote: > > > +1 >

How to test including ITCase using maven?

2015-03-18 Thread Chiwan Park
Hello. I have a question about test using maven. I tested with `mvn -pl flink-tests test` command to test flink-tests module. I got followed execution logs. (I removed some unnecessary logs.) [INFO] Scanning for projects... [INFO]

[jira] [Created] (FLINK-1721) Flakey Yarn Tests

2015-03-18 Thread Stephan Ewen (JIRA)
Stephan Ewen created FLINK-1721: --- Summary: Flakey Yarn Tests Key: FLINK-1721 URL: https://issues.apache.org/jira/browse/FLINK-1721 Project: Flink Issue Type: Bug Components: YARN Clie

[jira] [Created] (FLINK-1720) Integrate ScalaDoc in Scala sources into overall JavaDoc

2015-03-18 Thread Aljoscha Krettek (JIRA)
Aljoscha Krettek created FLINK-1720: --- Summary: Integrate ScalaDoc in Scala sources into overall JavaDoc Key: FLINK-1720 URL: https://issues.apache.org/jira/browse/FLINK-1720 Project: Flink

Re: A typo in a documentation

2015-03-18 Thread Till Rohrmann
Good catch Rui Zhu. Thanks a lot, I'll fix it. On Wed, Mar 18, 2015 at 2:21 AM, Rui Zhu wrote: > Hello, > > I just found a typo in the document of Cluster Setup. In the HDFS Setup > section of Cluster Setup, the command of starting HDFS has a typo: when we > go into the hadoop-* directory, the s

[jira] [Created] (FLINK-1719) Add naive Bayes classification algorithm to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1719: Summary: Add naive Bayes classification algorithm to machine learning library Key: FLINK-1719 URL: https://issues.apache.org/jira/browse/FLINK-1719 Project: Flink

[jira] [Created] (FLINK-1718) Add sparse vector and sparse matrix types to machine learning library

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1718: Summary: Add sparse vector and sparse matrix types to machine learning library Key: FLINK-1718 URL: https://issues.apache.org/jira/browse/FLINK-1718 Project: Flink

[jira] [Created] (FLINK-1717) Add support to read libSVM and SVMLight input files

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1717: Summary: Add support to read libSVM and SVMLight input files Key: FLINK-1717 URL: https://issues.apache.org/jira/browse/FLINK-1717 Project: Flink Issue Type:

[jira] [Created] (FLINK-1716) Add CoCoA algorithm to flink-ml

2015-03-18 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1716: Summary: Add CoCoA algorithm to flink-ml Key: FLINK-1716 URL: https://issues.apache.org/jira/browse/FLINK-1716 Project: Flink Issue Type: Improvement

Re: Marking affected and fixed versions in JIRA

2015-03-18 Thread Robert Metzger
I would appreciate if everyone who is merging pull requests is properly setting the "fix version" in JIRA. So in most cases, the "fix version" is the next major release, currently 0.9. If we're not setting this, the issue will not appear in the changelog of the release. Also, I think that users ma

Re: Website documentation minor bug

2015-03-18 Thread Hermann Gábor
+1 for the proposal! On Tue, Mar 17, 2015 at 4:35 PM Aljoscha Krettek wrote: > +1 > On Mar 17, 2015 12:28 PM, "Maximilian Michels" wrote: > > > Another proposal in addition to the recent changes: How about we remove > the > > "fixed" attribute from the navigation of the docs? Then the nav will