[GitHub] apex-malhar pull request #330: Initial cut of Inner Join operator for REVIEW...

2016-07-08 Thread tushargosavi
Github user tushargosavi commented on a diff in the pull request: https://github.com/apache/apex-malhar/pull/330#discussion_r70042432 --- Diff: library/src/main/java/com/datatorrent/lib/join/AbstractManagedStateInnerJoinOperator.java --- @@ -0,0 +1,259 @@ +/** + * Licensed

[GitHub] apex-malhar pull request #330: Initial cut of Inner Join operator for REVIEW...

2016-07-08 Thread tushargosavi
Github user tushargosavi commented on a diff in the pull request: https://github.com/apache/apex-malhar/pull/330#discussion_r70042536 --- Diff: library/src/main/java/com/datatorrent/lib/join/AbstractInnerJoinOperator.java --- @@ -0,0 +1,187 @@ +/** + * Licensed to the Apac

[GitHub] apex-malhar pull request #330: Initial cut of Inner Join operator for REVIEW...

2016-07-08 Thread tushargosavi
Github user tushargosavi commented on a diff in the pull request: https://github.com/apache/apex-malhar/pull/330#discussion_r70043086 --- Diff: library/src/main/java/com/datatorrent/lib/join/AbstractInnerJoinOperator.java --- @@ -0,0 +1,187 @@ +/** + * Licensed to the Apac

[GitHub] apex-malhar pull request #330: Initial cut of Inner Join operator for REVIEW...

2016-07-08 Thread tushargosavi
Github user tushargosavi commented on a diff in the pull request: https://github.com/apache/apex-malhar/pull/330#discussion_r70043528 --- Diff: library/src/main/java/com/datatorrent/lib/join/AbstractManagedStateInnerJoinOperator.java --- @@ -0,0 +1,259 @@ +/** + * Licensed

[GitHub] apex-malhar pull request #330: Initial cut of Inner Join operator for REVIEW...

2016-07-08 Thread tushargosavi
Github user tushargosavi commented on a diff in the pull request: https://github.com/apache/apex-malhar/pull/330#discussion_r70049062 --- Diff: library/src/main/java/com/datatorrent/lib/join/managed/ManagedTimeStateMultiMap.java --- @@ -0,0 +1,247 @@ +/** + * Licensed to t

Re: APEXMALHAR-1701 Deduper in Malhar

2016-07-08 Thread Bhupesh Chawda
Thanks David. I'll try to create an implementation for Deduper which uses WindowedOperator. Will open a PR soon for review. ~ Bhupesh On Fri, Jul 8, 2016 at 2:23 AM, David Yan wrote: > Hi Bhupesh, > > I just added the method setFixedLateness(long millis) to > AbstractWindowedOperator in my PR.

Re: APEXCORE-408 : Ability to schedule Sub-DAG from running application

2016-07-08 Thread Tushar Gosavi
Hi All, I have dome some initial prototype which allows stat listener to specify dag changes, and the dag changes are applied asynchronously. The changes involved are - Add DagChangeSet object which is inherited from DAG, supporting methods to remove operator and streams. - The stat listener w

[jira] [Created] (APEXCORE-482) Default Unifiers should be ThreadLocal with downstream operators

2016-07-08 Thread Sandesh (JIRA)
Sandesh created APEXCORE-482: Summary: Default Unifiers should be ThreadLocal with downstream operators Key: APEXCORE-482 URL: https://issues.apache.org/jira/browse/APEXCORE-482 Project: Apache Apex Core

[jira] [Updated] (APEXCORE-473) Default Unifier without serialization

2016-07-08 Thread Sandesh (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandesh updated APEXCORE-473: - Description: When a DefaultUnifier is deployed in a separate container, there is no need to do Serializ

[jira] [Updated] (APEXCORE-474) Default unifier placement during M*1 deployment

2016-07-08 Thread Sandesh (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandesh updated APEXCORE-474: - Description: During M*1 deployment, unifier was deployed in the separate container. But there is no adv

[jira] [Updated] (APEXCORE-474) Default unifier placement during M*1 deployment

2016-07-08 Thread Sandesh (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandesh updated APEXCORE-474: - Description: During M*1 deployment, unifier was deployed in the separate container. But there is no adv

[jira] [Updated] (APEXCORE-474) Default unifier placement during M*1 deployment

2016-07-08 Thread Sandesh (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandesh updated APEXCORE-474: - Description: During M*1 deployment, unifier was deployed in the separate container. But there is no adv

Re: [Proposal] Make the default Unifier ThreadLocal with the downstream operator

2016-07-08 Thread Sandesh Hegde
Created a jira for this issue, https://issues.apache.org/jira/browse/APEXCORE-482 On Thu, Jul 7, 2016 at 9:22 PM Amol Kekre wrote: > +1. Makes sense. Do need to allow users to override if they want. > > Thks > Amol > > > On Thu, Jul 7, 2016 at 6:53 PM, Sandesh Hegde > wrote: > > > Hi All, > >

[jira] [Commented] (APEXCORE-482) Default Unifiers should be ThreadLocal with downstream operators

2016-07-08 Thread Munagala V. Ramanath (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15367868#comment-15367868 ] Munagala V. Ramanath commented on APEXCORE-482: --- Why is Mx1 special ? > D

[jira] [Commented] (APEXCORE-482) Default Unifiers should be ThreadLocal with downstream operators

2016-07-08 Thread Sandesh (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15367921#comment-15367921 ] Sandesh commented on APEXCORE-482: -- Mx1 is not CONTAINER_LOCAL yet, that needs to be fi

[GitHub] apex-core pull request #350: APEXCORE-222 Purging the stale data present in ...

2016-07-08 Thread vrozov
Github user vrozov commented on a diff in the pull request: https://github.com/apache/apex-core/pull/350#discussion_r70103320 --- Diff: bufferserver/src/main/java/com/datatorrent/bufferserver/server/Server.java --- @@ -199,6 +199,13 @@ private void handlePurgeRequest(PurgeRequestTu

[jira] [Commented] (APEXCORE-222) Delegate Buffer Server purge to StreamingContainer

2016-07-08 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15367945#comment-15367945 ] ASF GitHub Bot commented on APEXCORE-222: - Github user vrozov commented on a dif

[GitHub] apex-malhar pull request #298: [APEXMALHAR-2086] Kafka output operator: 0.9....

2016-07-08 Thread siyuanh
Github user siyuanh commented on a diff in the pull request: https://github.com/apache/apex-malhar/pull/298#discussion_r70109068 --- Diff: kafka/src/test/resources/log4j.properties --- @@ -47,3 +47,4 @@ log4j.logger.org.apache.apex=INFO log4j.logger.org.apacke.kafka=WARN --

[jira] [Commented] (APEXMALHAR-2086) Kafka Output Operator with Kafka 0.9 API

2016-07-08 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15368014#comment-15368014 ] ASF GitHub Bot commented on APEXMALHAR-2086: Github user siyuanh commente

[GitHub] apex-core pull request #351: APEXCORE-405 Common API to launch on local mode...

2016-07-08 Thread gauravgopi123
Github user gauravgopi123 commented on a diff in the pull request: https://github.com/apache/apex-core/pull/351#discussion_r70111000 --- Diff: api/src/main/java/com/datatorrent/api/ClusterMode.java --- @@ -0,0 +1,67 @@ +/** + * Licensed to the Apache Software Foundation (ASF

[jira] [Commented] (APEXCORE-405) Provide an API to launch DAG on the cluster

2016-07-08 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15368036#comment-15368036 ] ASF GitHub Bot commented on APEXCORE-405: - Github user gauravgopi123 commented o

[GitHub] apex-malhar pull request #298: [APEXMALHAR-2086] Kafka output operator: 0.9....

2016-07-08 Thread sandeshh
GitHub user sandeshh reopened a pull request: https://github.com/apache/apex-malhar/pull/298 [APEXMALHAR-2086] Kafka output operator: 0.9.0 Kafka output exactly once operator and the regular output operator. You can merge this pull request into a Git repository by running:

[GitHub] apex-malhar pull request #298: [APEXMALHAR-2086] Kafka output operator: 0.9....

2016-07-08 Thread sandeshh
Github user sandeshh closed the pull request at: https://github.com/apache/apex-malhar/pull/298 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[jira] [Commented] (APEXMALHAR-2086) Kafka Output Operator with Kafka 0.9 API

2016-07-08 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15368038#comment-15368038 ] ASF GitHub Bot commented on APEXMALHAR-2086: GitHub user sandeshh reopene

[jira] [Commented] (APEXMALHAR-2086) Kafka Output Operator with Kafka 0.9 API

2016-07-08 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15368037#comment-15368037 ] ASF GitHub Bot commented on APEXMALHAR-2086: Github user sandeshh closed

[jira] [Commented] (APEXCORE-482) Default Unifiers should be ThreadLocal with downstream operators

2016-07-08 Thread Munagala V. Ramanath (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15368066#comment-15368066 ] Munagala V. Ramanath commented on APEXCORE-482: --- Saw that, but I was wonde

[jira] [Commented] (APEXCORE-482) Default Unifiers should be ThreadLocal with downstream operators

2016-07-08 Thread Sandesh (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15368077#comment-15368077 ] Sandesh commented on APEXCORE-482: -- Talked to [~akekre] regarding Mx1, even he mentione

Re: APEXMALHAR-1701 Deduper in Malhar

2016-07-08 Thread David Yan
No problem. By the way, I changed the method name to setFixedWatermark. And also, if you want to drop any tuples that are considered late, you need to set the allowed lateness to be 0. David On Fri, Jul 8, 2016 at 4:55 AM, Bhupesh Chawda wrote: > Thanks David. > I'll try to create an implement

[jira] [Resolved] (APEXMALHAR-2133) Got NullPointerException when run KafkaInputOperatorTest unit test in Travis

2016-07-08 Thread bright chen (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] bright chen resolved APEXMALHAR-2133. - Resolution: Fixed > Got NullPointerException when run KafkaInputOperatorTest unit tes

Issue with httpclient

2016-07-08 Thread Pramod Immaneni
Hi, We have seen an issue with different versions of org.apache.httpcomponents:httpclient jar getting included in classpath when apps are launched on newer versions of Hadoop. This is because we have an explicit dependency in Apex to version 4.3.5, the version of hadoop we build against (2.2 for b

Re: Issue with httpclient

2016-07-08 Thread Pramod Immaneni
Forgot to add that 4.3.5 is a later version that what newer versions of hadoop are using (4.2.5) and we need it for kerberos SPNEGO security related functionality. Thanks On Fri, Jul 8, 2016 at 11:28 AM, Pramod Immaneni wrote: > Hi, > > We have seen an issue with different versions of > org.apa

Re: Issue with httpclient

2016-07-08 Thread Timothy Farkas
+1 Flink and Spark shade a lot of their dependencies to avoid problems like this, they both even shade guava. Don't see any down side to doing it. On Fri, Jul 8, 2016 at 11:31 AM, Pramod Immaneni wrote: > Forgot to add that 4.3.5 is a later version that what newer versions of > hadoop are using

Re: Issue with httpclient

2016-07-08 Thread David Yan
+1 for the change. This would fix dependency conflicts between httpclient 4.2.5 and 4.3.5. I think the question is whether we should publish a separate shaded artifact (like we did for ning-ahc) for the shaded httpclient 4.3.5, or just shading it internally in apex-engine. The former allows applica

[GitHub] apex-core pull request #350: APEXCORE-222 Purging the stale data present in ...

2016-07-08 Thread vrozov
Github user vrozov commented on a diff in the pull request: https://github.com/apache/apex-core/pull/350#discussion_r70151318 --- Diff: bufferserver/src/main/java/com/datatorrent/bufferserver/server/Server.java --- @@ -169,7 +168,7 @@ public String toString() return identi

[jira] [Commented] (APEXCORE-222) Delegate Buffer Server purge to StreamingContainer

2016-07-08 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15368613#comment-15368613 ] ASF GitHub Bot commented on APEXCORE-222: - Github user vrozov commented on a dif

Windowed Operator PR

2016-07-08 Thread David Yan
Hi all, The Windowed Operator PR is ready to be merged. Thank you very much for all your feedback so far. https://github.com/apache/apex-malhar/pull/319 Merging this PR will make projects related to the WindowedOperator go on more easily, which includes High level API, Apache Calcite support, Ap

Re: Windowed Operator PR

2016-07-08 Thread Siyuan Hua
+1 We should merge this ASAP. I don't think we could solve all the problems in one PR and I think David's PR is good enough that we can keep working on this incrementally and in parallel. Regards, Siyuan On Fri, Jul 8, 2016 at 4:10 PM, David Yan wrote: > Hi all, > > The Windowed Operator PR is

Re: Issue with httpclient

2016-07-08 Thread Pramod Immaneni
Good question. Even though it would be useful for somebody wanting to use SPNEGO in their operator for example I don't see any immediate use. We can just go with it being internal and create a separate artifact in future if need arises. What do you think? Thanks On Fri, Jul 8, 2016 at 11:40 AM, D

Re: Issue with httpclient

2016-07-08 Thread David Yan
I'm good with shading it internally in apex-engine for now until there is an actual need in a real use case. Thanks! David On Fri, Jul 8, 2016 at 4:42 PM, Pramod Immaneni wrote: > Good question. Even though it would be useful for somebody wanting to use > SPNEGO in their operator for example I