Re: Contributing to Apache Apex

2016-03-23 Thread Amol Kekre
Rodrigo, First and foremost welcome aboard. It is great to have someone of your calibre join our community. Contributing guidelines is the way to go. AFAIK, we do not have a system to mark jiras for beginners. But one idea to get your feet wet is to develop example applications in Malhar https://gi

Re: [VOTE] Graduate Apex from the Incubator

2016-03-23 Thread Chris Nauroth
+1 (binding) Nice work, everyone! --Chris Nauroth On 3/22/16, 10:36 AM, "Pramod Immaneni" wrote: >Dear Community, > >Based on the positive discussion, I'm calling for the community VOTE >to graduate Apache Apex (incubating) to top level project (TLP). > >Please vote on the resolution pasted

Re: Aligning FileSplitter and BlocReader with hadoop.mapreduce InputFormats

2016-03-23 Thread Yogi Devendra
Aligning FileSplitter, BlockReader with respective counterparts from mapreduce will be excellent value addition. IMO, it has 2 advantages: 1. It will allow us to plug-in more formats for FileSplitter+BlockReader pattern use-cases. 2. It will be easy for end-users coming from mapreduce background

Re: Aligning FileSplitter and BlocReader with hadoop.mapreduce InputFormats

2016-03-23 Thread Priyanka Gugale
So as I understand splitter would be format aware, in that case would we need different kinds of parser we have right now? Or the format aware splitter will take care of parsing different file formats e.g. csv etc? -Priyanka On Wed, Mar 23, 2016 at 11:41 PM, Devendra Tagare wrote: > Hi All, > >

[jira] [Commented] (APEXMALHAR-2013) HDFS output module for file copy

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209745#comment-15209745 ] ASF GitHub Bot commented on APEXMALHAR-2013: GitHub user yogidevendra reo

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-2013 : HDFS output ...

2016-03-23 Thread yogidevendra
GitHub user yogidevendra reopened a pull request: https://github.com/apache/incubator-apex-malhar/pull/216 APEXMALHAR-2013 : HDFS output module for file copy 1. Added operators required for file copy 2. Added junit tests 3. Added sample app for HDFS to HDFS file copy app You

[jira] [Commented] (APEXMALHAR-2013) HDFS output module for file copy

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209744#comment-15209744 ] ASF GitHub Bot commented on APEXMALHAR-2013: Github user yogidevendra clo

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-2013 : HDFS output ...

2016-03-23 Thread yogidevendra
Github user yogidevendra closed the pull request at: https://github.com/apache/incubator-apex-malhar/pull/216 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or i

[jira] [Commented] (APEXCORE-405) Provide an API to launch DAG on the cluster

2016-03-23 Thread Tushar Gosavi (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209698#comment-15209698 ] Tushar Gosavi commented on APEXCORE-405: +1 for this feature, much needed if we

[jira] [Commented] (APEXCORE-202) Integration with Samoa

2016-03-23 Thread Bhupesh Chawda (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209661#comment-15209661 ] Bhupesh Chawda commented on APEXCORE-202: - Yes, that will be great! We can then

Contributing to Apache Apex

2016-03-23 Thread Rodrigo Abreu
Hi, My name is Rodrigo Abreu, and I have been writing code and working with technology for over 15 years. Nowadays, I specialize in Stream Processing, Complex Event Processing (CEP), microservices, messaging and Hadoop/Big Data related architectures/technologies. I would like to learn more about

Re: [VOTE] Graduate Apex from the Incubator

2016-03-23 Thread Hitesh Shah
+1 — Hitesh On Mar 22, 2016, at 10:36 AM, Pramod Immaneni wrote: > Dear Community, > > Based on the positive discussion, I'm calling for the community VOTE > to graduate Apache Apex (incubating) to top level project (TLP). > > Please vote on the resolution pasted below. > > [ ] +1 Graduate

[jira] [Commented] (APEXMALHAR-1897) Create ManagedState

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209579#comment-15209579 ] ASF GitHub Bot commented on APEXMALHAR-1897: Github user amberarrow comme

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-1897 added managed ...

2016-03-23 Thread amberarrow
Github user amberarrow commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/145#discussion_r57266129 --- Diff: library/src/main/java/com/datatorrent/lib/state/managed/AbstractManagedStateImpl.java --- @@ -226,6 +232,15 @@ protected int pre

[jira] [Commented] (APEXCORE-405) Provide an API to launch DAG on the cluster

2016-03-23 Thread Siyuan Hua (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209394#comment-15209394 ] Siyuan Hua commented on APEXCORE-405: - Love this feature, when include this with sca

Re: Apex DataFrame

2016-03-23 Thread David Yan
Hi Brendan, Can you draw the result DAG for the DataFrame pseudo-code in the document? I'd assume there will be a file input operator, a CSV parser operator, a filter operator (for price != null and datetime > 1992/1/24), an enricher (for normalized price and price max 30 days), a custom non-DataF

[jira] [Resolved] (APEXCORE-403) DelayOperator unit test fails intermittently

2016-03-23 Thread David Yan (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Yan resolved APEXCORE-403. Resolution: Fixed > DelayOperator unit test fails intermittently > ---

[jira] [Commented] (APEXCORE-403) DelayOperator unit test fails intermittently

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209352#comment-15209352 ] ASF GitHub Bot commented on APEXCORE-403: - Github user asfgit closed the pull re

[GitHub] incubator-apex-core pull request: APEXCORE-403 Fixed unit test

2016-03-23 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/incubator-apex-core/pull/284 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the fe

[jira] [Commented] (APEXMALHAR-1897) Create ManagedState

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209347#comment-15209347 ] ASF GitHub Bot commented on APEXMALHAR-1897: Github user chandnisingh com

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-1897 added managed ...

2016-03-23 Thread chandnisingh
Github user chandnisingh commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/145#discussion_r57250223 --- Diff: library/src/main/java/com/datatorrent/lib/state/BucketedState.java --- @@ -0,0 +1,60 @@ +/** + * Licensed to the Apache

[jira] [Commented] (APEXMALHAR-1897) Create ManagedState

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209343#comment-15209343 ] ASF GitHub Bot commented on APEXMALHAR-1897: Github user amberarrow comme

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-1897 added managed ...

2016-03-23 Thread amberarrow
Github user amberarrow commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/145#discussion_r57249937 --- Diff: library/src/main/java/com/datatorrent/lib/state/BucketedState.java --- @@ -0,0 +1,60 @@ +/** + * Licensed to the Apache S

[jira] [Updated] (APEXCORE-403) DelayOperator unit test fails intermittently

2016-03-23 Thread David Yan (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Yan updated APEXCORE-403: --- Fix Version/s: 3.3.1 > DelayOperator unit test fails intermittently > --

[jira] [Commented] (APEXCORE-202) Integration with Samoa

2016-03-23 Thread David Yan (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209329#comment-15209329 ] David Yan commented on APEXCORE-202: Hi Bhupesh, APEXCORE-405 will provide a Java AP

[jira] [Created] (APEXCORE-405) Provide an API to launch DAG on the cluster

2016-03-23 Thread Pramod Immaneni (JIRA)
Pramod Immaneni created APEXCORE-405: Summary: Provide an API to launch DAG on the cluster Key: APEXCORE-405 URL: https://issues.apache.org/jira/browse/APEXCORE-405 Project: Apache Apex Core

[jira] [Commented] (APEXMALHAR-1897) Create ManagedState

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209311#comment-15209311 ] ASF GitHub Bot commented on APEXMALHAR-1897: Github user chandnisingh com

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-1897 added managed ...

2016-03-23 Thread chandnisingh
Github user chandnisingh commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/145#discussion_r57248568 --- Diff: library/src/main/java/com/datatorrent/lib/state/BucketedState.java --- @@ -0,0 +1,60 @@ +/** + * Licensed to the Apache

[jira] [Commented] (APEXCORE-403) DelayOperator unit test fails intermittently

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209303#comment-15209303 ] ASF GitHub Bot commented on APEXCORE-403: - Github user PramodSSImmaneni commente

[GitHub] incubator-apex-core pull request: APEXCORE-403 Fixed unit test

2016-03-23 Thread PramodSSImmaneni
Github user PramodSSImmaneni commented on a diff in the pull request: https://github.com/apache/incubator-apex-core/pull/284#discussion_r57248038 --- Diff: engine/src/test/java/com/datatorrent/stram/plan/logical/DelayOperatorTest.java --- @@ -473,4 +476,5 @@ public void testValida

[jira] [Commented] (APEXMALHAR-1897) Create ManagedState

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209301#comment-15209301 ] ASF GitHub Bot commented on APEXMALHAR-1897: Github user amberarrow comme

[jira] [Commented] (APEXCORE-403) DelayOperator unit test fails intermittently

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209300#comment-15209300 ] ASF GitHub Bot commented on APEXCORE-403: - Github user PramodSSImmaneni commente

[GitHub] incubator-apex-core pull request: APEXCORE-403 Fixed unit test

2016-03-23 Thread PramodSSImmaneni
Github user PramodSSImmaneni commented on a diff in the pull request: https://github.com/apache/incubator-apex-core/pull/284#discussion_r57247928 --- Diff: engine/src/test/java/com/datatorrent/stram/plan/logical/DelayOperatorTest.java --- @@ -349,6 +352,7 @@ public void testFibona

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-1897 added managed ...

2016-03-23 Thread amberarrow
Github user amberarrow commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/145#discussion_r57247940 --- Diff: library/src/main/java/com/datatorrent/lib/state/managed/ManagedTimeUnifiedStateImpl.java --- @@ -0,0 +1,238 @@ +/** + *

[jira] [Commented] (APEXMALHAR-1897) Create ManagedState

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209296#comment-15209296 ] ASF GitHub Bot commented on APEXMALHAR-1897: Github user chandnisingh com

[jira] [Updated] (APEXCORE-403) DelayOperator unit test fails intermittently

2016-03-23 Thread David Yan (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Yan updated APEXCORE-403: --- Fix Version/s: 3.4.0 > DelayOperator unit test fails intermittently > --

[jira] [Commented] (APEXCORE-403) DelayOperato unit test fails intermittently

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209290#comment-15209290 ] ASF GitHub Bot commented on APEXCORE-403: - Github user davidyan74 commented on t

[jira] [Updated] (APEXCORE-403) DelayOperator unit test fails intermittently

2016-03-23 Thread David Yan (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Yan updated APEXCORE-403: --- Summary: DelayOperator unit test fails intermittently (was: DelayOperato unit test fails intermitte

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-1897 added managed ...

2016-03-23 Thread chandnisingh
Github user chandnisingh commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/145#discussion_r57247730 --- Diff: library/src/main/java/com/datatorrent/lib/state/managed/ManagedTimeUnifiedStateImpl.java --- @@ -0,0 +1,238 @@ +/** +

[GitHub] incubator-apex-core pull request: APEXCORE-403 Fixed unit test

2016-03-23 Thread davidyan74
Github user davidyan74 commented on the pull request: https://github.com/apache/incubator-apex-core/pull/284#issuecomment-200566464 @PramodSSImmaneni please merge --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your p

Re: Apex DataFrame

2016-03-23 Thread Devendra Tagare
Hi, You can create a case class.Then map the incoming RDD's to the case class & convert the map to a data frame.By doing this you would have a Dataframe with the respective fields and associated datatypes set as per the ETL rules defined before setting the members of the case class. Sample below

[jira] [Commented] (APEXMALHAR-1897) Create ManagedState

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209244#comment-15209244 ] ASF GitHub Bot commented on APEXMALHAR-1897: Github user chandnisingh com

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-1897 added managed ...

2016-03-23 Thread chandnisingh
Github user chandnisingh commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/145#discussion_r57243863 --- Diff: library/src/main/java/com/datatorrent/lib/state/managed/ManagedTimeUnifiedStateImpl.java --- @@ -0,0 +1,238 @@ +/** +

Re: Apex DataFrame

2016-03-23 Thread Ashwin Chandra Putta
Can you also provide types for fields in the code example? How should a fieldname associate itself with a type? Regards, Ashwin. On Wed, Mar 23, 2016 at 9:32 AM, Amol Kekre wrote: > Ilya, Brendan, > Thanks for driving this feature set. We are looking into it. We should have > feedback soon. > >

[jira] [Commented] (APEXMALHAR-1897) Create ManagedState

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209217#comment-15209217 ] ASF GitHub Bot commented on APEXMALHAR-1897: Github user chandnisingh com

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-1897 added managed ...

2016-03-23 Thread chandnisingh
Github user chandnisingh commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/145#discussion_r57241107 --- Diff: library/src/main/java/com/datatorrent/lib/state/managed/AbstractManagedStateImpl.java --- @@ -0,0 +1,499 @@ +/** + * L

Re: S3 Output Module

2016-03-23 Thread Ashwin Chandra Putta
+1 regarding the s3 upload functionality. However, I think we should just focus on multipart upload directly as it comes with various advantages like higher throughput, faster recovery, not needing to wait for entire file being created before uploading each part. See: http://docs.aws.amazon.com/Am

[jira] [Moved] (APEXMALHAR-2025) Move FileLineInputOperator out of AbstractFileInputOperator

2016-03-23 Thread Thomas Weise (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Weise moved APEXCORE-404 to APEXMALHAR-2025: --- Workflow: Default workflow, editable Closed status (was:

Re: S3 Input Module

2016-03-23 Thread Ashwin Chandra Putta
Chaitanya, For hadoop version < 2.6, 1. Is the readersCount value forced to 1 irrespective of the value configured by user? 2. Is it possible to allow for parallel file reads i.e. 1 reader per file? Also, just to confirm. No more copying s3a files from hadoop for previous versions right? Regard

Aligning FileSplitter and BlocReader with hadoop.mapreduce InputFormats

2016-03-23 Thread Devendra Tagare
Hi All, Initiating this thread to get the community's opinion on aligning the FileSplitter with InputSplit & the BlockReader with the RecordReader from org.apache.hadoop.mapreduce.InputSplit & org.apache.hadoop.mapreduce.RecordReader respectively. Some more details and rationale on the approach,

Adding AvroFileInputOperator to Malhar

2016-03-23 Thread Devendra Tagare
Hi All, I am working on adding a concrete implementation for reading Avro container files by extending the AbstractFileInputOperator & emitting Generic Records based on the file schema. This operator would be an input adapter and can work together with the Avro to POJO operator to read an Avro co

Re: malhar library travis PR build failed

2016-03-23 Thread Chinmay Kolhatkar
Hi Yogi, I recently observed the issue... Closing and reopening the same PR solved this for me. Thanks, Chinmay. --- Sent from mobile. On 23 Mar 2016 10:36 p.m., "Yogi Devendra" wrote: > Hi, > > I raised a PR for HDFS file copy module > https://github.com/apache/incubator-apex-malhar/pull/216

malhar library travis PR build failed

2016-03-23 Thread Yogi Devendra
Hi, I raised a PR for HDFS file copy module https://github.com/apache/incubator-apex-malhar/pull/216 malhar build for this branch passes for my localhost; but fails on travis. Here are the logs from travis: https://s3.amazonaws.com/archive.travis-ci.org/jobs/118022851/log.txt There are no test

[jira] [Commented] (APEXMALHAR-2013) HDFS output module for file copy

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15208736#comment-15208736 ] ASF GitHub Bot commented on APEXMALHAR-2013: GitHub user yogidevendra reo

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-2013 : HDFS output ...

2016-03-23 Thread yogidevendra
Github user yogidevendra closed the pull request at: https://github.com/apache/incubator-apex-malhar/pull/216 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or i

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-2013 : HDFS output ...

2016-03-23 Thread yogidevendra
GitHub user yogidevendra reopened a pull request: https://github.com/apache/incubator-apex-malhar/pull/216 APEXMALHAR-2013 : HDFS output module for file copy 1. Added operators required for file copy 2. Added junit tests 3. Added sample app for HDFS to HDFS file copy app You

[jira] [Commented] (APEXMALHAR-2013) HDFS output module for file copy

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15208735#comment-15208735 ] ASF GitHub Bot commented on APEXMALHAR-2013: Github user yogidevendra clo

Re: Apex DataFrame

2016-03-23 Thread Amol Kekre
Ilya, Brendan, Thanks for driving this feature set. We are looking into it. We should have feedback soon. Thks, Amol On Tue, Mar 22, 2016 at 6:27 PM, Ganelin, Ilya wrote: > Would love the community's input on this idea and thoughts on technical > challenges we may run into when implementing th

[jira] [Commented] (APEXMALHAR-2013) HDFS output module for file copy

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15208697#comment-15208697 ] ASF GitHub Bot commented on APEXMALHAR-2013: Github user yogidevendra com

[jira] [Created] (APEXMALHAR-2024) Refactor AbstractFileOutputOperator to use Autometrics instead of Basic Counters

2016-03-23 Thread Yogi Devendra (JIRA)
Yogi Devendra created APEXMALHAR-2024: - Summary: Refactor AbstractFileOutputOperator to use Autometrics instead of Basic Counters Key: APEXMALHAR-2024 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2024

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-2013 : HDFS output ...

2016-03-23 Thread yogidevendra
Github user yogidevendra commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/216#discussion_r57190169 --- Diff: library/src/main/java/com/datatorrent/lib/io/fs/BlockWriter.java --- @@ -0,0 +1,210 @@ +/** + * Licensed to the Apache

Re: S3 Output Module

2016-03-23 Thread Tushar Gosavi
+1 , we need this functionality. Is it going to be a single operator or multiple operators? If multiple operators, then can you explain what functionality each operator will provide? Regards, -Tushar. On Wed, Mar 23, 2016 at 5:01 PM, Yogi Devendra wrote: > Writing to S3 is a common use-case

Re: GSOC: AVSR in Apache Apex

2016-03-23 Thread Atri Sharma
1) You need to write to IPMC, not Google, for being a mentor. ASF will take your name as a mentor for ASF. 2) Yes, we can definitely select the project we wish to mentor. Note that ASF primarily encourages podlings to participate in GSoC through the ASF community, so you will be able to mentor a ch

Re: GSOC: AVSR in Apache Apex

2016-03-23 Thread Priyanka Gugale
Hi Atri, I have couple of questions, 1. Now as mentor deadlines are over, we can't provide our ideas as project to any students right? 2. If IPMC assigns one of us as mentor, can we choose which projects we can guide? I assume mentors who doesn't have any idea associated with his name could be ask

Re: S3 Output Module

2016-03-23 Thread Yogi Devendra
Writing to S3 is a common use-case for applications. This module will be definitely helpful. +1 for adding this module. ~ Yogi On 22 March 2016 at 13:52, Chaitanya Chebolu wrote: > Hi All, > > I am proposing S3 output copy Module. Primary functionality of this > module is uploading files to

Naming sugestion for HDFS output modules

2016-03-23 Thread Yogi Devendra
Hi, Currently, I am in the process of developing HDFS output module: We have two modules for HDFS output. 1. Tuple based [] 2. File based (used for file copy) Currently, I am calling #1 as "HDFS output module" as this module is the one which will be mostly used to write tuples to HDFS. I am call

Re: GSOC: AVSR in Apache Apex

2016-03-23 Thread HARSH PATHAK
Respected Community, I have updated my GSOC Project Proposal Link as per your all guidelines.I am thank full for that, shall I mention more about DAG and Windowing as those are key components of

[APEXCORE-107] Adding modules in DAG through property and json file.

2016-03-23 Thread Tushar Gosavi
Hi All, I am planning provide support for adding modules into the json and property file specification of DAG. We will go with the same syntax as discussed in following mail thread. https://mail-archives.apache.org/mod_mbox/incubator-apex-dev/201512.mbox/%3C565D0E2C.2000805%40datatorrent.com%3E

[jira] [Commented] (APEXCORE-107) Support adding module to application using property file API.

2016-03-23 Thread Tushar Gosavi (JIRA)
[ https://issues.apache.org/jira/browse/APEXCORE-107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15208220#comment-15208220 ] Tushar Gosavi commented on APEXCORE-107: There are multiple choices for the desi

[jira] [Created] (APEXMALHAR-2023) Adding Enrichment Operator to Malhar

2016-03-23 Thread Chinmay Kolhatkar (JIRA)
Chinmay Kolhatkar created APEXMALHAR-2023: - Summary: Adding Enrichment Operator to Malhar Key: APEXMALHAR-2023 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2023 Project: Apache Apex Ma

[jira] [Commented] (APEXMALHAR-2015) Projection Operator

2016-03-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-2015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15208071#comment-15208071 ] ASF GitHub Bot commented on APEXMALHAR-2015: Github user chinmaykolhatkar

[GitHub] incubator-apex-malhar pull request: APEXMALHAR-2015: Projection Op...

2016-03-23 Thread chinmaykolhatkar
Github user chinmaykolhatkar commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/217#discussion_r57123958 --- Diff: library/src/main/java/com/datatorrent/lib/projection/ProjectionOperator.java --- @@ -0,0 +1,311 @@ +/** + * License

Re: [VOTE] Graduate Apex from the Incubator

2016-03-23 Thread Amruta Phansalker
+1. On Wed, Mar 23, 2016 at 11:13 AM, Priyanka Gugale wrote: > +1 > > -Priyanka > > On Wed, Mar 23, 2016 at 11:09 AM, Akshay Gore > wrote: > > > +1 > > > > Looking forward Apex as Apache TLP. > > > > -Akshay > > > > On Wed, Mar 23, 2016 at 10:53 AM, Shubham Pathak < > shub...@datatorrent.com> >

Re: Bandwidth control for Input operators in Apex

2016-03-23 Thread Yogi Devendra
This looks OK. Let us build it incrementally. ~ Yogi On 23 March 2016 at 13:24, Sandeep Deshmukh wrote: > I would suggest that we go ahead with design as suggested by Priyanka where > we have bandwidth setup for each operator separately. We can later extend > this for bandwidth to be shared wit

Re: Streaming JSON parser

2016-03-23 Thread Chinmay Kolhatkar
+1 for the Streaming JSON parser. On Wed, Mar 23, 2016 at 12:02 PM, Justin Mclean wrote: > Hi, > > Would this help? > > http://johnzon.incubator.apache.org > > Justin >

Re: Bandwidth control for Input operators in Apex

2016-03-23 Thread Sandeep Deshmukh
I would suggest that we go ahead with design as suggested by Priyanka where we have bandwidth setup for each operator separately. We can later extend this for bandwidth to be shared with different input operators or for the DAG as a whole. Regards, Sandeep On Wed, Mar 23, 2016 at 11:51 AM, Priyan

[jira] [Closed] (APEXMALHAR-1972) Create Expression Evaluator Support quasi-Java Expression Language

2016-03-23 Thread Chinmay Kolhatkar (JIRA)
[ https://issues.apache.org/jira/browse/APEXMALHAR-1972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chinmay Kolhatkar closed APEXMALHAR-1972. - Resolution: Fixed Fix Version/s: 3.4.0 This is done as a part of APEXM