[ https://issues.apache.org/jira/browse/APEXCORE-119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Timothy Farkas reassigned APEXCORE-119: --------------------------------------- Assignee: (was: Timothy Farkas) > Add Support For A New Type Of (Distributed) Operator > ---------------------------------------------------- > > Key: APEXCORE-119 > URL: https://issues.apache.org/jira/browse/APEXCORE-119 > Project: Apache Apex Core > Issue Type: Improvement > Reporter: Timothy Farkas > > This JIRA Proposes support for a new type of distributed operator. Currently > when an operator is partitioned there is no platform supported mechanism > through which partitions can talk to each other. A Distributed operator would > have an easy to use platform supported mechanism through which operators in a > partitioning can exchange information with each other. Eventually Distributed > operators would support running plain old single threaded java code > transparently across partitions. > In summary the goals would be to do the following: > 1 - provide a platform supported fault tolerant mechanism through which > operators in a partitioning can talk to each other. > 2 - provide a platform supported way to run plain old single threaded java > code accross all the partitions of a Distributed operator > The benefits of implementing this would be huge: > 1 - Using distributed operators we could support large in memory fault > tolerant data structures (graphs, maps, arrays) in a fault tolerant way. Like > Spark's RDD's but better. > 2 - Plain old java code could be used to access and manipulate the data > structures, without the user having the learn complex API's like with Spark. > An implementation proposal and presentation are coming soon. -- This message was sent by Atlassian JIRA (v6.4.14#64029)