Hi David,

Generally speaking, having different fluent DSL on top of the Beam SDK is great.

I would like to take a look on your wordcount examples to give you a complete feedback. I like the idea and a fluent Java DSL is valuable.

Let's wait feedback from others. If we have a consensus, then I would be more than happy to help you for the donation (I worked on the Camel Java DSL while ago, so I have some experience here).

Thanks !
Regards
JB

On 12/17/2017 07:00 PM, David Morávek wrote:
Hello,


First of all, thanks for the amazing work the Apache Beam community is doing!


In 2014, we've started development of the runtime independent Java 8 API, that helps us to create unified big-data processing flows. It has been used as a core building block of Seznam.cz web crawler data infrastructure every since. Its design principles and execution model are very similar to Apache Beam.


This API was open sourced in 2016, under the name Euphoria API:

https://github.com/seznam/euphoria


As it is very similar to Apache Beam, we feel, that it is not worth of duplicating effort in terms of development of new runtimes and fine-tuning of current ones.


The main blocker for us to switch to Apache Beam is lack of the Java 8 API. *W*e propose the integration of Euphoria API into Apache Beam as a Java 8 DSL, in order to share our effort with the community.


Simple example of the Euphoria API usage, can be found here:

https://github.com/seznam/euphoria/tree/master/euphoria-examples/src/main/java/cz/seznam/euphoria/examples/wordcount


If you feel, that Beam community could leverage from our work, we would love to start working on Euphoria integration into Apache Beam (we already have a working POC, with few basic operators implemented).


I look forward to hearing from you,

David


--
Jean-Baptiste Onofré
[email protected]
http://blog.nanthrax.net
Talend - http://www.talend.com

Reply via email to