Hi Minudika,

thank you for showing up early and expressing an interest in participating
in GSoC this year with Zeppelin project!

Its great that you already have relevant experience creating an interpreter
for Zeppelin, building it, etc. If that work is opensource, could you
please also share the link here?

On the Beam interpreter project - AFAIK there is no such implementation
yet, so the objective, or the big goal I would suggest is: to build 2-3
Zeppelin notebooks, each showcasing some data analysis using Beam\Dataflow
API over a real dataset, using Flink\Spark executors.

I would imagine, that would include multiple steps like:

 - create plain java interpreter

 - create Beam interpreter (using plain java one + Beam SDK + executor on
top of Flink\Spark, including Interpreter features like "job progress
monitoring", "job cancellation" and auto-suggestions in UI)

 - create example notebooks showcasing data processing uing Beam
interpreter, over some real dataset

First steps would include a small research on feasibility of such plan of
course, and may be a simple CLI PoC (Java REPL, Beam SDK + Flink\Spark
executor).

What do you think?

---

Alex


On Sun, Feb 21, 2016, 11:58 Minudika Malshan <minudika...@gmail.com> wrote:

> Hi all,
>
> I am looking forward to contribute in this year GSOC.
> I have a basic understand in writing interpreters for zeppelin and have
> already implement an interpreter for WSO2 siddhi which is a complex event
> processing language.
> Could you please give me more details about the requirement for
> implementing a new interpreter for Apache beam?
> It would be a great help for me to getting familiar with this
> implementation.
>
> Thank you!
>
> Minudika Malshan
> Undergraduate
> Department of Computer Science and Engineering
> University of Moratuwa.
>

Reply via email to