+1 for the proposal.
Compatibility Matrix
<https://beam.apache.org/documentation/runners/capability-matrix/> can be a
good place to show case parity between different runners.
Do you think we should write 2 way examples [Spark, Flink, ..]<=>Beam?



On Sat, Jun 1, 2019 at 4:31 PM Reza Rokni <r...@google.com> wrote:

> For layer 1, what about working through this link as a starting point :
> https://spark.apache.org/docs/latest/rdd-programming-guide.html#transformations
> ?
>
> On Sat, 1 Jun 2019 at 09:21, Ahmet Altay <al...@google.com> wrote:
>
>> Thank you Reza. That separation makes sense to me.
>>
>> On Wed, May 29, 2019 at 6:26 PM Reza Rokni <r...@google.com> wrote:
>>
>>> +1
>>>
>>> I think there will be at least two layers of this;
>>>
>>> Layer 1 - Using primitives : I do join, GBK, Aggregation... with system
>>> x this way, what is the canonical equivalent in Beam.
>>> Layer 2 - Patterns : I read and join Unbounded and Bounded Data in
>>> system x this way, what is the canonical equivalent in Beam.
>>>
>>> I suspect as a first pass Layer 1 is reasonably well bounded work, there
>>> would need to be agreement on "canonical" version of how to do something in
>>> Beam as this could be seen to be opinionated. As there are often a
>>> multitude of ways of doing x....
>>>
>>
>> Once we identify a set of layer 1 items, we could crowd source the
>> canonical implementations. I believe we can use our usual code review
>> process to settle on a version that is agreeable. (Examples have the same
>> issue, they are probably opinionated today based on the author but it works
>> out.)
>>
>>
>>>
>>>
>>> On Thu, 30 May 2019 at 08:56, Ahmet Altay <al...@google.com> wrote:
>>>
>>>> Hi all,
>>>>
>>>> Inspired by the user asking about a Spark feature in Beam [1] in the
>>>> release thread, I searched the user@ list and noticed a few instances
>>>> of people asking for question like "I can do X in Spark, how can I do that
>>>> in Beam?" Would it make sense to add documentation to explain how certain
>>>> tasks that can be accomplished in Beam with side by side examples of doing
>>>> the same task in Beam/Spark etc. It could help with on-boarding because it
>>>> will be easier for people to leverage their existing knowledge. It could
>>>> also help other frameworks as well, because it will serve as a Rosetta
>>>> stone with two translations.
>>>>
>>>> Questions I have are:
>>>> - Would such a thing be a helpful?
>>>> - Is it feasible? Would a few pages worth of examples can cover enough
>>>> use cases?
>>>>
>>>> Thank you!
>>>> Ahmet
>>>>
>>>> [1]
>>>> https://lists.apache.org/thread.html/b73a54aa1e6e9933628f177b04a8f907c26cac854745fa081c478eff@%3Cdev.beam.apache.org%3E
>>>>
>>>
>>>
>>> --
>>>
>>> This email may be confidential and privileged. If you received this
>>> communication by mistake, please don't forward it to anyone else, please
>>> erase all copies and attachments, and please let me know that it has gone
>>> to the wrong person.
>>>
>>> The above terms reflect a potential business arrangement, are provided
>>> solely as a basis for further discussion, and are not intended to be and do
>>> not constitute a legally binding obligation. No legally binding obligations
>>> will be created, implied, or inferred until an agreement in final form is
>>> executed in writing by all parties involved.
>>>
>>
>
> --
>
> This email may be confidential and privileged. If you received this
> communication by mistake, please don't forward it to anyone else, please
> erase all copies and attachments, and please let me know that it has gone
> to the wrong person.
>
> The above terms reflect a potential business arrangement, are provided
> solely as a basis for further discussion, and are not intended to be and do
> not constitute a legally binding obligation. No legally binding obligations
> will be created, implied, or inferred until an agreement in final form is
> executed in writing by all parties involved.
>

Reply via email to