Re: Join a meeting to help coordinate implementing a Dask Runner for Beam

2022-08-03 Thread Brian Hulette via dev
I wanted to share that Ryan gave a presentation about his (and Charles')
work on Pangeo Forge at Scipy 2022 (in Austin just before Beam Summit!),
with a couple mentions of their transition to Beam [1]. There were also a
couple of other talks about Pangeo [2,3] with some Beam/xarray-beam
references in there.

[1]
https://www.youtube.com/watch?v=sY20UpYCAEE=PLYx7XA2nY5Gde0WF1yswQw5InhmSNED8o=9
[2]
https://www.youtube.com/watch?v=7niNfs3ZpfQ=PLYx7XA2nY5Gfb0tQyezb4Gsf1nVsy86zt=2
[3]
https://www.youtube.com/watch?v=ftlgOESINvo=PLYx7XA2nY5Gfb0tQyezb4Gsf1nVsy86zt=3

On Tue, Jun 21, 2022 at 9:29 AM Ahmet Altay  wrote:

> Were you able to meet? If yes, I would be very interested in a summary if
> someone would like to share that :)
>
> On Mon, Jun 13, 2022 at 9:16 AM Pablo Estrada  wrote:
>
>> Also added my availability... please do invite me as well : )
>> -P.
>>
>> On Mon, Jun 13, 2022 at 6:57 AM Kenneth Knowles  wrote:
>>
>>> I would love to try to join any meetings if you add me. My calendar is
>>> too chaotic to be useful on the when2meet :-) but I can often move things
>>> around.
>>>
>>> Kenn
>>>
>>> On Wed, Jun 8, 2022 at 2:50 PM Brian Hulette 
>>> wrote:
>>>
 Thanks for reaching out, Ryan, this sounds really cool. I added my
 availability to the calendar since I'm interested in this space, but I'm
 not sure I can offer much help - I don't have any experience building a
 runner, to date I've worked exclusively on the SDK side of Beam. So I hope
 some other folks can join as well :)

 @Pablo Estrada  might have some useful insight -
 he's been working on a spike to build a Ray runner.


 On Wed, Jun 8, 2022 at 12:53 PM Robert Bradshaw 
 wrote:

> This sounds like a great project. Unfortunately I wouldn't be able to
> meet next week, but would be happy to meet some other time and if that
> doesn't work answer questions over email, etc. Looking forward to a
> Dask runner.
>
> On Wed, Jun 8, 2022 at 9:04 AM Ryan Abernathey
>  wrote:
> >
> > Dear Beamer,
> >
> > Thank you for all of your work on this amazing project. I am new to
> Beam and am quite excited about its potential to help with some data
> processing challenges in my field of climate science.
> >
> > Our community is interested in running Beam on Dask Distributed
> clusters, which we already know how to deploy. This has been discussed at
> https://issues.apache.org/jira/browse/BEAM-5336 and
> https://github.com/apache/beam/issues/18962. It seems technically
> feasible.
> >
> > We are trying to organize a meeting next week to kickstart and
> coordinate this effort. It would be great if we could entrain some Beam
> maintainers into this meeting. If you have interest in this topic and are
> available next week, please share your availability here -
> https://www.when2meet.com/?15861604-jLnA4
> >
> > Alternatively, if you have any guidance or suggestions you wish to
> provide by email or GitHub discussion, we welcome your input.
> >
> > Thanks again for your open source work.
> >
> > Best,
> > Ryan Abernathey
> >
>



Re: Join a meeting to help coordinate implementing a Dask Runner for Beam

2022-07-22 Thread Pablo Estrada via dev
+Brian Hulette  and +Kenneth Knowles  were
curious about this effort.

Alex can you elaborate on what happened with GH actions? We can help you
figure out the CI story.

On Tue, Jun 21, 2022 at 11:06 AM Alex Merose  wrote:

> We had a great meeting last week on this topic! Here is a proposal /
> meeting notes doc:
>
> https://docs.google.com/document/d/1Awj_eNmH-WRSte3bKcCcUlQDiZ5mMKmCO_xV-mHWAak/edit#heading=h.y0pwg4polebc
>
> Tomorrow, another engineer (https://github.com/cisaacstern) and I are
> meeting to create an initial prototype of the Dask runner in the main Beam
> repo. Let us know if you'd like to help out in any way. We'll post updates
> in this mailing list + the above doc.
>
> Best,
> Alex Merose
>
> On 2022/06/08 14:22:41 Ryan Abernathey wrote:
> > Dear Beamer,
> >
> > Thank you for all of your work on this amazing project. I am new to Beam
> > and am quite excited about its potential to help with some data
> processing
> > challenges in my field of climate science.
> >
> > Our community is interested in running Beam on Dask Distributed clusters,
> > which we already know how to deploy. This has been discussed at
> > https://issues.apache.org/jira/browse/BEAM-5336 and
> > https://github.com/apache/beam/issues/18962. It seems technically
> feasible.
> >
> > We are trying to organize a meeting next week to kickstart and coordinate
> > this effort. It would be great if we could entrain some Beam maintainers
> > into this meeting. If you have interest in this topic and are available
> > next week, please share your availability here -
> > https://www.when2meet.com/?15861604-jLnA4
> >
> > Alternatively, if you have any guidance or suggestions you wish to
> provide
> > by email or GitHub discussion, we welcome your input.
> >
> > Thanks again for your open source work.
> >
> > Best,
> > Ryan Abernathey
> >
>


RE: Join a meeting to help coordinate implementing a Dask Runner for Beam

2022-07-08 Thread Alex Merose via dev
Dear Beam & Dask communities,

Together with Pablo and Charles, I've hacked together an initial prototype
of a Dask runner for Beam. I'm happy to announce that I have minimum viable
working version in a fork here: https://github.com/alxmrs/beam/pull/1

There's definitely more work to do here – more operations to implement,
tests to write, style guides to follow, etc. However, I'm pleased that
there are enough operations implemented to run test pipelines with
assertions.

>From here, what are good next steps?

Best,
Alex

PS – Meeting / design notes are available in this doc:
https://docs.google.com/document/d/1Awj_eNmH-WRSte3bKcCcUlQDiZ5mMKmCO_xV-mHWAak/edit#heading=h.y0pwg4polebc

On 2022/06/08 14:22:41 Ryan Abernathey wrote:
> Dear Beamer,
>
> Thank you for all of your work on this amazing project. I am new to Beam
> and am quite excited about its potential to help with some data processing
> challenges in my field of climate science.
>
> Our community is interested in running Beam on Dask Distributed clusters,
> which we already know how to deploy. This has been discussed at
> https://issues.apache.org/jira/browse/BEAM-5336 and
> https://github.com/apache/beam/issues/18962. It seems technically
feasible.
>
> We are trying to organize a meeting next week to kickstart and coordinate
> this effort. It would be great if we could entrain some Beam maintainers
> into this meeting. If you have interest in this topic and are available
> next week, please share your availability here -
> https://www.when2meet.com/?15861604-jLnA4
>
> Alternatively, if you have any guidance or suggestions you wish to provide
> by email or GitHub discussion, we welcome your input.
>
> Thanks again for your open source work.
>
> Best,
> Ryan Abernathey
>


Re: Join a meeting to help coordinate implementing a Dask Runner for Beam

2022-06-21 Thread Austin Bennett
Looks/Sounds great!

On Tue, Jun 21, 2022 at 11:06 AM Alex Merose  wrote:

> We had a great meeting last week on this topic! Here is a proposal /
> meeting notes doc:
>
> https://docs.google.com/document/d/1Awj_eNmH-WRSte3bKcCcUlQDiZ5mMKmCO_xV-mHWAak/edit#heading=h.y0pwg4polebc
>
> Tomorrow, another engineer (https://github.com/cisaacstern) and I are
> meeting to create an initial prototype of the Dask runner in the main Beam
> repo. Let us know if you'd like to help out in any way. We'll post updates
> in this mailing list + the above doc.
>
> Best,
> Alex Merose
>
> On 2022/06/08 14:22:41 Ryan Abernathey wrote:
> > Dear Beamer,
> >
> > Thank you for all of your work on this amazing project. I am new to Beam
> > and am quite excited about its potential to help with some data
> processing
> > challenges in my field of climate science.
> >
> > Our community is interested in running Beam on Dask Distributed clusters,
> > which we already know how to deploy. This has been discussed at
> > https://issues.apache.org/jira/browse/BEAM-5336 and
> > https://github.com/apache/beam/issues/18962. It seems technically
> feasible.
> >
> > We are trying to organize a meeting next week to kickstart and coordinate
> > this effort. It would be great if we could entrain some Beam maintainers
> > into this meeting. If you have interest in this topic and are available
> > next week, please share your availability here -
> > https://www.when2meet.com/?15861604-jLnA4
> >
> > Alternatively, if you have any guidance or suggestions you wish to
> provide
> > by email or GitHub discussion, we welcome your input.
> >
> > Thanks again for your open source work.
> >
> > Best,
> > Ryan Abernathey
> >
>


Re: Join a meeting to help coordinate implementing a Dask Runner for Beam

2022-06-13 Thread Kenneth Knowles
I would love to try to join any meetings if you add me. My calendar is too
chaotic to be useful on the when2meet :-) but I can often move things
around.

Kenn

On Wed, Jun 8, 2022 at 2:50 PM Brian Hulette  wrote:

> Thanks for reaching out, Ryan, this sounds really cool. I added my
> availability to the calendar since I'm interested in this space, but I'm
> not sure I can offer much help - I don't have any experience building a
> runner, to date I've worked exclusively on the SDK side of Beam. So I hope
> some other folks can join as well :)
>
> @Pablo Estrada  might have some useful insight - he's
> been working on a spike to build a Ray runner.
>
>
> On Wed, Jun 8, 2022 at 12:53 PM Robert Bradshaw 
> wrote:
>
>> This sounds like a great project. Unfortunately I wouldn't be able to
>> meet next week, but would be happy to meet some other time and if that
>> doesn't work answer questions over email, etc. Looking forward to a
>> Dask runner.
>>
>> On Wed, Jun 8, 2022 at 9:04 AM Ryan Abernathey
>>  wrote:
>> >
>> > Dear Beamer,
>> >
>> > Thank you for all of your work on this amazing project. I am new to
>> Beam and am quite excited about its potential to help with some data
>> processing challenges in my field of climate science.
>> >
>> > Our community is interested in running Beam on Dask Distributed
>> clusters, which we already know how to deploy. This has been discussed at
>> https://issues.apache.org/jira/browse/BEAM-5336 and
>> https://github.com/apache/beam/issues/18962. It seems technically
>> feasible.
>> >
>> > We are trying to organize a meeting next week to kickstart and
>> coordinate this effort. It would be great if we could entrain some Beam
>> maintainers into this meeting. If you have interest in this topic and are
>> available next week, please share your availability here -
>> https://www.when2meet.com/?15861604-jLnA4
>> >
>> > Alternatively, if you have any guidance or suggestions you wish to
>> provide by email or GitHub discussion, we welcome your input.
>> >
>> > Thanks again for your open source work.
>> >
>> > Best,
>> > Ryan Abernathey
>> >
>>
>


Join a meeting to help coordinate implementing a Dask Runner for Beam

2022-06-08 Thread Ryan Abernathey
Dear Beamer,

Thank you for all of your work on this amazing project. I am new to Beam
and am quite excited about its potential to help with some data processing
challenges in my field of climate science.

Our community is interested in running Beam on Dask Distributed clusters,
which we already know how to deploy. This has been discussed at
https://issues.apache.org/jira/browse/BEAM-5336 and
https://github.com/apache/beam/issues/18962. It seems technically feasible.

We are trying to organize a meeting next week to kickstart and coordinate
this effort. It would be great if we could entrain some Beam maintainers
into this meeting. If you have interest in this topic and are available
next week, please share your availability here -
https://www.when2meet.com/?15861604-jLnA4

Alternatively, if you have any guidance or suggestions you wish to provide
by email or GitHub discussion, we welcome your input.

Thanks again for your open source work.

Best,
Ryan Abernathey