Hi all,

We've been making quite some progress these last weeks. I'll give a short
status update on where we are right now.

Our current goal is to make all unittests succeed in Python 3. We are
currently at:
1842 tests: (SKIP=350, errors=100, failures=9)

All of these remaining errors and failures are in the io and examples
packages, except for some non-blocking failing typehints tests. All other
packages have been ported. We're currently working on porting the io
package, and have finished all core modules (iobase, filesystemio,
filebasedsource/sink, textio, ...). This allowed us to fix the wordcount
end to end examples, which now run successfully on Python 3! [1]

Next up are all other io sources and sinks. Most of the skipped tests are
due to missing GCP components in the Python 3 test suite. We will add a
separate GCP test suite for Python 3 when we start porting the first GCP io
module, which should be very soon.

Anyone who wants to help out, can start with porting one of these
sources/sinks.

Kind regards,
Robbe

[1] https://github.com/apache/beam/pull/7447

[image: https://ml6.eu] <https://ml6.eu/>

* Robbe Sneyders*

ML6 Gent
<https://www.google.be/maps/place/ML6/@51.037408,3.7044893,17z/data=!3m1!4b1!4m5!3m4!1s0x47c37161feeca14b:0xb8f72585fdd21c90!8m2!3d51.037408!4d3.706678?hl=nl>

M: +32 474 71 31 08


On Tue, 8 Jan 2019 at 04:58, Ahmet Altay <al...@google.com> wrote:

> +Matthias Feys <matth...@ml6.eu> +Valentyn Tymofieiev
> <valen...@google.com> +Mark Liu <mark...@google.com> could add more
> details here since they are working on Python 3 for a while now.
>
> Hopeful state is that we might have python 3 working with DirectRunner in
> the release after this one (2.12). Mark is also working on getting python 3
> working on a cluster. He has been able to run WordCount on Dataflow service
> with some hacks but it was not yet ready to run out of the box. I would
> like to note that we are targeting python 3 support only for portable
> runners so running on Dataflow and Flink should happen at the same time.
>
> Matthias and Valentyn are still working on converting the SDK to be Python
> 3 compatible. They are now mostly dealing with harder to convert parts of
> the SDK. (e.g. Parts were python 2/3 differences results in performance
> regressions or subtle changes in the behavior).
>
> To the folks working on this, it would be really helpful if you could
> update BEAM-1251 regularly. We have shared this issue with many people and
> not all of them will read this thread.
>
> Ahmet
>
> On Mon, Jan 7, 2019 at 8:15 AM Maximilian Michels <m...@apache.org> wrote:
>
>> Also curious because I see Python 3 requests quite often. I always say,
>> we're
>> close, but how close are we? :)
>>
>> Thanks,
>> Max
>>
>> On 05.01.19 00:03, Manu Zhang wrote:
>> > Guys,
>> >
>> > Happy New Year !!!
>> > I haven't got much time to contribute to Python 3 support. What is the
>> progress
>> > now ? It seems there are quite a few open issues under
>> > https://issues.apache.org/jira/browse/BEAM-1251. People have kept
>> asking about
>> > Python 3 support in tf.transform
>> > (https://github.com/tensorflow/transform/issues/1) which is blocked by
>> BEAM-1251.
>> >
>> > Thanks,
>> > Manu Zhang
>> >
>> >
>> > On Fri, Oct 12, 2018 at 3:17 AM Valentyn Tymofieiev <
>> valen...@google.com
>> > <mailto:valen...@google.com>> wrote:
>> >
>> >     I cc'ed a few folks who are familiar with Jenkins setup on
>> >     https://issues.apache.org/jira/browse/BEAM-5663, I think we can
>> continue the
>> >     discussion there or start a separate thread.
>> >
>> >     On Wed, Oct 10, 2018 at 8:54 PM Manu Zhang <owenzhang1...@gmail.com
>> >     <mailto:owenzhang1...@gmail.com>> wrote:
>> >
>> >         Does anyone know how to set up python version on Jenkins ? It’s
>> Python
>> >         3.5.2 now.
>> >
>> >         Thanks,
>> >         Manu Zhang
>> >         On Oct 5, 2018, 9:24 AM +0800, Valentyn Tymofieiev <
>> valen...@google.com
>> >         <mailto:valen...@google.com>>, wrote:
>> >>         I have put together a guide [1] to help get started with
>> investigating
>> >>         Python 3-related test failures that may be helpful for new
>> folks
>> >>         joining the effort.
>> >>
>> >>         Comments and improvements welcome!
>> >>
>> >>         Thanks,
>> >>         Valentyn
>> >>         [1]
>> >>
>> https://docs.google.com/document/d/1s1BJVCY65LB_SYK1SU1u7NbZiFANoq-nEYaEvzRbYlA
>> >>
>> >>
>> >>         On Thu, Oct 4, 2018 at 11:26 AM Valentyn Tymofieiev
>> >>         <valen...@google.com <mailto:valen...@google.com>> wrote:
>> >>
>> >>             I agree there is some overlap between JIRAs that track
>> individual
>> >>             failures and module-level JIRAs. We originally wanted to
>> do the
>> >>             conversion on a module-by-module basis, however we learned
>> that
>> >>             test failures in some modules require changes in other
>> modules,
>> >>             and it may be a little easier to slice the problem if we
>> focus on
>> >>             classes of failures.
>> >>
>> >>             Module-level JIRAs can still be useful for tracking the end
>> >>             result: tox suites cover all tests in the module in Py3
>> >>             environment, and there are no disabled tests in the module
>> that
>> >>             don't have individual JIRAs tracking them.
>> >>
>> >>             I suggest that folks who are working on module-level JIRAs
>> assign
>> >>             to themselves the JIRAs that track individual failures
>> if/when
>> >>             they are actively addressing them. This way, unassigned
>> >>             problem-specific JIRAs can use help from the community.
>> >>
>> >>             Thanks,
>> >>             Valentyn
>> >>
>> >>
>> >>             On Wed, Oct 3, 2018 at 8:14 PM Manu Zhang <
>> owenzhang1...@gmail.com
>> >>             <mailto:owenzhang1...@gmail.com>> wrote:
>> >>
>> >>                 Thanks Valentyn. Note some test failing issues are
>> covered by
>> >>                 “Finish Python 3 porting for *** module”, e.g.
>> >>                 https://issues.apache.org/jira/browse/BEAM-5315.
>> >>
>> >>                 Manu
>> >>                 在 2018年10月3日 +0800 PM4:18,Valentyn Tymofieiev
>> >>                 <valen...@google.com <mailto:valen...@google.com>>,写道:
>> >>>                 Hi Rakesh and Manu,
>> >>>
>> >>>                 Thanks to both of you for offering help (in different
>> >>>                 threads). It's great to see that more and more people
>> get
>> >>>                 involved with helping to make Beam Python 3
>> compatible!
>> >>>
>> >>>                 There are a few PRs in flight, and several people in
>> the
>> >>>                 community actively work on Python 3 support now. I
>> would be
>> >>>                 happy to coordinate the work so that we don't step at
>> each
>> >>>                 others toes and avoid duplication of effort.
>> >>>
>> >>>                 I recently looked at unit tests that are still
>> failing in
>> >>>                 Python 3 environment  and filed a few issues (within
>> range
>> >>>                 BEAM-5615 - BEAM-5629), to track similar classes of
>> errors.
>> >>>                 You can also find them on Kanban board [1].
>> >>>                 In particular, BEAM-5620 and BEAM-5627 should be easy
>> issues
>> >>>                 to get started.
>> >>>
>> >>>                 There are multiple ways you can help:
>> >>>                 - Helping to rootcause errors. Even a comment why a
>> test is
>> >>>                 failing and a suggestion how to fix it, will be
>> helpful for
>> >>>                 others when you don't have time to do the fix.
>> >>>                 - Helping with code reviews.
>> >>>                 - Reporting new issues (as subtasks to BEAM-1251),
>> >>>                 deduplicating or splitting the existing issues. We
>> probably
>> >>>                 don't want to file a Jira for each of 250+ currently
>> failing
>> >>>                 tests at this point, but it may make sense to track
>> the
>> >>>                 errors that occur repeatedly share the root cause.
>> >>>                 - Fixing the issues. Feel free to assign an issue to
>> yourself
>> >>>                 if you have a fix in mind and plan to actively work
>> on it.
>> >>>                 Due to the nature of the problem it may occasionally
>> happen
>> >>>                 that two issues share the rootcause, or fixing one
>> issue is a
>> >>>                 prerequisite for fixing another issue, so sync to
>> master
>> >>>                 often to make sure the issue you are working on is not
>> >>>                 already fixed.
>> >>>
>> >>>                 I'll also keep an eye on the PRs and will try to keep
>> the
>> >>>                 list of open issues up to date.
>> >>>
>> >>>                 Thanks,
>> >>>                 Valentyn
>> >>>
>> >>>                 [1]:
>> >>>
>> https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=245&view=detail
>> >>>
>> >>>
>> >>>                 On Tue, Oct 2, 2018 at 9:38 AM Pablo Estrada
>> >>>                 <pabl...@google.com <mailto:pabl...@google.com>>
>> wrote:
>> >>>
>> >>>                     Very cool : ) I'm also available to review /
>> merge if you
>> >>>                     need help from my side.
>> >>>                     Best
>> >>>                     -P.
>> >>>
>> >>>                     On Tue, Oct 2, 2018 at 7:45 AM Rakesh Kumar
>> >>>                     <rakeshku...@lyft.com <mailto:
>> rakeshku...@lyft.com>> wrote:
>> >>>
>> >>>                         Hi Rob,
>> >>>
>> >>>                         I am, Rakesh Kumar, using Beam SDK for one of
>> my
>> >>>                         projects at Lyft. I have been working closely
>> with
>> >>>                         Thomas Weise. I have already met a couple of
>> Python
>> >>>                         SDK developers in person.
>> >>>                         I am interested to help migrate to Python 3.
>> You can
>> >>>                         assign me PRs for review. I am also more than
>> happy
>> >>>                         to take a simple ticket to begin development
>> work on
>> >>>                         Beam.
>> >>>
>> >>>                         Thank you,
>> >>>                         Rakesh
>> >>>
>> >>>                         On Wed, Sep 5, 2018 at 9:12 AM Robbe Sneyders
>> >>>                         <robbe.sneyd...@ml6.eu
>> >>>                         <mailto:robbe.sneyd...@ml6.eu>> wrote:
>> >>>
>> >>>                             Hi everyone,
>> >>>
>> >>>                             With the merging of [1], we now have
>> Python 3
>> >>>                             tests running on Jenkins, which allows us
>> to move
>> >>>                             forward with the last step of the Python
>> 3 porting.
>> >>>
>> >>>                             You can follow the progress on the Jira
>> Kanban
>> >>>                             Board [2]. If you're interested in
>> helping by
>> >>>                             porting a module, you can assign one of
>> the
>> >>>                             issues to yourself and start coding. You
>> can find
>> >>>                             the different steps outlined in the design
>> >>>                             document [3].
>> >>>
>> >>>                             We could also use some extra reviewers.
>> If you're
>> >>>                             interested, let us know, and we'll tag
>> you in our
>> >>>                             PRs.
>> >>>
>> >>>                             [1]
>> https://github.com/apache/beam/pull/6266
>> >>>                             [2]
>> >>>
>> https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=245
>> >>>                             [3] https://s.apache.org/beam-python-3
>> >>>
>> >>>                             kind regards,
>> >>>                             Robbe
>> >>>                             --
>> >>>
>> >>>                             https://ml6.eu <https://ml6.eu/>
>> >>>
>> >>>
>> >>>
>> >>>
>> >>>                             *Robbe Sneyders*
>> >>>
>> >>>                             ML6 Gent
>> >>>                             <
>> https://www.google.be/maps/place/ML6/@51.037408,3.7044893,17z/data=!3m1!4b1!4m5!3m4!1s0x47c37161feeca14b:0xb8f72585fdd21c90!8m2!3d51.037408!4d3.706678?hl=nl
>> >
>> >>>
>> >>>                             M: +32 474 71 31 08
>> <tel:+32%20474%2071%2031%2008>
>> >>>
>> >>>                         --
>> >>>                         Rakesh Kumar
>> >>>                         Software Engineer
>> >>>                         510-761-1364 <tel:(510)%20761-1364> |
>> >>>                         <https://www.lyft.com/>
>> >>>
>>
>

Reply via email to