+1 for going by the commits since this is what matters at the end of the
day. Also, many issues may not get tagged correctly for a given release due
to either the contributor not tagging the issue or due to commits for the
issue spanning multiple Beam releases.

For example,

For all commits in a given release RC:
  * If we find a Github issue for the commit: add a notice to the Github
issue
  * Else: add the notice to a generic issue for the release including tags
for the commit ID, PR author, and the committer who merged the PR.

Thanks,
Cham




On Mon, Oct 23, 2023 at 11:49 AM Danny McCormick via dev <
dev@beam.apache.org> wrote:

> I'd probably vote to include both the issue filer and the contributor. It
> is pretty equally straightforward - one way to do this would be using all
> issues related to that release's milestone and extracting the issue author
> and the issue closer.
>
> This does leave out the (unfortunately sizable) set of contributions that
> don't have an associated issue; if we're worried about that, we could
> always fall back to anyone with a commit in the last release who doesn't
> have an associated issue (aka what I thought we were initially proposing
> and what I think Airflow does today).
>
> I'm pretty much +1 on any sort of automation here, and it certainly can
> come in stages :)
>
> On Mon, Oct 23, 2023 at 1:50 PM Johanna Öjeling via dev <
> dev@beam.apache.org> wrote:
>
>> Yes that's a good point to include also those who created the issue.
>>
>> On Mon, Oct 23, 2023, 19:18 Robert Bradshaw via dev <dev@beam.apache.org>
>> wrote:
>>
>>> On Mon, Oct 23, 2023 at 7:26 AM Danny McCormick via dev <
>>> dev@beam.apache.org> wrote:
>>>
>>>> So to summarize, I think there's broad consensus (or at least lazy
>>>> consensus) around the following:
>>>>
>>>> - (1) Updating our release email/guidelines to be more specific about
>>>> what we mean by release validation/how to be helpful during this process.
>>>> This includes both encouraging validation within each user's own code base
>>>> and encouraging people to document/share their process of validation and
>>>> link it in the release spreadsheet.
>>>> - (2) Doing something like what Airflow does (#29424
>>>> <https://github.com/apache/airflow/issues/29424>) and creating an
>>>> issue asking people who have contributed to the current release to help
>>>> validate their changes.
>>>>
>>>> I'm also +1 on doing both of these. The first bit (updating our
>>>> guidelines) is relatively easy - it should just require updating
>>>> https://github.com/apache/beam/blob/master/contributor-docs/release-guide.md#vote-and-validate-the-release-candidate
>>>> .
>>>>
>>>> I took a look at the second piece (copying what Airflow does) to see if
>>>> we could just copy their automation, but it looks like it's tied to
>>>> airflow breeze
>>>> <https://github.com/apache/airflow/blob/main/dev/breeze/src/airflow_breeze/provider_issue_TEMPLATE.md.jinja2>
>>>> (their repo-specific automation tooling), so we'd probably need to build
>>>> the automation ourselves. It shouldn't be terrible, basically we'd want a
>>>> GitHub Action that compares the current release tag with the last release
>>>> tag, grabs all the commits in between, parses them to get the author, and
>>>> creates an issue with that data, but it does represent more effort than
>>>> just updating a markdown file. There might even be an existing Action that
>>>> can help with this, I haven't looked too hard.
>>>>
>>>
>>> I was thinking along the lines of a script that would scrape the issues
>>> resolved in a given release and add a comment to them noting that the
>>> change is in release N and encouraging (with clear instructions) how this
>>> can be validated. Creating a "validate this release" issue with all
>>> "contributing" participants could be an interesting way to do this as well.
>>> (I think it'd be valuable to get those who filed the issue, not just those
>>> who fixed it, to validate.)
>>>
>>>
>>>> As our next release manager, I'm happy to review PRs for either of
>>>> these if anyone wants to volunteer to help out. If not, I'm happy to update
>>>> the guidelines, but I probably won't have time to add the commit inspection
>>>> tooling (I'm planning on throwing any extra time towards continuing to
>>>> automate release candidate creation which is currently a more impactful
>>>> problem IMO). I would very much like it if both of these things happened
>>>> though :)
>>>>
>>>> Thanks,
>>>> Danny
>>>>
>>>> On Mon, Oct 23, 2023 at 10:05 AM XQ Hu <x...@google.com> wrote:
>>>>
>>>>> +1. This is a great idea to try. @Danny McCormick
>>>>> <dannymccorm...@google.com> FYI as our next release manager.
>>>>>
>>>>> On Wed, Oct 18, 2023 at 2:30 PM Johanna Öjeling via dev <
>>>>> dev@beam.apache.org> wrote:
>>>>>
>>>>>> When I have contributed to Apache Airflow, they have tagged all
>>>>>> contributors concerned in a GitHub issue when the RC is available and 
>>>>>> asked
>>>>>> us to validate it. Example: #29424
>>>>>> <https://github.com/apache/airflow/issues/29424>.
>>>>>>
>>>>>> I found that to be an effective way to notify contributors of the RC
>>>>>> and nudge them to help out. In the issue description there is a reference
>>>>>> to the guidelines on how to test the RC and a note that people are
>>>>>> encouraged to vote on the mailing list (which could admittedly be more
>>>>>> highlighted because I did not pay attention to it until now and was 
>>>>>> unaware
>>>>>> that contributors had a vote).
>>>>>>
>>>>>> It might be an idea to consider something similar here to increase
>>>>>> the participation?
>>>>>>
>>>>>> On Tue, Oct 17, 2023 at 7:01 PM Jack McCluskey via dev <
>>>>>> dev@beam.apache.org> wrote:
>>>>>>
>>>>>>> I'm +1 on helping explain what we mean by "validate the RC" since
>>>>>>> we're really just asking users to see if their existing use cases work
>>>>>>> along with our typical slate of tests. I don't know if offloading that 
>>>>>>> work
>>>>>>> to our active validators is the right approach though, 
>>>>>>> documentation/screen
>>>>>>> share of their specific workflow is definitely less useful than having a
>>>>>>> more general outline of how to install the RC and things to look out for
>>>>>>> when testing.
>>>>>>>
>>>>>>> On Tue, Oct 17, 2023 at 12:55 PM Austin Bennett <aus...@apache.org>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Great effort.  I'm also interested in streamlining releases -- so
>>>>>>>> if there are alot of manual tests that could be automated, would be 
>>>>>>>> great
>>>>>>>> to discover and then look to address.
>>>>>>>>
>>>>>>>> On Tue, Oct 17, 2023 at 8:47 AM Robert Bradshaw via dev <
>>>>>>>> dev@beam.apache.org> wrote:
>>>>>>>>
>>>>>>>>> +1
>>>>>>>>>
>>>>>>>>> I would also strongly suggest that people try out the release
>>>>>>>>> against their own codebases. This has the benefit of ensuring the 
>>>>>>>>> release
>>>>>>>>> won't break your own code when they go out, and stress-tests the new 
>>>>>>>>> code
>>>>>>>>> against real-world pipelines. (Ideally our own tests are all passing, 
>>>>>>>>> and
>>>>>>>>> this validation is automated as much as possible (though ensuring it
>>>>>>>>> matches our documentation and works in a clean environment still has
>>>>>>>>> value), but there's a lot of code and uses out there that we don't 
>>>>>>>>> have
>>>>>>>>> access to during normal Beam development.)
>>>>>>>>>
>>>>>>>>> On Tue, Oct 17, 2023 at 8:21 AM Svetak Sundhar via dev <
>>>>>>>>> dev@beam.apache.org> wrote:
>>>>>>>>>
>>>>>>>>>> Hi all,
>>>>>>>>>>
>>>>>>>>>> I’ve participated in RC testing for a few releases and have
>>>>>>>>>> observed a bit of a knowledge gap in how releases can be tested. 
>>>>>>>>>> Given that
>>>>>>>>>> Beam encourages contributors to vote on RC’s regardless of tenure, 
>>>>>>>>>> and that
>>>>>>>>>> voting on an RC is a relatively low-effort, high leverage way to 
>>>>>>>>>> influence
>>>>>>>>>> the release of the library, I propose the following:
>>>>>>>>>>
>>>>>>>>>> During the vote for the next release, voters can document the
>>>>>>>>>> process they followed on a separate document, and add the link on 
>>>>>>>>>> column G
>>>>>>>>>> here
>>>>>>>>>> <https://docs.google.com/spreadsheets/d/1qk-N5vjXvbcEk68GjbkSZTR8AGqyNUM-oLFo_ZXBpJw/edit#gid=437054928>.
>>>>>>>>>> One step further, could be a screencast of running the test, and 
>>>>>>>>>> attaching
>>>>>>>>>> a link of that.
>>>>>>>>>>
>>>>>>>>>> We can keep repeating this through releases until we have
>>>>>>>>>> documentation for many of the different tests. We can then add these 
>>>>>>>>>> docs
>>>>>>>>>> into the repo.
>>>>>>>>>>
>>>>>>>>>> I’m proposing this because I’ve gathered the following feedback
>>>>>>>>>> from colleagues that are tangentially involved with Beam: They are
>>>>>>>>>> interested in participating in release validation, but don’t know 
>>>>>>>>>> how to
>>>>>>>>>> get started. Happy to hear other suggestions too, if there are any to
>>>>>>>>>> address the above.
>>>>>>>>>>
>>>>>>>>>> Thanks,
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Svetak Sundhar
>>>>>>>>>>
>>>>>>>>>>   Data Engineer
>>>>>>>>>> s <nellywil...@google.com>vetaksund...@google.com
>>>>>>>>>>
>>>>>>>>>>

Reply via email to