Hi, Alexey,

Yes, now we can keep updating the website as before. Here is a contributor
guide: https://github.com/apache/beam/blob/master/website/CONTRIBUTE.md

Thanks for checking :)

Aizhamal


On Tue, May 26, 2020 at 8:39 AM Alexey Romanenko <aromanenko....@gmail.com>
wrote:

> Thank you to everybody who worked on this migration.
>
> I just wanted to clarify - does it mean, since it’s finished now, we can
> update a website as before?
>
> On 15 May 2020, at 01:33, Aizhamal Nurmamat kyzy <aizha...@apache.org>
> wrote:
>
> Thank you Ahmet, Brian, Robert and everyone else who spent time working on
> this. The pull requests are now merged and the website seems to be working
> as expected [1].
>
> One minor issue that I have noticed is that the code blocks have a grey
> background, which makes it less accessible than before. I created a Jira
> issue for this [2], and will follow up to get it fixed. If you notice any
> other issues, please file Jira issues and let me know.
>
> Hope you are all safe,
> Aizhamal
>
> [1] https://beam.apache.org/
> [2] https://issues.apache.org/jira/browse/BEAM-10001
>
> On Thu, May 14, 2020 at 11:25 AM Pablo Estrada <pabl...@google.com> wrote:
>
>> Here's a zipped-up tree from a staged sample of the website:
>> https://drive.google.com/file/d/1LKL936tBJ79jpjvlL5vC5uYYwTHsWXiJ/view?usp=sharing
>>
>> I'd also suggest tagging the commit, so we can find the fist commit later
>> on for reference. I can push the tag after the PR is merged.
>>
>> On Thu, May 14, 2020 at 10:43 AM Ahmet Altay <al...@google.com> wrote:
>>
>>>
>>>
>>> On Thu, May 14, 2020 at 9:16 AM Aizhamal Nurmamat kyzy <
>>> aizha...@apache.org> wrote:
>>>
>>>> Thank you all for reviewing and validating this pull request. I see
>>>> that all tests are passing now, should we merge it?
>>>>
>>>
>>> +1 to merging now.
>>>
>>> Before the merge, please share a link to an archive copy of the old
>>> website. After the merge, please try out the live website see if it is
>>> working as expected.
>>>
>>>
>>>>
>>>> On Wed, May 13, 2020, 5:41 PM Ahmet Altay <al...@google.com> wrote:
>>>>
>>>>> Thank you! Let's merge it once tests are done.
>>>>>
>>>>> On Wed, May 13, 2020 at 5:23 PM Robert Bradshaw <rober...@google.com>
>>>>> wrote:
>>>>>
>>>>>> I took a (non-comprehensive) look at these as well, and didn't see
>>>>>> any issues, so am happy to sign off on this. Thanks Nam, Brian, Ahmet, 
>>>>>> and
>>>>>> everyone else.
>>>>>>
>>>>>> On Wed, May 13, 2020 at 7:58 AM Nam Bui <nam....@polidea.com> wrote:
>>>>>>
>>>>>>> Hi Ahmet,
>>>>>>> "Does this mean the internal links (e.g. contribute/team) will
>>>>>>> disappear?"
>>>>>>> Yes, I'd like to get rid of them. And to make sure it won't appear
>>>>>>> to confuse people, I replaced all of the spots using "contribute/team" 
>>>>>>> with
>>>>>>> the external one. Currently, we only have 2 "redirect_to" links which 
>>>>>>> are
>>>>>>> "contribute/team" & "contribute/project/team", so this act won't have 
>>>>>>> any
>>>>>>> affects.
>>>>>>> Also, based on your question, I just added a section in the
>>>>>>> documentation (CONTRIBUTE.md), which mentions the replaced/removed 
>>>>>>> features
>>>>>>> of Jekyll in terms of writing a new blog post or documentation in Hugo.
>>>>>>>
>>>>>>
>>>>> Got it. The main effect will be any one has a bookmark/link to these
>>>>> pages, those links will no longer work. It is fine if it is only limited 
>>>>> to
>>>>> these 2 urls.
>>>>>
>>>>>
>>>>>>
>>>>>>>
>>>>>>> On Wed, May 13, 2020 at 4:17 AM Ahmet Altay <al...@google.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> - I reviewed the diff output with Nam's explanations. The change
>>>>>>>> looks minimal. Large diffs are primarily coming from index and redirect
>>>>>>>> files. codeblocks have differences but the content is seemingly 
>>>>>>>> preserved.
>>>>>>>> IIUC, the source of truth is snippet files anyway. (It would be good 
>>>>>>>> to get
>>>>>>>> one more set of eyes on this.)
>>>>>>>> - Brian and I reviewed the infrastructure changes. They look
>>>>>>>> reasonable.
>>>>>>>>
>>>>>>>> I think PR is very close to a mergeable state. Especially if we can
>>>>>>>> get an archive copy of the current website, I will be comfortable with 
>>>>>>>> the
>>>>>>>> merge.
>>>>>>>>
>>>>>>>> And, thank you Nam for your work so far.
>>>>>>>>
>>>>>>>> On Tue, May 12, 2020 at 4:13 PM Nam Bui <nam....@polidea.com>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> Hi,
>>>>>>>>>
>>>>>>>>> A new commit covers Robert's script is pushed [1], and also the
>>>>>>>>> script output is attached in this email.
>>>>>>>>>
>>>>>>>>> Based on the diff output of the script, my strategy is looking at
>>>>>>>>> the sections which contain the large/massive removed texts, to make 
>>>>>>>>> sure
>>>>>>>>> that there are no lost content or files. And below are all of the 
>>>>>>>>> links
>>>>>>>>> which have large of the removed content.
>>>>>>>>>
>>>>>>>>> - Detection:
>>>>>>>>> These links lost some of the contents. Fixed!
>>>>>>>>> + documentation/runners/jstorm/index.html
>>>>>>>>> + documentation/dsls/sql/calcite/lexical-structure/index.html
>>>>>>>>> + documentation/dsls/sql/zetasql/data-types/index.html
>>>>>>>>> + documentation/dsls/sql/zetasql/query-syntax/index.html
>>>>>>>>>
>>>>>>>>> - Aliases:
>>>>>>>>> These links are redirected links. So in Hugo, these HTML files
>>>>>>>>> only include redirected URLs. I also took a look at them to ensure the
>>>>>>>>> content was there.
>>>>>>>>> + documentation/dsls/sql/calcite/lexical/index.html
>>>>>>>>> + old URLs of blog posts
>>>>>>>>>
>>>>>>>>> - Ignore:
>>>>>>>>> Hugo and Jekyll have different structures of code highlighters
>>>>>>>>> rendering in HTML. Ahmed & Pablo agree with me that its fair to 
>>>>>>>>> ignore them
>>>>>>>>> for now.
>>>>>>>>> + codeblocks
>>>>>>>>>
>>>>>>>>> - Missing files:
>>>>>>>>> The script returns some of “missing files” status
>>>>>>>>> + coming-soon.html (this file was used nowhere in Jekyll, so I
>>>>>>>>> didn’t migrate to Hugo)
>>>>>>>>> + documentation/dsls/sql/statements/select/index.html (aliases)
>>>>>>>>> + blog/2019/04/25/beam-2.12.0.html (fixed!)
>>>>>>>>> + blog/2020/05/08/beam-summit-digital-2020.html (new blog post,
>>>>>>>>> added!)
>>>>>>>>> + v2/index.html (this file was used nowhere in Jekyll, so I didn’t
>>>>>>>>> migrate to Hugo)
>>>>>>>>> + contribute/team/index.html (mentioned in “redirect_to” below)
>>>>>>>>> + contribute/project/team/index.html (mentioned in “redirect_to”
>>>>>>>>> below)
>>>>>>>>>
>>>>>>>>> - “redirect_to”:
>>>>>>>>> In Jekyll, there is a feature called “redirect_to”. For instance,
>>>>>>>>> you click on an internal link “contribute/team/” to reach the markdown
>>>>>>>>> “team.md”, then from the markdown file, it redirects you to the 
>>>>>>>>> external
>>>>>>>>> URL “https://example.com”.
>>>>>>>>> However, there is no such feature in Hugo. My solution is to
>>>>>>>>> directly replace “contribute/team/” with “https://example.com”.
>>>>>>>>>
>>>>>>>>
>>>>>>>> Does this mean the internal links (e.g. contribute/team) will
>>>>>>>> disappear?
>>>>>>>>
>>>>>>>>
>>>>>>>>>
>>>>>>>>> [1] https://github.com/apache/beam/pull/11554
>>>>>>>>>
>>>>>>>>> On Mon, May 11, 2020 at 7:34 PM Nam Bui <nam....@polidea.com>
>>>>>>>>> wrote:
>>>>>>>>>
>>>>>>>>>> Updates for today:
>>>>>>>>>> - Thanks Brian & Ahmet for your reviews. I left my comments for
>>>>>>>>>> some of the questions and also adapted new changes to the reviews 
>>>>>>>>>> [1].
>>>>>>>>>> - I see that the new blog post was merged yesterday, so I added
>>>>>>>>>> it to the PR as well.
>>>>>>>>>>
>>>>>>>>>> I briefly tried the script from Robert with the input of build
>>>>>>>>>> files from old and new websites. It seemed to work well in terms of
>>>>>>>>>> detecting missing files (or probably wrong links leading to missing 
>>>>>>>>>> files).
>>>>>>>>>> I will push another commit to fix all that up, hope can be tomorrow.
>>>>>>>>>>
>>>>>>>>>> [1]
>>>>>>>>>> https://github.com/apache/beam/pull/11554#issuecomment-626792031
>>>>>>>>>>
>>>>>>>>>> Best regards,
>>>>>>>>>> Nam
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> On Mon, May 11, 2020 at 9:01 AM Nam Bui <nam....@polidea.com>
>>>>>>>>>> wrote:
>>>>>>>>>>
>>>>>>>>>>> Hi,
>>>>>>>>>>>
>>>>>>>>>>> @Ahmet: Yeah, it's all clear to me. :)
>>>>>>>>>>> @Robert: Thanks for your ideas and also the script. It really
>>>>>>>>>>> helps me to serve my works.
>>>>>>>>>>>
>>>>>>>>>>> Best regard!
>>>>>>>>>>>
>>>>>>>>>>> On Sat, May 9, 2020 at 2:10 AM Ahmet Altay <al...@google.com>
>>>>>>>>>>> wrote:
>>>>>>>>>>>
>>>>>>>>>>>> This sounds reasonable to me. Thank you. Nam, does it make
>>>>>>>>>>>> sense to you?
>>>>>>>>>>>>
>>>>>>>>>>>> On Fri, May 8, 2020 at 11:53 AM Robert Bradshaw <
>>>>>>>>>>>> rober...@google.com> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>>> I'd really like to not see this work go to waste, both the
>>>>>>>>>>>>> original revision, the further efforts Nam has done in making it 
>>>>>>>>>>>>> more
>>>>>>>>>>>>> manageable to review, and the work put into reviewing this so 
>>>>>>>>>>>>> far, so we
>>>>>>>>>>>>> can get the benefits of being on Hugo. How about this for a
>>>>>>>>>>>>> concrete proposal:
>>>>>>>>>>>>>
>>>>>>>>>>>>> (1) We get "standard" approval from one or more committers for
>>>>>>>>>>>>> the infrastructure changes, just as with any other PR. Brian has
>>>>>>>>>>>>> already started this, but if others could step up as well that'd 
>>>>>>>>>>>>> be great.
>>>>>>>>>>>>>
>>>>>>>>>>>>> (2) Reviewers (and authors) typically count on (or request)
>>>>>>>>>>>>> sufficient automated test coverage to augment the fact that their 
>>>>>>>>>>>>> eyeballs
>>>>>>>>>>>>> are fallible, which is something that is missing here (and given 
>>>>>>>>>>>>> the size
>>>>>>>>>>>>> of the change not easily compensated for by a more detailed 
>>>>>>>>>>>>> manual review).
>>>>>>>>>>>>> How about we use the script above (or similar) as an automated 
>>>>>>>>>>>>> test to
>>>>>>>>>>>>> validate the website's contents haven't (materially) changed. I 
>>>>>>>>>>>>> feel we've
>>>>>>>>>>>>> validated enough that the style looks good via spot checking 
>>>>>>>>>>>>> (which is
>>>>>>>>>>>>> something that should work on all pages if it works on one). The 
>>>>>>>>>>>>> diff
>>>>>>>>>>>>> between the current site and the newly generated site should be 
>>>>>>>>>>>>> empty (it
>>>>>>>>>>>>> might already be [1]), or at least we should get a stamp of 
>>>>>>>>>>>>> approval on the
>>>>>>>>>>>>> plain-text diff (which should be small), before merging.
>>>>>>>>>>>>>
>>>>>>>>>>>>> (3) To make things easier, everyone holds off on making any
>>>>>>>>>>>>> changes to the old site until a fixed future date (say, next 
>>>>>>>>>>>>> Wednesday).
>>>>>>>>>>>>> Hopefully we can get it merged by then. If not, a condition for 
>>>>>>>>>>>>> merging
>>>>>>>>>>>>> would be a commitment incorporating new changes after this date.
>>>>>>>>>>>>>
>>>>>>>>>>>>> Does this sound reasonable?
>>>>>>>>>>>>>
>>>>>>>>>>>>> - Robert
>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>> [1] I'd be curious as to how small the diff already is, but my
>>>>>>>>>>>>> script relies on local directories with the generated HTML, which 
>>>>>>>>>>>>> I don't
>>>>>>>>>>>>> have handy at the moment.
>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Fri, May 8, 2020 at 10:45 AM Robert Bradshaw <
>>>>>>>>>>>>> rober...@google.com> wrote:
>>>>>>>>>>>>>
>>>>>>>>>>>>>> Here's a script that we could run on the old and new sites
>>>>>>>>>>>>>> that should quickly catch any major issues but not get caught up 
>>>>>>>>>>>>>> in
>>>>>>>>>>>>>> formatting minutia.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> On Fri, May 8, 2020 at 10:23 AM Robert Bradshaw <
>>>>>>>>>>>>>> rober...@google.com> wrote:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> On Fri, May 8, 2020 at 9:58 AM Aizhamal Nurmamat kyzy <
>>>>>>>>>>>>>>> aizha...@apache.org> wrote:
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> I understand the difficulty, and this certainly comes with
>>>>>>>>>>>>>>>> lessons learned for future similar projects.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> To your questions Robert:
>>>>>>>>>>>>>>>> (1 and 2) I will commit to review the text in the resulting
>>>>>>>>>>>>>>>> pages. I will try and use some automation to extract visible 
>>>>>>>>>>>>>>>> text from each
>>>>>>>>>>>>>>>> page and diff it with the current state of the website. I can 
>>>>>>>>>>>>>>>> do this
>>>>>>>>>>>>>>>> starting next week. From some quick research, there seem to be 
>>>>>>>>>>>>>>>> tools that
>>>>>>>>>>>>>>>> help with this analysis (
>>>>>>>>>>>>>>>> https://stackoverflow.com/questions/3286955/compare-two-websites-and-see-if-they-are-equal
>>>>>>>>>>>>>>>> )
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> At first glance it looks like these tools would give diffs
>>>>>>>>>>>>>>> that are *larger* than the 47K one we're struggling to review 
>>>>>>>>>>>>>>> here.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> By remaining in this state, we hold others up from making
>>>>>>>>>>>>>>>> changes, or we increase the amount of work needed after 
>>>>>>>>>>>>>>>> merging to port
>>>>>>>>>>>>>>>> over changes that may be missed. If we move forward, new 
>>>>>>>>>>>>>>>> changes can be
>>>>>>>>>>>>>>>> done on top of the new website.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> I agree we don't want to hold others up from making changes.
>>>>>>>>>>>>>>> However, the amount of work to port changes over seems small in 
>>>>>>>>>>>>>>> comparison
>>>>>>>>>>>>>>> to everything else that is being discussed here. (It also 
>>>>>>>>>>>>>>> provides good
>>>>>>>>>>>>>>> incentives to reach the bar quickly and has the advantage of 
>>>>>>>>>>>>>>> falling on the
>>>>>>>>>>>>>>> right people.) (3) will still take some time.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> If we go this route, we're lowering the bar for doc changes,
>>>>>>>>>>>>>>> but not removing it.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> (3) This makes sense. Brian, would you be able to spend
>>>>>>>>>>>>>>>> some time to look at the automation changes (build files and 
>>>>>>>>>>>>>>>> scripts) to
>>>>>>>>>>>>>>>> ensure they look fine?
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> I would also like to write a post mortem to extract lessons
>>>>>>>>>>>>>>>> learned and avoid this situation in the future.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> On Fri, May 8, 2020 at 9:44 AM Brian Hulette <
>>>>>>>>>>>>>>>> bhule...@google.com> wrote:
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> I'm -0 on merging as-is. I have the same concerns as
>>>>>>>>>>>>>>>>> Robert and he's voiced them very well so I won't waste time 
>>>>>>>>>>>>>>>>> re-airing them.
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> (2) I spot checked the content, pulled out some common
>>>>>>>>>>>>>>>>>> patterns, and
>>>>>>>>>>>>>>>>>> it mostly looks good, but there were also some issues
>>>>>>>>>>>>>>>>>> (e.g. several
>>>>>>>>>>>>>>>>>> pages were replaced with the contents from entirely
>>>>>>>>>>>>>>>>>> different pages).
>>>>>>>>>>>>>>>>>> I would be more comfortable if, say, a smoke test of
>>>>>>>>>>>>>>>>>> comparing the old
>>>>>>>>>>>>>>>>>> and new sites, with html tags stripped and ignoring
>>>>>>>>>>>>>>>>>> whitespace,
>>>>>>>>>>>>>>>>>> yielded what should be empty diffs.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> Can you share any details about this analysis?
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> It was basically paging through the diff, adding things to
>>>>>>>>>>>>>>> the sed script, and then looking at more diffs.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> +1 for verifying the old and new are the same by diffing
>>>>>>>>>>>>>>>>> the output.
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> (3) It'd be good to have someone give a stamp of approval
>>>>>>>>>>>>>>>>>> on the
>>>>>>>>>>>>>>>>>> infrastructure changes, at least to validate that we're
>>>>>>>>>>>>>>>>>> not going to
>>>>>>>>>>>>>>>>>> be taking on extra tech debt with regard to jenkins
>>>>>>>>>>>>>>>>>> stability and
>>>>>>>>>>>>>>>>>> developer workflow. I see that Brian has at least looked
>>>>>>>>>>>>>>>>>> at this some.
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> My involvement so far was just recognizing a problem
>>>>>>>>>>>>>>>>> (creating root-owned files on jenkins workers) and helping to 
>>>>>>>>>>>>>>>>> fix it. If
>>>>>>>>>>>>>>>>> there's anyone available who's familiar with the website 
>>>>>>>>>>>>>>>>> infrastructure it
>>>>>>>>>>>>>>>>> would be great if they could take a look instead (if not I 
>>>>>>>>>>>>>>>>> could probably
>>>>>>>>>>>>>>>>> acquaint myself enough to review).
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> On Thu, May 7, 2020 at 11:57 PM Robert Bradshaw <
>>>>>>>>>>>>>>>>> rober...@google.com> wrote:
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> This is a tough situation.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> It would have been much better if this transition was
>>>>>>>>>>>>>>>>>> structured in
>>>>>>>>>>>>>>>>>> such a way that the review was more manageable (e.g. the
>>>>>>>>>>>>>>>>>> suggestion of
>>>>>>>>>>>>>>>>>> scripts, not mixing in voluminous unnecessary changes
>>>>>>>>>>>>>>>>>> like whitespace,
>>>>>>>>>>>>>>>>>> and not updating content), and possibly even
>>>>>>>>>>>>>>>>>> incrementally (e.g. the
>>>>>>>>>>>>>>>>>> new site would have been developed over multiple PRs in a
>>>>>>>>>>>>>>>>>> subdomain or
>>>>>>>>>>>>>>>>>> subdirectory while being worked on). But hindsight is
>>>>>>>>>>>>>>>>>> 20/20 and no
>>>>>>>>>>>>>>>>>> one, myself included, thought to bring this up when the
>>>>>>>>>>>>>>>>>> original
>>>>>>>>>>>>>>>>>> migration was proposed, so this is more something to keep
>>>>>>>>>>>>>>>>>> in mind for
>>>>>>>>>>>>>>>>>> the future. I also appreciate the efforts that have been
>>>>>>>>>>>>>>>>>> made to clean
>>>>>>>>>>>>>>>>>> things up (e.g. preserving history) and address feedback.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> So, where do we go from here? My first thought is that I
>>>>>>>>>>>>>>>>>> really don't
>>>>>>>>>>>>>>>>>> want to set a precedent that just because a PR "will
>>>>>>>>>>>>>>>>>> require a large
>>>>>>>>>>>>>>>>>> effort" and in a state that if we don't "move forward and
>>>>>>>>>>>>>>>>>> merge what
>>>>>>>>>>>>>>>>>> we have now" then "work done so far will be lost" means
>>>>>>>>>>>>>>>>>> that we think
>>>>>>>>>>>>>>>>>> it's OK to forgo doing a proper review.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> On the other hand, there are some mitigating factors with
>>>>>>>>>>>>>>>>>> this being
>>>>>>>>>>>>>>>>>> the website and not the code in that "bugs," though
>>>>>>>>>>>>>>>>>> possibly
>>>>>>>>>>>>>>>>>> embarrassing, won't break production pipelines or data
>>>>>>>>>>>>>>>>>> loss, and
>>>>>>>>>>>>>>>>>> though the source is technically part of the release,
>>>>>>>>>>>>>>>>>> when we find
>>>>>>>>>>>>>>>>>> something to fix we can fix the live website much more
>>>>>>>>>>>>>>>>>> quickly than go
>>>>>>>>>>>>>>>>>> through the whole release process and convince people to
>>>>>>>>>>>>>>>>>> upgrade. (I
>>>>>>>>>>>>>>>>>> recognize accepting this argument is, to some degree at
>>>>>>>>>>>>>>>>>> least, saying
>>>>>>>>>>>>>>>>>> that we don't care about the correctness of docs as much
>>>>>>>>>>>>>>>>>> as so-called
>>>>>>>>>>>>>>>>>> "real" code, if we go there.)
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> If we decide to go ahead and merge (and I would not
>>>>>>>>>>>>>>>>>> object), there are
>>>>>>>>>>>>>>>>>> some things I would like to see.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> (1) I would like to understand what we would do
>>>>>>>>>>>>>>>>>> afterwards to "review
>>>>>>>>>>>>>>>>>> the outcome, and ensure that all the content is there,"
>>>>>>>>>>>>>>>>>> and why it
>>>>>>>>>>>>>>>>>> can't be done before merging instead. (Is it because it'd
>>>>>>>>>>>>>>>>>> take time
>>>>>>>>>>>>>>>>>> and we don't want to incorporate changes that are made to
>>>>>>>>>>>>>>>>>> the website
>>>>>>>>>>>>>>>>>> in the meantime? I think that boat has sailed, but maybe
>>>>>>>>>>>>>>>>>> we can avoid
>>>>>>>>>>>>>>>>>> making it worse...)
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> (2) I spot checked the content, pulled out some common
>>>>>>>>>>>>>>>>>> patterns, and
>>>>>>>>>>>>>>>>>> it mostly looks good, but there were also some issues
>>>>>>>>>>>>>>>>>> (e.g. several
>>>>>>>>>>>>>>>>>> pages were replaced with the contents from entirely
>>>>>>>>>>>>>>>>>> different pages).
>>>>>>>>>>>>>>>>>> I would be more comfortable if, say, a smoke test of
>>>>>>>>>>>>>>>>>> comparing the old
>>>>>>>>>>>>>>>>>> and new sites, with html tags stripped and ignoring
>>>>>>>>>>>>>>>>>> whitespace,
>>>>>>>>>>>>>>>>>> yielded what should be empty diffs.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> (3) It'd be good to have someone give a stamp of approval
>>>>>>>>>>>>>>>>>> on the
>>>>>>>>>>>>>>>>>> infrastructure changes, at least to validate that we're
>>>>>>>>>>>>>>>>>> not going to
>>>>>>>>>>>>>>>>>> be taking on extra tech debt with regard to jenkins
>>>>>>>>>>>>>>>>>> stability and
>>>>>>>>>>>>>>>>>> developer workflow. I see that Brian has at least looked
>>>>>>>>>>>>>>>>>> at this some.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> - Robert
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> On Thu, May 7, 2020 at 12:40 PM Aizhamal Nurmamat kyzy
>>>>>>>>>>>>>>>>>> <aizha...@apache.org> wrote:
>>>>>>>>>>>>>>>>>> >
>>>>>>>>>>>>>>>>>> > Thank you Ahmet.
>>>>>>>>>>>>>>>>>> >
>>>>>>>>>>>>>>>>>> > Robert/Brian, what do you think?
>>>>>>>>>>>>>>>>>> >
>>>>>>>>>>>>>>>>>> > The website staging and pre commit tests have passed
>>>>>>>>>>>>>>>>>> [1]. If nobody has objections, we could merge it soon.
>>>>>>>>>>>>>>>>>> >
>>>>>>>>>>>>>>>>>> >
>>>>>>>>>>>>>>>>>> > [1] https://github.com/apache/beam/pull/11554
>>>>>>>>>>>>>>>>>> >
>>>>>>>>>>>>>>>>>> > On Thu, May 7, 2020 at 11:38 AM Ahmet Altay <
>>>>>>>>>>>>>>>>>> al...@google.com> wrote:
>>>>>>>>>>>>>>>>>> >>
>>>>>>>>>>>>>>>>>> >>
>>>>>>>>>>>>>>>>>> >>
>>>>>>>>>>>>>>>>>> >> On Thu, May 7, 2020 at 10:50 AM Aizhamal Nurmamat kyzy
>>>>>>>>>>>>>>>>>> <aizha...@apache.org> wrote:
>>>>>>>>>>>>>>>>>> >>>
>>>>>>>>>>>>>>>>>> >>> Thanks for the writeup Ahmet.
>>>>>>>>>>>>>>>>>> >>>
>>>>>>>>>>>>>>>>>> >>> My bias is to move forward and merge the PR. After
>>>>>>>>>>>>>>>>>> this, we'll review the outcome, and ensure that all the 
>>>>>>>>>>>>>>>>>> content is there.
>>>>>>>>>>>>>>>>>> Nam will help us with that.
>>>>>>>>>>>>>>>>>> >>> The reason that I'd like to move forward and merge
>>>>>>>>>>>>>>>>>> what we have now - is that if we don't do that, the work 
>>>>>>>>>>>>>>>>>> done so far will
>>>>>>>>>>>>>>>>>> be lost.
>>>>>>>>>>>>>>>>>> >>> We'll make sure to stage the website in its current
>>>>>>>>>>>>>>>>>> state, and use that as reference/archive to ensure all the 
>>>>>>>>>>>>>>>>>> content have
>>>>>>>>>>>>>>>>>> been moved.
>>>>>>>>>>>>>>>>>> >>>
>>>>>>>>>>>>>>>>>> >>> Is this reasonable to everyone?
>>>>>>>>>>>>>>>>>> >>
>>>>>>>>>>>>>>>>>> >>
>>>>>>>>>>>>>>>>>> >> This is reasonable to me. I agree with your reasons.
>>>>>>>>>>>>>>>>>> >>
>>>>>>>>>>>>>>>>>> >> What do others think?
>>>>>>>>>>>>>>>>>> >>
>>>>>>>>>>>>>>>>>> >>>
>>>>>>>>>>>>>>>>>> >>>
>>>>>>>>>>>>>>>>>> >>>
>>>>>>>>>>>>>>>>>> >>> On Wed, May 6, 2020 at 7:07 PM Ahmet Altay <
>>>>>>>>>>>>>>>>>> al...@google.com> wrote:
>>>>>>>>>>>>>>>>>> >>>>
>>>>>>>>>>>>>>>>>> >>>>
>>>>>>>>>>>>>>>>>> >>>>
>>>>>>>>>>>>>>>>>> >>>> On Wed, May 6, 2020 at 2:33 PM Aizhamal Nurmamat
>>>>>>>>>>>>>>>>>> kyzy <aizha...@apache.org> wrote:
>>>>>>>>>>>>>>>>>> >>>>>>
>>>>>>>>>>>>>>>>>> >>>>>>
>>>>>>>>>>>>>>>>>> >>>>>> > 1) Currently, the main blocker for merging is
>>>>>>>>>>>>>>>>>> Staging Test Failures.
>>>>>>>>>>>>>>>>>> >>>>>>
>>>>>>>>>>>>>>>>>> >>>>>> That and finishing the review. (Is someone
>>>>>>>>>>>>>>>>>> tracking/coordinating this?)
>>>>>>>>>>>>>>>>>> >>>>>
>>>>>>>>>>>>>>>>>> >>>>>
>>>>>>>>>>>>>>>>>> >>>>> I am coordinating the work on the failed tests, but
>>>>>>>>>>>>>>>>>> I would need other committer's help to perform the review. 
>>>>>>>>>>>>>>>>>> @Ahmet, could
>>>>>>>>>>>>>>>>>> you help us prioritize the review for this PR?
>>>>>>>>>>>>>>>>>> >>>>
>>>>>>>>>>>>>>>>>> >>>>
>>>>>>>>>>>>>>>>>> >>>> The problem is there are too many manual changes.
>>>>>>>>>>>>>>>>>> Reviewing this change in this form will require a large 
>>>>>>>>>>>>>>>>>> effort. I do not
>>>>>>>>>>>>>>>>>> think I can interrupt other projects to prioritize reviews 
>>>>>>>>>>>>>>>>>> on this PR. IMO,
>>>>>>>>>>>>>>>>>> we have a few options:
>>>>>>>>>>>>>>>>>> >>>>
>>>>>>>>>>>>>>>>>> >>>> - PR to be restructured in the format suggested in
>>>>>>>>>>>>>>>>>> this thread. A commit for infrastructure changes from Jekyll 
>>>>>>>>>>>>>>>>>> to hugo. A
>>>>>>>>>>>>>>>>>> second commit for a script that will convert the majority of 
>>>>>>>>>>>>>>>>>> the content. A
>>>>>>>>>>>>>>>>>> third commit for the execution of the script. And a fourth 
>>>>>>>>>>>>>>>>>> commit for the
>>>>>>>>>>>>>>>>>> additional manual content changes. If Nam can get to this 
>>>>>>>>>>>>>>>>>> form, people on
>>>>>>>>>>>>>>>>>> this thread myself/Robert/Pablo/Brian can review the changes.
>>>>>>>>>>>>>>>>>> >>>> - Another option is, we can accept that we already
>>>>>>>>>>>>>>>>>> invested in this transition and overall this is a good 
>>>>>>>>>>>>>>>>>> change, and merge
>>>>>>>>>>>>>>>>>> the PR more or less in its current form (with tests fixed 
>>>>>>>>>>>>>>>>>> and open comments
>>>>>>>>>>>>>>>>>> addressed) even though it has issues. And then overtime fix 
>>>>>>>>>>>>>>>>>> the issues we
>>>>>>>>>>>>>>>>>> encounter. There was already some amount of review and 
>>>>>>>>>>>>>>>>>> visual comparisons,
>>>>>>>>>>>>>>>>>> we risk losing some recent content changes but I am assuming 
>>>>>>>>>>>>>>>>>> this will not
>>>>>>>>>>>>>>>>>> be much. If Nam can commit to compare two sites after a 
>>>>>>>>>>>>>>>>>> merge, fixing the
>>>>>>>>>>>>>>>>>> majority of the delta, this might be a viable option.
>>>>>>>>>>>>>>>>>> >>>>
>>>>>>>>>>>>>>>>>> >>>> Another thing we can do, we can archive/store a
>>>>>>>>>>>>>>>>>> read-only copy of the current website in an "archive" url 
>>>>>>>>>>>>>>>>>> temporarily
>>>>>>>>>>>>>>>>>> instead of completely deleting it. It will give us a 
>>>>>>>>>>>>>>>>>> baseline for a while
>>>>>>>>>>>>>>>>>> to go back to the old content and move any missing data. 
>>>>>>>>>>>>>>>>>> (And maybe,
>>>>>>>>>>>>>>>>>> someone can come up with an innovative way to compare the 
>>>>>>>>>>>>>>>>>> textual content
>>>>>>>>>>>>>>>>>> of both sites.) A note on the stop world approach, I believe 
>>>>>>>>>>>>>>>>>> we are already
>>>>>>>>>>>>>>>>>> failing on that with merge conflicts showing up on the PR. 
>>>>>>>>>>>>>>>>>> It will be
>>>>>>>>>>>>>>>>>> better for us to complete the transition as soon as 
>>>>>>>>>>>>>>>>>> possible. Fixing after
>>>>>>>>>>>>>>>>>> the initial merge might be a simpler task, especially if we 
>>>>>>>>>>>>>>>>>> can archive the
>>>>>>>>>>>>>>>>>> old site.
>>>>>>>>>>>>>>>>>> >>>>
>>>>>>>>>>>>>>>>>> >>>>>
>>>>>>>>>>>>>>>>>> >>>>>
>>>>>>>>>>>>>>>>>> >>>>>>
>>>>>>>>>>>>>>>>>> >>>>>> > Michal showed Nam how to handle the 1st test
>>>>>>>>>>>>>>>>>> which was about Apache License missing.
>>>>>>>>>>>>>>>>>> >>>>>> >
>>>>>>>>>>>>>>>>>> >>>>>> > However, the 2nd and 3rd tests looked like some
>>>>>>>>>>>>>>>>>> kind of permissions error on the Jenkins worker, not to be 
>>>>>>>>>>>>>>>>>> configured by
>>>>>>>>>>>>>>>>>> code. For more details based on Jenkin logs, the 2nd test 
>>>>>>>>>>>>>>>>>> failed because of
>>>>>>>>>>>>>>>>>> website/www/site/themes and the 3rd test failed because of
>>>>>>>>>>>>>>>>>> website/www/node_modules, they are both auto-generated files 
>>>>>>>>>>>>>>>>>> on build. Can
>>>>>>>>>>>>>>>>>> someone help Nam to look into this?
>>>>>>>>>>>>>>>>>> >>>>>> >
>>>>>>>>>>>>>>>>>> >>>>>> > RAT ("Run RAT PreCommit") — FAILURE
>>>>>>>>>>>>>>>>>> >>>>>> > Website_Stage_GCS ("Run Website_Stage_GCS
>>>>>>>>>>>>>>>>>> PreCommit") — FAILURE
>>>>>>>>>>>>>>>>>> >>>>>> > Website_Stage_GCS ("Run Website_Stage_GCS
>>>>>>>>>>>>>>>>>> PreCommit") — FAILURE
>>>>>>>>>>>>>>>>>> >>>>>> >
>>>>>>>>>>>>>>>>>> >>>>>> > 2) Are there any other blockers for merging?
>>>>>>>>>>>>>>>>>> @Ahmet/Robert/others please share if there are any other 
>>>>>>>>>>>>>>>>>> blockers.
>>>>>>>>>>>>>>>>>> >>>>>> >
>>>>>>>>>>>>>>>>>> >>>>>> >
>>>>>>>>>>>>>>>>>> >>>>>> > [1] https://github.com/gohugoio/hugo/pull/4494
>>>>>>>>>>>>>>>>>> >>>>>> >
>>>>>>>>>>>>>>>>>> >>>>>> >
>>>>>>>>>>>>>>>>>> >>>>>> > On Wed, May 6, 2020 at 10:19 AM Robert Bradshaw <
>>>>>>>>>>>>>>>>>> rober...@google.com> wrote:
>>>>>>>>>>>>>>>>>> >>>>>> >>
>>>>>>>>>>>>>>>>>> >>>>>> >> On Mon, May 4, 2020 at 7:07 PM Ahmet Altay <
>>>>>>>>>>>>>>>>>> al...@google.com> wrote:
>>>>>>>>>>>>>>>>>> >>>>>> >> >
>>>>>>>>>>>>>>>>>> >>>>>> >> >> On Mon, May 4, 2020 at 6:30 PM Robert
>>>>>>>>>>>>>>>>>> Bradshaw <rober...@google.com> wrote:
>>>>>>>>>>>>>>>>>> >>>>>> >> >>>
>>>>>>>>>>>>>>>>>> >>>>>> >> >>> I took the massive commit and split it up
>>>>>>>>>>>>>>>>>> into:
>>>>>>>>>>>>>>>>>> >>>>>> >> >>>
>>>>>>>>>>>>>>>>>> >>>>>> >> >>> (1) Infrastructure changes (basically
>>>>>>>>>>>>>>>>>> everything outside of
>>>>>>>>>>>>>>>>>> >>>>>> >> >>> (website/www/site/content)
>>>>>>>>>>>>>>>>>> >>>>>> >> >>> (2) Sed script changes, and
>>>>>>>>>>>>>>>>>> >>>>>> >> >>> (3) Manual changes (everything not in (1)
>>>>>>>>>>>>>>>>>> and (2)).
>>>>>>>>>>>>>>>>>> >>>>>> >> >
>>>>>>>>>>>>>>>>>> >>>>>> >> >
>>>>>>>>>>>>>>>>>> >>>>>> >> > Thank you Robert. This makes it much easier.
>>>>>>>>>>>>>>>>>> What is the source of the sed script? I am not sure why some 
>>>>>>>>>>>>>>>>>> of those lines
>>>>>>>>>>>>>>>>>> are there. It would be much easier for us to comment on the 
>>>>>>>>>>>>>>>>>> script source
>>>>>>>>>>>>>>>>>> if it is reviewable somewhere.
>>>>>>>>>>>>>>>>>> >>>>>> >>
>>>>>>>>>>>>>>>>>> >>>>>> >> I just gathered up common patterns as I was
>>>>>>>>>>>>>>>>>> trying to go through and
>>>>>>>>>>>>>>>>>> >>>>>> >> review the files... Mostly it was an exercise
>>>>>>>>>>>>>>>>>> in finding a compact
>>>>>>>>>>>>>>>>>> >>>>>> >> representation for the delta, not trying to be
>>>>>>>>>>>>>>>>>> a perfect conversion.
>>>>>>>>>>>>>>>>>> >>>>>> >> (I do think in retrospect, if we do something
>>>>>>>>>>>>>>>>>> like this again, it
>>>>>>>>>>>>>>>>>> >>>>>> >> would be preferable to commit a script that
>>>>>>>>>>>>>>>>>> does the auto-conversion
>>>>>>>>>>>>>>>>>> >>>>>> >> (maybe even with some patch files for manual
>>>>>>>>>>>>>>>>>> changes) both for ease of
>>>>>>>>>>>>>>>>>> >>>>>> >> reviewing and to avoid the stop-the-world
>>>>>>>>>>>>>>>>>> situation we're in now. (I'm
>>>>>>>>>>>>>>>>>> >>>>>> >> still worried that some changes will get lost
>>>>>>>>>>>>>>>>>> in the shuffle.)
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>

Reply via email to