On Sat, 4 Mar 2023 at 01:47, Erik Krogen <xkro...@apache.org> wrote:

> Thanks Steve. I see now that the branch cut was way back in October so I
> definitely understand your frustration here!
>
> This made me realize that HDFS-16832
> <https://issues.apache.org/jira/browse/HDFS-16832>, which resolves a very
> similar issue as the aforementioned HDFS-16923, is also missing from the
> RC. I erroneously marked it with a fix version of 3.3.5 -- it was before
> the initial 3.3.5 RC was made and I didn't notice the branch was cut. My
> apologies for that. I've pushed both HDFS-16832 and HDFS-16932 to
> branch-3.3.5, so they are ready if/when an RC3 is cut.
>

thanks.

>
> In the meantime, I tested for RC2 that a local cluster of NN + standby +
> observer + QJM works as expected for some basic HDFS commands.
>

OK. Could you have a go with a (locally built) patch release

>
> On Fri, Mar 3, 2023 at 2:52 AM Steve Loughran <ste...@cloudera.com.invalid>
> wrote:
>
>> shipping broken hdfs isn't something we'd want to do, but if we can be
>> confident that all other issues can be addressed in RC3 then I'd be happy.
>>
>> On Fri, 3 Mar 2023 at 05:09, Ayush Saxena <ayush...@gmail.com> wrote:
>>
>> > I will highlight that I am completely fed up with doing this  release
>> and
>> >> really want to get it out the way -for which I depend on support from
>> as
>> >> many other developers as possible.
>> >
>> >
>> > hmm, I can feel the pain. I tried to find if there is any config or any
>> > workaround which can dodge this HDFS issue, but unfortunately couldn't
>> find
>> > any. If someone does a getListing with needLocation and the file doesn't
>> > exist at Observer he is gonna get a NPE rather than a FNF, It isn't just
>> > the exception, AFAIK Observer reads have some logic around handling FNF
>> > specifically, that it attempts Active NN or something like that in such
>> > cases, So, that will be broken as well for this use case.
>> >
>> > Now, there is no denying the fact there is an issue on the HDFS side,
>> and
>> > it has already been too much work on your side, so you can argue that it
>> > might not be a very frequent use case or so. It's your call.
>> >
>> > Just sharing, no intentions of saying you should do that, But as an RM
>> > "nobody" can force you for a new iteration of a RC, it is gonna be your
>> > call and discretion. As far as I know a release can not be vetoed by
>> > "anybody" as per the apache by laws.(
>> > https://www.apache.org/legal/release-policy.html#release-approval).
>> Even
>> > our bylaws say that product release requires a Lazy Majority not a
>> > Consensus Approval.
>> >
>> > So, you have a way out. You guys are 2 already and 1 I will give you a
>> > pass, in case you are really in a state: ''Get me out of this mess"
>> state,
>> > my basic validations on x86 & Aarch64 both are passing as of now,
>> couldn't
>> > reach the end for any of the RC's
>> >
>> > -Ayush
>> >
>> > On Fri, 3 Mar 2023 at 08:41, Viraj Jasani <vjas...@apache.org> wrote:
>> >
>> >> While this RC is not going to be final, I just wanted to share the
>> results
>> >> of the testing I have done so far with RC1 and RC2.
>> >>
>> >> * Signature: ok
>> >> * Checksum : ok
>> >> * Rat check (1.8.0_341): ok
>> >>  - mvn clean apache-rat:check
>> >> * Built from source (1.8.0_341): ok
>> >>  - mvn clean install  -DskipTests
>> >> * Built tar from source (1.8.0_341): ok
>> >>  - mvn clean package  -Pdist -DskipTests -Dtar
>> -Dmaven.javadoc.skip=true
>> >>
>> >> * Built images using the tarball, installed and started all of Hdfs,
>> JHS
>> >> and Yarn components
>> >> * Ran Hbase (latest 2.5) tests against Hdfs, ran RowCounter Mapreduce
>> job
>> >> * Hdfs CRUD tests
>> >> * MapReduce wordcount job
>> >>
>> >> * Ran S3A tests with scale profile against us-west-2:
>> >> mvn clean verify -Dparallel-tests -DtestsThreadCount=8 -Dscale
>> >>
>> >> ITestS3AConcurrentOps#testParallelRename is timing out after ~960s.
>> This
>> >> is
>> >> consistently failing, looks like a recent regression.
>> >> I was also able to repro on trunk, will create Jira.
>> >>
>> >>
>> >> On Mon, Feb 27, 2023 at 9:59 AM Steve Loughran
>> >> <ste...@cloudera.com.invalid>
>> >> wrote:
>> >>
>> >> > Mukund and I have put together a release candidate (RC2) for Hadoop
>> >> 3.3.5.
>> >> >
>> >> > We need anyone who can to verify the source and binary artifacts,
>> >> > including those JARs staged on maven, the site documentation and the
>> >> arm64
>> >> > tar file.
>> >> >
>> >> > The RC is available at:
>> >> > https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.3.5-RC2/
>> >> >
>> >> > The git tag is release-3.3.5-RC2, commit 72f8c2a4888
>> >> >
>> >> > The maven artifacts are staged at
>> >> >
>> >>
>> https://repository.apache.org/content/repositories/orgapachehadoop-1369/
>> >> >
>> >> > You can find my public key at:
>> >> > https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>> >> >
>> >> > Change log
>> >> >
>> >>
>> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.3.5-RC2/CHANGELOG.md
>> >> >
>> >> > Release notes
>> >> >
>> >> >
>> >>
>> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.3.5-RC2/RELEASENOTES.md
>> >> >
>> >> > This is off branch-3.3 and is the first big release since 3.3.2.
>> >> >
>> >> > As to what changed since the RC1 attempt last week
>> >> >
>> >> >
>> >> >    1. Version fixup in JIRA (credit due to Takanobu Asanuma there)
>> >> >    2. HADOOP-18470. Remove HDFS RBF text in the 3.3.5 index.md file
>> >> >    3. Revert "HADOOP-18590. Publish SBOM artifacts (#5281)" (creating
>> >> build
>> >> >    issues in maven 3.9.0)
>> >> >    4. HADOOP-18641. Cloud connector dependency and LICENSE fixup.
>> >> (#5429)
>> >> >
>> >> >
>> >> > Note, because the arm64 binaries are built separately on a different
>> >> > platform and JVM, their jar files may not match those of the x86
>> >> > release -and therefore the maven artifacts. I don't think this is
>> >> > an issue (the ASF actually releases source tarballs, the binaries are
>> >> > there for help only, though with the maven repo that's a bit
>> blurred).
>> >> >
>> >> > The only way to be consistent would actually untar the x86.tar.gz,
>> >> > overwrite its binaries with the arm stuff, retar, sign and push out
>> >> > for the vote. Even automating that would be risky.
>> >> >
>> >> > Please try the release and vote. The vote will run for 5 days.
>> >> >
>> >> > Steve and Mukund
>> >> >
>> >>
>> >
>>
>

Reply via email to