I poked around a bit. The 3.0.0-alpha2 binary tarball is only 246M and has more changes than 2.8.0.
It looks like the 2.8.0 bin tarball has an extra 1.5GB of docs when extracted compared to 3.0.0-alpha2. I think it's from the extra src-html content:

-> % find share/doc -name src-html | xargs du -sb | awk -e '{SUM+=$1} END {print SUM}'
1651891481

On Tue, Mar 21, 2017 at 9:53 AM, Wei-Chiu Chuang <weic...@cloudera.com> wrote:

> Thanks Junping for taking on this huge effort!
>
> I found one tiny nit: the md5 files are not in conventional format
> That is to say,
> $ cat hadoop-2.8.0.tar.gz.md5
>
> $ /usr/bin/md5sum /build/source/target/artifacts/hadoop-2.8.0.tar.gz
> c728a090b68d009070085367695ed507 /build/source/target/artifacts/hadoop-2.8.0.tar.gz
>
> But a typical md5 file would have been:
> c728a090b68d009070085367695ed507 hadoop-2.8.0.tar.gz
>
> I was pretty stunned finding hadoop-2.8.0.tar.gz is a whopping 410MB
> binary, comparing to hadoop-2.7.3.tar.gz which is just 205 MB.
> But later on I realized the source code hadoop-2.8.0-src.tar.gz is 33MB
> comparing to hadoop-2.7.3-src.tar.gz which is 18MB. So probably it’s the
> amount of changes made into Hadoop 2.8 makes such difference in size.
>
> Regards,
> Wei-Chiu Chuang
>
>
> On Mar 21, 2017, at 9:39 AM, Akira Ajisaka <aajis...@apache.org> wrote:
> >
> > Thanks Junping!
> >
> > +1 (binding)
> >
> > * Verified signatures and checksums
> > * Built Hive 2.1.0 and Tez 0.8.5 with Hadoop 2.8.0 pom
> > * Deployed a single node cluster and ran some Hive on Tez queries successfully
> > * The document looks good.
> >
> > I found a trivial issue in the doc. It does not block the release.
> > https://issues.apache.org/jira/browse/HADOOP-14208
> >
> > Regards,
> > Akira
> >
> >
> > On 2017/03/17 18:18, Junping Du wrote:
> >> Hi all,
> >> With fix of HDFS-11431 get in, I've created a new release candidate (RC3) for Apache Hadoop 2.8.0.
> >>
> >> This is the next minor release to follow up 2.7.0 which has been released for more than 1 year. It comprises 2,900+ fixes, improvements, and new features. Most of these commits are released for the first time in branch-2.
> >>
> >> More information about the 2.8.0 release plan can be found here: https://cwiki.apache.org/confluence/display/HADOOP/Hadoop+2.8+Release
> >>
> >> New RC is available at: http://home.apache.org/~junping_du/hadoop-2.8.0-RC3
> >>
> >> The RC tag in git is: release-2.8.0-RC3, and the latest commit id is: 91f2b7a13d1e97be65db92ddabc627cc29ac0009
> >>
> >> The maven artifacts are available via repository.apache.org at: https://repository.apache.org/content/repositories/orgapachehadoop-1057
> >>
> >> Please try the release and vote; the vote will run for the usual 5 days, ending on 03/22/2017 PDT time.
> >>
> >> Thanks,
> >>
> >> Junping
> >>
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
> > For additional commands, e-mail: common-dev-h...@hadoop.apache.org
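
On the md5 nit quoted above: a minimal sketch of one way to get a checksum file that records only the bare file name, assuming GNU md5sum is available. The write_md5 helper and the stand-in file are hypothetical and only for illustration; this is not how the release scripts actually generate the file.

```shell
#!/bin/sh
# Sketch only: write an md5 file that records the bare file name
# rather than the absolute build path.
# write_md5 is a hypothetical helper, not part of the Hadoop release scripts.
write_md5() {
    artifact=$1
    dir=$(dirname "$artifact")
    name=$(basename "$artifact")
    # Run md5sum from inside the artifact's directory so the path it
    # records is just the file name.
    (cd "$dir" && md5sum "$name" > "$name.md5")
}

# Demo with a stand-in file in a temporary directory (not a real tarball).
tmp=$(mktemp -d)
printf 'example' > "$tmp/hadoop-2.8.0.tar.gz"
write_md5 "$tmp/hadoop-2.8.0.tar.gz"
cat "$tmp/hadoop-2.8.0.tar.gz.md5"

# A file in this form can be checked from the download directory.
(cd "$tmp" && md5sum -c hadoop-2.8.0.tar.gz.md5)
```

With the file written this way, anyone who downloads the tarball and the .md5 into the same directory can run md5sum -c without editing the path first.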