I just saw this email, Andrew. Should I re-open HBASE-21959? And revert it
before we understand/fix why it caused the test failure?
Regarding the failing test, do you mean this one "TestBlocksRead"?
Thanks,

Xu

On Tue, Apr 16, 2019 at 9:47 PM Andrew Purtell <andrew.purt...@gmail.com>
wrote:

> I've bisected twice and it lands on this commit:
>
> commit 6bc46bb10920c1c335b784b01d2a326db1a3d587 (HEAD, refs/bisect/bad)
>     HBASE-21959 CompactionTool should close the store it uses for
> compacting files, in order to properly archive compacted files.
>
>  
> hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/CompactionTool.java
> |   2 ++
>
>  
> hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompactionTool.java
> | 100
>
> At first glance it's hard to see how this change is relevant, but it does
> introduce a new unit test.
>
>
> On Tue, Apr 16, 2019 at 7:48 PM Andrew Purtell <andrew.purt...@gmail.com>
> wrote:
>
> > I’ve been able to reproduce it sometimes too and am bisecting. It may be
> > an interaction between test cases, not a failure per se, but does seem
> have
> > a recent cause, as you pointed out. I’ll be looking at it.
> >
> > Thank you for your kind consideration and for revoking your veto.
> >
> > A coprocessor API fix was just committed to branch-1 so I want to roll a
> > new RC soon to include it. There is also an issue open to improve the
> > behavior of the UI when the profiler link is clicked but system support
> is
> > not available.
> >
> > > On Apr 16, 2019, at 7:40 PM, Yu Li <car...@gmail.com> wrote:
> > >
> > > After more investigation, the ConnectionRefused exception could be
> > > reproduced with "mvn -Dtest=<case_name> test" after a complete run of
> all
> > > cases through "mvn -PrunAllTests clean test", but cannot by a clean
> > > standalone run (with "mvn *clean* test"). So now I'm more convinced
> it's
> > > some kind of environment chaos caused by parallel execution of test
> > cases,
> > > and not a blocker issue.
> > >
> > > @Andrew It seems to me that kerby jar is not included in our binary
> > > package, so I'm not sure whether a new RC is required by HBASE-22219.
> > > Anyway I'd like to revoke my -1 vote now. Thanks.
> > >
> > > Best Regards,
> > > Yu
> > >
> > >
> > >> On Tue, 16 Apr 2019 at 10:19, Yu Li <car...@gmail.com> wrote:
> > >>
> > >> Sorry for the late response due to job priority.
> > >>
> > >> This ConnectionRefused issue cannot be reproduced on my laptop (MacOS
> > >> 10.14.4) but could on the linux env. And I've checked and confirmed it
> > >> could pass with 1.4.7/1.4.9 source package but stably failed with
> 1.5.0,
> > >> performing a git bisect now, will report back later.
> > >>
> > >> Best Regards,
> > >> Yu
> > >>
> > >>
> > >> On Sat, 13 Apr 2019 at 00:38, Andrew Purtell <
> andrew.purt...@gmail.com>
> > >> wrote:
> > >>
> > >>> I also see the occasional ConnectionRefused errors. They don’t
> > reproduce
> > >>> if you run the test standalone. I also only see them on a Linux dev
> > host.
> > >>> That may be enough to find by bisect the commit that introduced this
> > >>> behavior. Working on it. There is a JIRA filed for this one. Search
> for
> > >>> “TestBlocksRead” and label “branch-1”.
> > >>>
> > >>> Thanks for the investigations.
> > >>>
> > >>>> On Apr 12, 2019, at 6:36 AM, Yu Li <car...@gmail.com> wrote:
> > >>>>
> > >>>> Quick updates:
> > >>>>
> > >>>> W/ patch of HBASE-22219 or say upgrading kerby version to 1.0.1, the
> > >>>> failures listed above in the 1st part of hbase-server disappeared.
> > >>>>
> > >>>> However, in the 2nd part of hbase-server UT there're still many
> > >>>> ConnectionRefused exceptions (17 errors in total) as shown below,
> > which
> > >>>> could be reproduced easily with -Dtest=xxx command on my
> environments,
> > >>>> still checking the root cause.
> > >>>>
> > >>>> [INFO] Running org.apache.hadoop.hbase.regionserver.TestBlocksRead
> > >>>> [ERROR] Tests run: 4, Failures: 0, Errors: 4, Skipped: 0, Time
> > elapsed:
> > >>>> 0.853 s <<< FAILURE! - in
> > >>>> org.apache.hadoop.hbase.regionserver.TestBlocksRead
> > >>>> [ERROR]
> > >>>>
> > >>>
> >
> testBlocksStoredWhenCachingDisabled(org.apache.hadoop.hbase.regionserver.TestBlocksRead)
> > >>>> Time elapsed: 0.17 s  <<< ERROR!
> > >>>> java.net.ConnectException: Call From
> > >>>> z05f06378.sqa.zth.tbsite.net/11.163.183.195 to localhost:35669
> failed
> > >>> on
> > >>>> connection exception: java.net.ConnectException: Connection refused;
> > For
> > >>>> more details see:
> > >>>> http://wiki.apache.org/hadoop/ConnectionRefused
> > >>>>       at
> > >>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestBlocksRead.initHRegion(TestBlocksRead.java:112)
> > >>>>       at
> > >>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestBlocksRead.testBlocksStoredWhenCachingDisabled(TestBlocksRead.java:389)
> > >>>> Caused by: java.net.ConnectException: Connection refused
> > >>>>       at
> > >>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestBlocksRead.initHRegion(TestBlocksRead.java:112)
> > >>>>       at
> > >>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestBlocksRead.testBlocksStoredWhenCachingDisabled(TestBlocksRead.java:389)
> > >>>>
> > >>>> Best Regards,
> > >>>> Yu
> > >>>>
> > >>>>
> > >>>>> On Fri, 12 Apr 2019 at 13:11, Yu Li <car...@gmail.com> wrote:
> > >>>>>
> > >>>>> I have no doubt that you've run the tests locally before
> announcing a
> > >>>>> release as you're always a great RM boss. And this shows one value
> of
> > >>>>> verifying release, that different voter has different environments.
> > >>>>>
> > >>>>> Now I think the failures may be kerberos related, since I possibly
> > has
> > >>>>> changed some system configuration when doing Flink testing on this
> > env
> > >>>>> weeks ago. Located one issue (HBASE-22219) which also observed in
> > >>> 1.4.7,
> > >>>>> will further investigate.
> > >>>>>
> > >>>>> Best Regards,
> > >>>>> Yu
> > >>>>>
> > >>>>>
> > >>>>> On Fri, 12 Apr 2019 at 12:38, Andrew Purtell <
> > andrew.purt...@gmail.com
> > >>>>
> > >>>>> wrote:
> > >>>>>
> > >>>>>> “However it's good to find the issue earlier if there
> > >>>>>> really is any, before release announced.”
> > >>>>>>
> > >>>>>> I run the complete unit test suite before announcing a release
> > >>> candidate.
> > >>>>>> Just to be clear.
> > >>>>>>
> > >>>>>> Totally agree we should get these problems sorted before an actual
> > >>>>>> release. My policy is to cancel a RC if anyone vetoes for this
> > >>> reason...
> > >>>>>> want as much coverage and varying environments as we can manage.
> > >>>>>>
> > >>>>>> Thank you for your help so far and I hope the failures you see
> > result
> > >>> in
> > >>>>>> analysis and fixes that lead to better test stability.
> > >>>>>>
> > >>>>>>> On Apr 11, 2019, at 9:32 PM, Yu Li <car...@gmail.com> wrote:
> > >>>>>>>
> > >>>>>>> Confirmed in 1.4.7 source the listed out cases passed (all in the
> > 1st
> > >>>>>> part
> > >>>>>>> of hbase-server so the result comes out quickly.)... Also
> confirmed
> > >>> the
> > >>>>>>> test ran order are the same...
> > >>>>>>>
> > >>>>>>> Will try 1.5.0 again to prevent the environment difference caused
> > by
> > >>>>>> time.
> > >>>>>>> If 1.5.0 still fails, will start to do the git bisect to locate
> the
> > >>>>>> first
> > >>>>>>> bad commit.
> > >>>>>>>
> > >>>>>>> Was also expecting an easy pass and +1 as always to save time and
> > >>>>>> efforts,
> > >>>>>>> but obvious no luck. However it's good to find the issue earlier
> if
> > >>>>>> there
> > >>>>>>> really is any, before release announced.
> > >>>>>>>
> > >>>>>>> Best Regards,
> > >>>>>>> Yu
> > >>>>>>>
> > >>>>>>>
> > >>>>>>>> On Fri, 12 Apr 2019 at 12:16, Yu Li <car...@gmail.com> wrote:
> > >>>>>>>>
> > >>>>>>>> Fine, let's focus on verifying whether it's a real problem
> rather
> > >>> than
> > >>>>>>>> arguing about wording, after all that's not my intention...
> > >>>>>>>>
> > >>>>>>>> As mentioned, I participated in the 1.4.7 release vote[1] and
> > IIRC I
> > >>>>>> was
> > >>>>>>>> using the same env and all tests passed w/o issue, that's where
> my
> > >>>>>> concern
> > >>>>>>>> lies and the main reason I gave a -1 vote. I'm running against
> > 1.4.7
> > >>>>>> source
> > >>>>>>>> on the same now and let's see the result.
> > >>>>>>>>
> > >>>>>>>> [1]
> > https://www.mail-archive.com/dev@hbase.apache.org/msg51380.html
> > >>>>>>>>
> > >>>>>>>> Best Regards,
> > >>>>>>>> Yu
> > >>>>>>>>
> > >>>>>>>>
> > >>>>>>>> On Fri, 12 Apr 2019 at 12:05, Andrew Purtell <
> > >>> andrew.purt...@gmail.com
> > >>>>>>>
> > >>>>>>>> wrote:
> > >>>>>>>>
> > >>>>>>>>> I believe the test execution order matters. We run some tests
> in
> > >>>>>>>>> parallel. The ordering of tests is determined by readdir()
> > results
> > >>>>>> and this
> > >>>>>>>>> differs from host to host and checkout to checkout. So when you
> > >>> see a
> > >>>>>>>>> repeatable group of failures, that’s great. And when someone
> else
> > >>>>>> doesn’t
> > >>>>>>>>> see those same tests fail, or they cannot be reproduced when
> > >>> running
> > >>>>>> by
> > >>>>>>>>> themselves, the commonly accepted term of art for this is
> > “flaky”.
> > >>>>>>>>>
> > >>>>>>>>>
> > >>>>>>>>>> On Apr 11, 2019, at 8:52 PM, Yu Li <car...@gmail.com> wrote:
> > >>>>>>>>>>
> > >>>>>>>>>> Sorry but I'd call it "possible environment related problem"
> or
> > >>> "some
> > >>>>>>>>>> feature may not work well in specific environment", rather
> than
> > a
> > >>>>>> flaky.
> > >>>>>>>>>>
> > >>>>>>>>>> Will check against 1.4.7 released source package before
> opening
> > >>> any
> > >>>>>>>>> JIRA.
> > >>>>>>>>>>
> > >>>>>>>>>> Best Regards,
> > >>>>>>>>>> Yu
> > >>>>>>>>>>
> > >>>>>>>>>>
> > >>>>>>>>>> On Fri, 12 Apr 2019 at 11:37, Andrew Purtell <
> > >>>>>> andrew.purt...@gmail.com>
> > >>>>>>>>>> wrote:
> > >>>>>>>>>>
> > >>>>>>>>>>> And if they pass in my environment , then what should we call
> > it
> > >>>>>> then.
> > >>>>>>>>> I
> > >>>>>>>>>>> have no doubt you are seeing failures. Therefore can you
> please
> > >>> file
> > >>>>>>>>> JIRAs
> > >>>>>>>>>>> and attach information that can help identify a fix. Thanks.
> > >>>>>>>>>>>
> > >>>>>>>>>>>> On Apr 11, 2019, at 8:35 PM, Yu Li <car...@gmail.com>
> wrote:
> > >>>>>>>>>>>>
> > >>>>>>>>>>>> I ran the test suite with the
> > >>> -Dsurefire.rerunFailingTestsCount=2
> > >>>>>>>>> option
> > >>>>>>>>>>>> and on two different env separately, so it sums up to 6
> times
> > >>>>>> stable
> > >>>>>>>>>>>> failure for each case, and from my perspective this is not
> > >>> flaky.
> > >>>>>>>>>>>>
> > >>>>>>>>>>>> IIRC last time when verifying 1.4.7 on the same env no such
> > >>> issue
> > >>>>>>>>>>> observed,
> > >>>>>>>>>>>> will double check.
> > >>>>>>>>>>>>
> > >>>>>>>>>>>> Best Regards,
> > >>>>>>>>>>>> Yu
> > >>>>>>>>>>>>
> > >>>>>>>>>>>>
> > >>>>>>>>>>>> On Fri, 12 Apr 2019 at 00:07, Andrew Purtell <
> > >>>>>>>>> andrew.purt...@gmail.com>
> > >>>>>>>>>>>> wrote:
> > >>>>>>>>>>>>
> > >>>>>>>>>>>>> There are two failure cases it looks like. And this looks
> > like
> > >>>>>>>>> flakes.
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>>> The wrong FS assertions are not something I see when I run
> > >>> these
> > >>>>>>>>> tests
> > >>>>>>>>>>>>> myself. I am not able to investigate something I can’t
> > >>> reproduce.
> > >>>>>>>>> What I
> > >>>>>>>>>>>>> suggest is since you can reproduce do a git bisect to find
> > the
> > >>>>>> commit
> > >>>>>>>>>>> that
> > >>>>>>>>>>>>> introduced the problem. Then we can revert it. As an
> > >>> alternative
> > >>>>>> we
> > >>>>>>>>> can
> > >>>>>>>>>>>>> open a JIRA, report the problem, temporarily @ignore the
> > test,
> > >>> and
> > >>>>>>>>>>>>> continue. This latter option only should be done if we are
> > >>> fairly
> > >>>>>>>>>>> confident
> > >>>>>>>>>>>>> it is a test only problem.
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>>> The connect exceptions are interesting. I see these
> sometimes
> > >>> when
> > >>>>>>>>> the
> > >>>>>>>>>>>>> suite is executed, not this particular case, but when the
> > >>> failed
> > >>>>>>>>> test is
> > >>>>>>>>>>>>> executed by itself it always passes. It is possible some
> > >>> change to
> > >>>>>>>>>>> classes
> > >>>>>>>>>>>>> related to the minicluster or startup or shutdown timing
> are
> > >>> the
> > >>>>>>>>> cause,
> > >>>>>>>>>>> but
> > >>>>>>>>>>>>> it is test time flaky behavior. I’m not happy about this
> but
> > it
> > >>>>>>>>> doesn’t
> > >>>>>>>>>>>>> actually fail the release because the failure is never
> > >>> repeatable
> > >>>>>>>>> when
> > >>>>>>>>>>> the
> > >>>>>>>>>>>>> test is run standalone.
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>>> In general it would be great if some attention was paid to
> > test
> > >>>>>>>>>>>>> cleanliness on branch-1. As RM I’m not in a position to
> > insist
> > >>>>>> that
> > >>>>>>>>>>>>> everything is perfect or there will never be another 1.x
> > >>> release,
> > >>>>>>>>>>> certainly
> > >>>>>>>>>>>>> not from branch-1. So, tests which fail repeatedly block a
> > >>> release
> > >>>>>>>>> IMHO
> > >>>>>>>>>>> but
> > >>>>>>>>>>>>> flakes do not.
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>>>> On Apr 10, 2019, at 11:20 PM, Yu Li <car...@gmail.com>
> > wrote:
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>> -1
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>> Observed many UT failures when checking the source package
> > >>> (tried
> > >>>>>>>>>>>>> multiple
> > >>>>>>>>>>>>>> rounds on two different environments, MacOs and Linux, got
> > the
> > >>>>>> same
> > >>>>>>>>>>>>>> result), including (but not limited to):
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>> TestBulkload:
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> shouldBulkLoadSingleFamilyHLog(org.apache.hadoop.hbase.regionserver.TestBulkLoad)
> > >>>>>>>>>>>>>> Time elapsed: 0.083 s  <<< ERROR!
> > >>>>>>>>>>>>>> java.lang.IllegalArgumentException: Wrong FS:
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> file:/var/folders/t6/vch4nh357f98y1wlq09lbm7h0000gn/T/junit1805329913454564189/junit8020757893576011944/data/default/shouldBulkLoadSingleFamilyHLog/8f4a6b584533de2fd1bf3c398dfaac29,
> > >>>>>>>>>>>>>> expected: hdfs://localhost:55938
> > >>>>>>>>>>>>>>   at
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestBulkLoad.testRegionWithFamiliesAndSpecifiedTableName(TestBulkLoad.java:246)
> > >>>>>>>>>>>>>>   at
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestBulkLoad.testRegionWithFamilies(TestBulkLoad.java:256)
> > >>>>>>>>>>>>>>   at
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestBulkLoad.shouldBulkLoadSingleFamilyHLog(TestBulkLoad.java:150)
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>> TestStoreFile:
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> testCacheOnWriteEvictOnClose(org.apache.hadoop.hbase.regionserver.TestStoreFile)
> > >>>>>>>>>>>>>> Time elapsed: 0.083 s  <<< ERROR!
> > >>>>>>>>>>>>>> java.net.ConnectException: Call From localhost/127.0.0.1
> to
> > >>>>>>>>>>>>> localhost:55938
> > >>>>>>>>>>>>>> failed on connection exception: java.net.ConnectException:
> > >>>>>>>>> Connection
> > >>>>>>>>>>>>>> refused; For more details see:
> > >>>>>>>>>>>>>> http://wiki.apache.org/hadoop/ConnectionRefused
> > >>>>>>>>>>>>>>   at
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestStoreFile.writeStoreFile(TestStoreFile.java:1047)
> > >>>>>>>>>>>>>>   at
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestStoreFile.testCacheOnWriteEvictOnClose(TestStoreFile.java:908)
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>> TestHFile:
> > >>>>>>>>>>>>>> testEmptyHFile(org.apache.hadoop.hbase.io.hfile.TestHFile)
> > >>> Time
> > >>>>>>>>>>> elapsed:
> > >>>>>>>>>>>>>> 0.08 s  <<< ERROR!
> > >>>>>>>>>>>>>> java.net.ConnectException: Call From
> > >>>>>>>>>>>>>> z05f06378.sqa.zth.tbsite.net/11.163.183.195 to
> > >>> localhost:35529
> > >>>>>>>>> failed
> > >>>>>>>>>>> on
> > >>>>>>>>>>>>>> connection exception: java.net.ConnectException:
> Connection
> > >>>>>> refused;
> > >>>>>>>>>>> For
> > >>>>>>>>>>>>>> more details see:
> > >>>>>> http://wiki.apache.org/hadoop/ConnectionRefused
> > >>>>>>>>>>>>>>   at
> > >>>>>>>>>>>>>> org.apache.hadoop.hbase.io
> > >>>>>>>>>>>>> .hfile.TestHFile.testEmptyHFile(TestHFile.java:90)
> > >>>>>>>>>>>>>> Caused by: java.net.ConnectException: Connection refused
> > >>>>>>>>>>>>>>   at
> > >>>>>>>>>>>>>> org.apache.hadoop.hbase.io
> > >>>>>>>>>>>>> .hfile.TestHFile.testEmptyHFile(TestHFile.java:90)
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>> TestBlocksScanned:
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> testBlocksScannedWithEncoding(org.apache.hadoop.hbase.regionserver.TestBlocksScanned)
> > >>>>>>>>>>>>>> Time elapsed: 0.069 s  <<< ERROR!
> > >>>>>>>>>>>>>> java.lang.IllegalArgumentException: Wrong FS:
> > >>>>>>>>>>> hdfs://localhost:35529/tmp/
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> hbase-jueding.ly/hbase/data/default/TestBlocksScannedWithEncoding/a4a416cc3060d9820a621c294af0aa08
> > >>>>>>>>>>>>> ,
> > >>>>>>>>>>>>>> expected: file:///
> > >>>>>>>>>>>>>>   at
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestBlocksScanned._testBlocksScanned(TestBlocksScanned.java:90)
> > >>>>>>>>>>>>>>   at
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> org.apache.hadoop.hbase.regionserver.TestBlocksScanned.testBlocksScannedWithEncoding(TestBlocksScanned.java:86)
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>> And please let me know if any known issue I'm not aware
> of.
> > >>>>>> Thanks.
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>> Best Regards,
> > >>>>>>>>>>>>>> Yu
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>> On Mon, 8 Apr 2019 at 11:38, Yu Li <car...@gmail.com>
> > wrote:
> > >>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>> The performance report LGTM, thanks! (and sorry for the
> lag
> > >>> due
> > >>>>>> to
> > >>>>>>>>>>>>>>> Qingming Festival Holiday here in China)
> > >>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>> Still verifying the release, just some quick feedback:
> > >>> observed
> > >>>>>>>>> some
> > >>>>>>>>>>>>>>> incompatible changes in compatibility report including
> > >>>>>>>>>>>>>>> HBASE-21492/HBASE-21684 and worth a reminder in
> > ReleaseNote.
> > >>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>> Irrelative but noticeable: the 1.4.9 release note URL is
> > >>>>>> invalid on
> > >>>>>>>>>>>>>>> https://hbase.apache.org/downloads.html
> > >>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>> Best Regards,
> > >>>>>>>>>>>>>>> Yu
> > >>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> On Fri, 5 Apr 2019 at 08:45, Andrew Purtell <
> > >>>>>> apurt...@apache.org>
> > >>>>>>>>>>>>> wrote:
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> The difference is basically noise per the usual YCSB
> > >>>>>> evaluation.
> > >>>>>>>>>>> Small
> > >>>>>>>>>>>>>>>> differences in workloads D and F (slightly worse) and
> > >>> workload
> > >>>>>> E
> > >>>>>>>>>>>>> (slightly
> > >>>>>>>>>>>>>>>> better) that do not indicate serious regression.
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> Linux version 4.14.55-62.37.amzn1.x86_64
> > >>>>>>>>>>>>>>>> c3.8xlarge x 5
> > >>>>>>>>>>>>>>>> OpenJDK Runtime Environment (build
> > 1.8.0_181-shenandoah-b13)
> > >>>>>>>>>>>>>>>> -Xms20g -Xmx20g -XX:+UseG1GC -XX:+AlwaysPreTouch
> > >>> -XX:+UseNUMA
> > >>>>>>>>>>>>>>>> -XX:-UseBiasedLocking -XX:+ParallelRefProcEnabled
> > >>>>>>>>>>>>>>>> Hadoop 2.9.2
> > >>>>>>>>>>>>>>>> Init: Load 100 M rows and snapshot
> > >>>>>>>>>>>>>>>> Run: Delete table, clone and redeploy from snapshot, run
> > 10
> > >>> M
> > >>>>>>>>>>>>> operations
> > >>>>>>>>>>>>>>>> Args: -threads 100 -target 50000
> > >>>>>>>>>>>>>>>> Test table: {NAME => 'u', BLOOMFILTER => 'ROW', VERSIONS
> > =>
> > >>>>>> '1',
> > >>>>>>>>>>>>> IN_MEMORY
> > >>>>>>>>>>>>>>>> => 'false', KEEP_DELETED_CELLS => 'FALSE',
> > >>> DATA_BLOCK_ENCODING
> > >>>>>> =>
> > >>>>>>>>>>>>>>>> 'ROW_INDEX_V1', TTL => 'FOREVER', COMPRESSION =>
> 'SNAPPY',
> > >>>>>>>>>>>>> MIN_VERSIONS =>
> > >>>>>>>>>>>>>>>> '0', BLOCKCACHE => 'true', BLOCKSIZE => '65536',
> > >>>>>>>>> REPLICATION_SCOPE =>
> > >>>>>>>>>>>>>>>> '0'}
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> YCSB Workload A
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> target 50k/op/s 1.4.9 1.5.0
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> [OVERALL], RunTime(ms) 200592 200583
> > >>>>>>>>>>>>>>>> [OVERALL], Throughput(ops/sec) 49852 49855
> > >>>>>>>>>>>>>>>> [READ], AverageLatency(us) 544 559
> > >>>>>>>>>>>>>>>> [READ], MinLatency(us) 267 292
> > >>>>>>>>>>>>>>>> [READ], MaxLatency(us) 165631 185087
> > >>>>>>>>>>>>>>>> [READ], 95thPercentileLatency(us) 738 742
> > >>>>>>>>>>>>>>>> [READ], 99thPercentileLatency(us), 1877 1961
> > >>>>>>>>>>>>>>>> [UPDATE], AverageLatency(us) 1370 1181
> > >>>>>>>>>>>>>>>> [UPDATE], MinLatency(us) 702 646
> > >>>>>>>>>>>>>>>> [UPDATE], MaxLatency(us) 180735 177279
> > >>>>>>>>>>>>>>>> [UPDATE], 95thPercentileLatency(us) 1943 1652
> > >>>>>>>>>>>>>>>> [UPDATE], 99thPercentileLatency(us) 3257 3085
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> YCSB Workload B
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> target 50k/op/s 1.4.9 1.5.0
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> [OVERALL], RunTime(ms) 200599 200581
> > >>>>>>>>>>>>>>>> [OVERALL], Throughput(ops/sec) 49850 49855
> > >>>>>>>>>>>>>>>> [READ], AverageLatency(us),  454 471
> > >>>>>>>>>>>>>>>> [READ], MinLatency(us) 203 213
> > >>>>>>>>>>>>>>>> [READ], MaxLatency(us) 183423 174207
> > >>>>>>>>>>>>>>>> [READ], 95thPercentileLatency(us) 563 599
> > >>>>>>>>>>>>>>>> [READ], 99thPercentileLatency(us) 1360 1172
> > >>>>>>>>>>>>>>>> [UPDATE], AverageLatency(us) 1064 1029
> > >>>>>>>>>>>>>>>> [UPDATE], MinLatency(us) 746 726
> > >>>>>>>>>>>>>>>> [UPDATE], MaxLatency(us) 163455 101631
> > >>>>>>>>>>>>>>>> [UPDATE], 95thPercentileLatency(us) 1327 1157
> > >>>>>>>>>>>>>>>> [UPDATE], 99thPercentileLatency(us) 2241 1898
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> YCSB Workload C
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> target 50k/op/s 1.4.9 1.5.0
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> [OVERALL], RunTime(ms) 200541 200538
> > >>>>>>>>>>>>>>>> [OVERALL], Throughput(ops/sec) 49865 49865
> > >>>>>>>>>>>>>>>> [READ], AverageLatency(us) 332 327
> > >>>>>>>>>>>>>>>> [READ], MinLatency(us) 175 179
> > >>>>>>>>>>>>>>>> [READ], MaxLatency(us) 210559 170367
> > >>>>>>>>>>>>>>>> [READ], 95thPercentileLatency(us) 410 396
> > >>>>>>>>>>>>>>>> [READ], 99thPercentileLatency(us) 871 892
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> YCSB Workload D
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> target 50k/op/s 1.4.9 1.5.0
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> [OVERALL], RunTime(ms) 200579 200562
> > >>>>>>>>>>>>>>>> [OVERALL], Throughput(ops/sec) 49855 49859
> > >>>>>>>>>>>>>>>> [READ], AverageLatency(us) 487 547
> > >>>>>>>>>>>>>>>> [READ], MinLatency(us) 210 214
> > >>>>>>>>>>>>>>>> [READ], MaxLatency(us) 192255 177535
> > >>>>>>>>>>>>>>>> [READ], 95thPercentileLatency(us) 973 1529
> > >>>>>>>>>>>>>>>> [READ], 99thPercentileLatency(us) 1836 2683
> > >>>>>>>>>>>>>>>> [INSERT], AverageLatency(us) 1239 1152
> > >>>>>>>>>>>>>>>> [INSERT], MinLatency(us) 807 788
> > >>>>>>>>>>>>>>>> [INSERT], MaxLatency(us) 184575 148735
> > >>>>>>>>>>>>>>>> [INSERT], 95thPercentileLatency(us) 1496 1243
> > >>>>>>>>>>>>>>>> [INSERT], 99thPercentileLatency(us) 2965 2495
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> YCSB Workload E
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> target 10k/op/s 1.4.9 1.5.0
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> [OVERALL], RunTime(ms) 100605 100568
> > >>>>>>>>>>>>>>>> [OVERALL], Throughput(ops/sec) 9939 9943
> > >>>>>>>>>>>>>>>> [SCAN], AverageLatency(us) 3548 2687
> > >>>>>>>>>>>>>>>> [SCAN], MinLatency(us) 696 678
> > >>>>>>>>>>>>>>>> [SCAN], MaxLatency(us) 1059839 238463
> > >>>>>>>>>>>>>>>> [SCAN], 95thPercentileLatency(us) 8327 6791
> > >>>>>>>>>>>>>>>> [SCAN], 99thPercentileLatency(us) 17647 14415
> > >>>>>>>>>>>>>>>> [INSERT], AverageLatency(us) 2688 1555
> > >>>>>>>>>>>>>>>> [INSERT], MinLatency(us) 887 815
> > >>>>>>>>>>>>>>>> [INSERT], MaxLatency(us) 173311 154623
> > >>>>>>>>>>>>>>>> [INSERT], 95thPercentileLatency(us) 4455 2571
> > >>>>>>>>>>>>>>>> [INSERT], 99thPercentileLatency(us) 9303 5375
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> YCSB Workload F
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> target 50k/op/s 1.4.9 1.5.0
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> [OVERALL], RunTime(ms) 200562 204178
> > >>>>>>>>>>>>>>>> [OVERALL], Throughput(ops/sec) 49859 48976
> > >>>>>>>>>>>>>>>> [READ], AverageLatency(us) 856 1137
> > >>>>>>>>>>>>>>>> [READ], MinLatency(us) 262 257
> > >>>>>>>>>>>>>>>> [READ], MaxLatency(us) 205567 222335
> > >>>>>>>>>>>>>>>> [READ], 95thPercentileLatency(us) 2365 3475
> > >>>>>>>>>>>>>>>> [READ], 99thPercentileLatency(us) 3099 4143
> > >>>>>>>>>>>>>>>> [READ-MODIFY-WRITE], AverageLatency(us) 2559 2917
> > >>>>>>>>>>>>>>>> [READ-MODIFY-WRITE], MinLatency(us) 1100 1034
> > >>>>>>>>>>>>>>>> [READ-MODIFY-WRITE], MaxLatency(us) 208767 204799
> > >>>>>>>>>>>>>>>> [READ-MODIFY-WRITE], 95thPercentileLatency(us) 5747 7627
> > >>>>>>>>>>>>>>>> [READ-MODIFY-WRITE], 99thPercentileLatency(us) 7203 8919
> > >>>>>>>>>>>>>>>> [UPDATE], AverageLatency(us) 1700 1777
> > >>>>>>>>>>>>>>>> [UPDATE], MinLatency(us) 737 687
> > >>>>>>>>>>>>>>>> [UPDATE], MaxLatency(us) 97983 94271
> > >>>>>>>>>>>>>>>> [UPDATE], 95thPercentileLatency(us) 3377 4147
> > >>>>>>>>>>>>>>>> [UPDATE], 99thPercentileLatency(us) 4147 4831
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>> On Thu, Apr 4, 2019 at 1:14 AM Yu Li <car...@gmail.com
> >
> > >>>>>> wrote:
> > >>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>> Thanks for the efforts boss.
> > >>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>> Since it's a new minor release, do we have performance
> > >>>>>> comparison
> > >>>>>>>>>>>>> report
> > >>>>>>>>>>>>>>>>> with 1.4.9 as we did when releasing 1.4.0? If so, any
> > >>>>>> reference?
> > >>>>>>>>>>> Many
> > >>>>>>>>>>>>>>>>> thanks!
> > >>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>> Best Regards,
> > >>>>>>>>>>>>>>>>> Yu
> > >>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>> On Thu, 4 Apr 2019 at 07:44, Andrew Purtell <
> > >>>>>> apurt...@apache.org
> > >>>>>>>>>>
> > >>>>>>>>>>>>>>>> wrote:
> > >>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> The fourth HBase 1.5.0 release candidate (RC3) is
> > >>> available
> > >>>>>> for
> > >>>>>>>>>>>>>>>> download
> > >>>>>>>>>>>>>>>>> at
> > >>>>>>>>>>>>>>>>>>
> > >>> https://dist.apache.org/repos/dist/dev/hbase/hbase-1.5.0RC3/
> > >>>>>>>>> and
> > >>>>>>>>>>>>>>>> Maven
> > >>>>>>>>>>>>>>>>>> artifacts are available in the temporary repository
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> > https://repository.apache.org/content/repositories/orgapachehbase-1292/
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> The git tag corresponding to the candidate is
> '1.5.0RC3’
> > >>>>>>>>>>>>> (b0bc7225c5).
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> A detailed source and binary compatibility report for
> > this
> > >>>>>>>>> release
> > >>>>>>>>>>> is
> > >>>>>>>>>>>>>>>>>> available for your review at
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>
> > >>>
> >
> https://dist.apache.org/repos/dist/dev/hbase/hbase-1.5.0RC3/compat-check-report.html
> > >>>>>>>>>>>>>>>>>> .
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> A list of the 115 issues resolved in this release can
> be
> > >>>>>> found
> > >>>>>>>>> at
> > >>>>>>>>>>>>>>>>>> https://s.apache.org/K4Wk . The 1.5.0 changelog is
> > >>> derived
> > >>>>>> from
> > >>>>>>>>>>> the
> > >>>>>>>>>>>>>>>>>> changelog of the last branch-1.4 release, 1.4.9.
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> Please try out the candidate and vote +1/0/-1.
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> The vote will be open for at least 72 hours. Unless
> > >>>>>> objection I
> > >>>>>>>>>>> will
> > >>>>>>>>>>>>>>>> try
> > >>>>>>>>>>>>>>>>> to
> > >>>>>>>>>>>>>>>>>> close it Friday April 12, 2019 if we have sufficient
> > >>> votes.
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> Prior to making this announcement I made the following
> > >>>>>> preflight
> > >>>>>>>>>>>>>>>> checks:
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> RAT check passes (7u80)
> > >>>>>>>>>>>>>>>>>> Unit test suite passes (7u80, 8u181)*
> > >>>>>>>>>>>>>>>>>> Opened the UI in a browser, poked around
> > >>>>>>>>>>>>>>>>>> LTT load 100M rows with 100% verification and 20%
> > updates
> > >>>>>>>>> (8u181)
> > >>>>>>>>>>>>>>>>>> ITBLL 1B rows with slowDeterministic monkey (8u181)
> > >>>>>>>>>>>>>>>>>> ITBLL 1B rows with serverKilling monkey (8u181)
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> There are known flaky tests. See HBASE-21904 and
> > >>> HBASE-21905.
> > >>>>>>>>> These
> > >>>>>>>>>>>>>>>> flaky
> > >>>>>>>>>>>>>>>>>> tests do not represent serious test failures that
> would
> > >>>>>> prevent
> > >>>>>>>>> a
> > >>>>>>>>>>>>>>>>> release.
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>> --
> > >>>>>>>>>>>>>>>>>> Best regards,
> > >>>>>>>>>>>>>>>>>> Andrew
> > >>>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> --
> > >>>>>>>>>>>>>>>> Best regards,
> > >>>>>>>>>>>>>>>> Andrew
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>> Words like orphans lost among the crosstalk, meaning
> torn
> > >>> from
> > >>>>>>>>>>> truth's
> > >>>>>>>>>>>>>>>> decrepit hands
> > >>>>>>>>>>>>>>>> - A23, Crosstalk
> > >>>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>>>
> > >>>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>
> > >>>>>>>>
> > >>>>>>
> > >>>>>
> > >>>
> > >>
> >
>
>
> --
> Best regards,
> Andrew
>
> Words like orphans lost among the crosstalk, meaning torn from truth's
> decrepit hands
>    - A23, Crosstalk
>

Reply via email to