Hi Alijoscha, I have merged the following issues found in RC1 and RC2 into the release-1.8 branch.
- Add `frocksdbjni` dependency in NOTICE - FLINK-11950 - Improve end-to-end test - FLINK-11892 - Deprecated Window API - FLINK-11918 Currently, I am performing functional testing of YARN cluster mode and multiple operating systems. I think these tests result will be valid for the next RC as well. Best, Jincheng Shaoxuan Wang <wshaox...@gmail.com> 于2019年3月19日周二 上午11:45写道: > I tested RC2 with the following items: > - Maven Central Repository contains all artifacts > - Built the source with Maven (ensured all source files have Apache > headers) > - Checked checksums and GPG files (for instance, flink-core-1.8.0.jar) that > match the corresponding release files > - Verified that the source archives do not contains any binaries > - Manually executed the tests in IDE > > @Alijoscha, per the discussion in RC1, we should consider sending the > release vote to the user group to gather more feedbacks. > @Gordon and @Yu, I noticed there are some perf regressions occurred on > Jan.29 (and consistently exist after that) for the tests > of stateBackends.FS and stateBackends.ROCKS_INC. > > http://codespeed.dak8s.net:8000/timeline/#/?exe=1&ben=stateBackends.FS&env=2&revs=200&equid=off&quarts=on&extr=on > > http://codespeed.dak8s.net:8000/timeline/#/?exe=1&ben=tumblingWindow&env=2&revs=200&equid=off&quarts=on&extr=on > @Chesnay, how did you notice and capture the license Notice issue? It seems > very difficult to track. I am trying to understand the way how we organized > the license Notice. For this case, why do we only need to add the > dependency of 5.17.2-artisans-1.0 to the Notice file of flink-dist? It > seems there are other modules that bundles dependency of the > flink-statebackend. > > Regards, > Shaoxuan > > > > On Tue, Mar 19, 2019 at 10:49 AM Tzu-Li (Gordon) Tai <tzuli...@apache.org> > wrote: > > > Hi, > > > > The regressions in the benchmark were also brought up earlier in this > > thread by Yu. > > From the previous investigations, these are the commits that touched > > relevant serializers (TupleSerializer, AvroSerializer, RowSerializer) > > around Jan / Feb: > > > > TupleSerializer - > > 73e4d0ecfd (Thu Feb 14 11:56:51 2019 +0800) [FLINK-10493] Migrate all > > subclasses of TupleSerializerBase to use new serialization compatibility > > abstractions > > > > AvroSerializer - > > 09bb7bbc0f (Wed Feb 20 09:52:57 2019 +0100) [FLINK-9803] Drop canEqual() > > from TypeSerializer > > 479ebd5987 (Tue Jan 29 15:06:09 2019 +0800) [FLINK-11436] [avro] Manually > > Java-deserialize AvroSerializer for backwards compatibility > > > > RowSerializer - > > 09bb7bbc0f (Wed Feb 20 09:52:57 2019 +0100) [FLINK-9803] Drop canEqual() > > from TypeSerializer > > b434b32c08 (Wed Jan 30 22:53:27 2019 +0800) [FLINK-11329] [table] > Migrating > > the RowSerializer to use new compatibility API > > > > The odd thing is, the times of these commits don't really match the drops > > in their respective benchmark result timeline. > > For tupleKeyBy benchmark, the drop started around end of January, where > as > > the TupleSerializer was only last touched mid February. > > For the serializerRow and serializerAvro benchmarks, the drop occurred > > around mid February, where as the only commit around that time was > > 09bb7bbc0f ([FLINK-9803] Drop canEqual() from TypeSerializer). > > > > The only possible explanation that I can provide for the AvroSerializer > > benchmark drop for now, is due to 479ebd5987 (FLINK-11436). > > That commit had to touch the `readObject` method of the AvroSerializer, > > which introduced some type checks / casts. > > This may have caused regression in deserializing the AvroSerializer > itself, > > which would have been accounted for in the job initialization phase of > the > > serializerAvro benchmark. > > The commit should not have affected per-record performance of the > > AvroSerializer. > > However, again, the commit time for 479ebd5987 was end of January, where > as > > the benchmark result drop occurred around mid February for the > > serializerAvro benchmark. > > > > We haven't managed to identify any solid causes so far, only the above > > speculations. > > > > Cheers, > > Gordon > > > > > > On Tue, Mar 19, 2019 at 1:36 AM Stephan Ewen <se...@apache.org> wrote: > > > > > Piotr and me discovered a possible issue in the benchmarks. > > > > > > Looking at the time graphs, there seems to be one issue coming around > end > > > of January. It increased network throughput, but decreased overall > > > performance and added more variation in time (possibly through GC). > Check > > > the trend in these graphs: > > > > > > Increased Throughput: > > > > > > > > > http://codespeed.dak8s.net:8000/timeline/#/?exe=1&ben=networkThroughput.1000,100ms&env=2&revs=200&equid=off&quarts=on&extr=on > > > Higher variance in count benchmark: > > > > > > > > > http://codespeed.dak8s.net:8000/timeline/#/?exe=1&ben=benchmarkCount&env=2&revs=200&equid=off&quarts=on&extr=on > > > Drop in tuple-key-by performance trend: > > > > > > > > > http://codespeed.dak8s.net:8000/timeline/#/?exe=1&ben=tupleKeyBy&env=2&revs=200&equid=off&quarts=on&extr=on > > > > > > In addition, the Avro and Row serializers seem to have a performance > drop > > > since mid February: > > > > > > > > > http://codespeed.dak8s.net:8000/timeline/#/?exe=1&ben=serializerAvro&env=2&revs=200&equid=off&quarts=on&extr=on > > > > > > > > > http://codespeed.dak8s.net:8000/timeline/#/?exe=1&ben=serializerRow&env=2&revs=200&equid=off&quarts=on&extr=on > > > > > > @Gordon any idea what could be the cause of this? > > > > > > > > > On Mon, Mar 18, 2019 at 3:08 PM Yu Li <car...@gmail.com> wrote: > > > > > > > Watching the benchmark data for days and indeed it's normalized for > the > > > > time being. However, the result seems to be unstable. I also tried > the > > > > benchmark locally and observed obvious wave even with the same > > commit... > > > > > > > > I guess we may need to improve it such as increasing the > > > > RECORDS_PER_INVOCATION to generate a reproducible result. IMHO a > stable > > > > micro benchmark is important to verify perf-related improvements > (and I > > > > think the benchmark and website are already great ones but just need > > some > > > > love). Let me mark this as one of my backlog and will open a JIRA > when > > > > prepared. > > > > > > > > Anyway good to know it's not a regression, and thanks for the efforts > > > spent > > > > on checking it over! @Gordon @Chesnay > > > > > > > > Best Regards, > > > > Yu > > > > > > > > > > > > On Fri, 15 Mar 2019 at 19:20, Chesnay Schepler <ches...@apache.org> > > > wrote: > > > > > > > > > The regressions is already normalizing again. I'd observer it > further > > > > > before doing anything. > > > > > > > > > > The same applies to the benchmarkCount which tanked even more in > that > > > > > same run. > > > > > > > > > > On 15.03.2019 06:02, Tzu-Li (Gordon) Tai wrote: > > > > > > @Yu > > > > > > Thanks for reporting that Yu, great that this was noticed. > > > > > > > > > > > > The serializerAvro case seems to only be testing on-wire > > > serialization. > > > > > > I checked the changes to the `AvroSerializer`, and it seems like > > > > > > FLINK-11436 [1] with commit 479ebd59 was the only change that may > > > have > > > > > > affected that. > > > > > > That commit wasn't introduced exactly around the time when the > > > > indicated > > > > > > performance regression occurred, but was still before the > > regression. > > > > > > The commit introduced some instanceof type checks / type casting > in > > > the > > > > > > readObject of the AvroSerializer, which may have caused this. > > > > > > > > > > > > Currently investigating further. > > > > > > > > > > > > Cheers, > > > > > > Gordon > > > > > > > > > > > > On Fri, Mar 15, 2019 at 11:45 AM Yu Li <car...@gmail.com> wrote: > > > > > > > > > > > >> Hi Aljoscha and all, > > > > > >> > > > > > >> From our performance benchmark web site ( > > > > > >> http://codespeed.dak8s.net:8000/changes/) I observed a > noticeable > > > > > >> regression (-6.92%) on the serializerAvro case comparing the > > latest > > > > 100 > > > > > >> revisions, which may need some attention. Thanks. > > > > > >> > > > > > >> Best Regards, > > > > > >> Yu > > > > > >> > > > > > >> > > > > > >> On Thu, 14 Mar 2019 at 20:42, Aljoscha Krettek < > > aljos...@apache.org > > > > > > > > > >> wrote: > > > > > >> > > > > > >>> Hi everyone, > > > > > >>> Please review and vote on the release candidate 2 for Flink > > 1.8.0, > > > as > > > > > >>> follows: > > > > > >>> [ ] +1, Approve the release > > > > > >>> [ ] -1, Do not approve the release (please provide specific > > > comments) > > > > > >>> > > > > > >>> > > > > > >>> The complete staging area is available for your review, which > > > > includes: > > > > > >>> * JIRA release notes [1], > > > > > >>> * the official Apache source release and binary convenience > > > releases > > > > to > > > > > >> be > > > > > >>> deployed to dist.apache.org <http://dist.apache.org/> [2], > which > > > are > > > > > >>> signed with the key with fingerprint > > > > > >>> F2A67A8047499BBB3908D17AA8F4FD97121D7293 [3], > > > > > >>> * all artifacts to be deployed to the Maven Central Repository > > [4], > > > > > >>> * source code tag "release-1.8.0-rc2" [5], > > > > > >>> * website pull request listing the new release [6] > > > > > >>> * website pull request adding announcement blog post [7]. > > > > > >>> > > > > > >>> The vote will be open for at least 72 hours. It is adopted by > > > > majority > > > > > >>> approval, with at least 3 PMC affirmative votes. > > > > > >>> > > > > > >>> Thanks, > > > > > >>> Aljoscha > > > > > >>> > > > > > >>> [1] > > > > > >>> > > > > > >> > > > > > > > > > > > > > > > https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12315522&version=12344274 > > > > > >>> < > > > > > >>> > > > > > >> > > > > > > > > > > > > > > > https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12315522&version=12344274 > > > > > >>> [2] > > https://dist.apache.org/repos/dist/dev/flink/flink-1.8.0-rc2/ > > > < > > > > > >>> https://dist.apache.org/repos/dist/dev/flink/flink-1.8.0-rc2/> > > > > > >>> [3] https://dist.apache.org/repos/dist/release/flink/KEYS < > > > > > >>> https://dist.apache.org/repos/dist/release/flink/KEYS> > > > > > >>> [4] > > > > > >> > > > > > https://repository.apache.org/content/repositories/orgapacheflink-1213 > > > > > >>> < > > > > > > > > > https://repository.apache.org/content/repositories/orgapacheflink-1210/ > > > > > >>> > > > > > >>> [5] > > > > > >>> > > > > > >> > > > > > > > > > > > > > > > https://gitbox.apache.org/repos/asf?p=flink.git;a=tag;h=c77a329b71e3068bfde965ae91921ad5c47246dd > > > > > >>> < > > > > > >>> > > > > > >> > > > > > > > > > > > > > > > https://gitbox.apache.org/repos/asf?p=flink.git;a=tag;h=2d00b1c26d7b4554707063ab0d1d6cc236cfe8a5 > > > > > >>> [6] https://github.com/apache/flink-web/pull/180 < > > > > > >>> https://github.com/apache/flink-web/pull/180> > > > > > >>> [7] https://github.com/apache/flink-web/pull/179 < > > > > > >>> https://github.com/apache/flink-web/pull/179> > > > > > >>> > > > > > >>> P.S. The difference to the previous RC1 is very small, you can > > > fetch > > > > > the > > > > > >>> two tags and do a "git log > release-1.8.0-rc1..release-1.8.0-rc2” > > to > > > > see > > > > > >> the > > > > > >>> difference in commits. Its fixes for the issues that led to the > > > > > >>> cancellation of the previous RC plus smaller fixes. Most > > > > > >>> verification/testing that was carried out should apply as is to > > > this > > > > > RC. > > > > > > > > > > > > > > > > > > > > > > > > >