Oops, just saw than Kenn already mentioned spark perf degradation on spark 
runner around 10/05. Sorry for the
repetition.Nevertheless, IMHO, I think it will be still worth checking PR #6181.
Etienne
Le lundi 29 octobre 2018 à 10:42 +0100, Etienne Chauchot a écrit :
> Hey,I would vote -0 : here is the explanation: 
> I took a look at Nexmark dashboards for output size and performance for all 
> the runners in all the modes around the
> date of the release cut to search for regressions. 
> I noted a regression on the performance of the spark runner. Query4, Query6, 
> Query8 and Query9 running times were
> multiplied by 2 to 3 around the date of 10/05/18. See 
> https://apache-beam-testing.appspot.com/explore?dashboard=513838
> 0291571712So I searched in the commit history of the spark runner module for 
> what happened around 10/05/18. And I
> found this commit 
> e4a1ccbaa10808d88c6ad2a687fe9f6d52392d90: Merge pull request #6181: 
> [BEAM-4783] Add bundleSize for splitting
> BoundedSources
> I don't know if it should be considered a blocker but we should definitely 
> take another look at pull request #6181
> that seems to change the way we split on spark runner.
> BestEtienne
> 
> Le vendredi 26 octobre 2018 à 18:20 +0200, Maximilian Michels a écrit :
> > +1 (binding)
> > On 26.10.18 17:45, Kenneth Knowles wrote:
> > Nice. Thanks.
> > +1
> > 
> > On Fri, Oct 26, 2018 at 8:44 AM Robert Bradshaw <rober...@google.com 
> > <mailto:rober...@google.com>> wrote:
> >     Thanks Tim!
> >     This was my only hesitation, and sounds like we're in the clear here.
> >     +1 (binding)    On Fri, Oct 26, 2018 at 5:05 PM Tim Robertson    
> > <timrobertson...@gmail.com
> > <mailto:timrobertson...@gmail.com>> wrote:     >     > A colleague and I 
> > tested on 2.7.0 and 2.8.0RC1:     >     >
> > 1. Quickstart on Spark/YARN/HDFS (CDH 5.12.0) (commented in    spreadsheet) 
> >     > 2. Our Avro to Avro pipelines on
> > Spark/YARN/HDFS (note we    backport the un-merged BEAM-5036 fix in our 
> > code)     > 3. Our Avro to Elasticsearch
> > pipelines on Spark/YARN/HDFS     >     > Everything worked, and performance 
> > was similar on both.     > We built
> > using maven pointing at    
> > https://repository.apache.org/content/repositories/orgapachebeam-1049//     
> > >     > Based
> > on this limited testing: +1     >     > Thank you to the release managers,  
> >    > Tim     >     >     > On Thu, Oct
> > 25, 2018 at 7:21 PM Tim <timrobertson...@gmail.com    
> > <mailto:timrobertson...@gmail.com>> wrote:     >>     >> I can
> > do some tests on Spark / YARN tomorrow (CEST timezone).    Sorry I’ve just 
> > been too busy to assist.     >>     >>
> > Tim     >>     >> On 25 Oct 2018, at 18:59, Kenneth Knowles 
> > <k...@apache.org    <mailto:k...@apache.org>>
> > wrote:     >>     >> I tried to do a more thorough job on this.     >>     
> > >>  - I could not reproduce the slowdown
> > in Query 9. I believe the    variance was simply high given the parameters 
> > and environment     >>  - I saw the same
> > slowdown in Query 8 when running as part of    the suite, but it vanished 
> > when I ran repeatedly on its own,
> > so    again it is not good methodology probably     >>     >> We do have 
> > the dashboard at    https://apache-beam-tes
> > ting.appspot.com/dashboard-admin though no    anomaly detection set up 
> > AFAIK.     >>     >>  - There is no issue
> > easily visible in DirectRunner:    
> > https://apache-beam-testing.appspot.com/explore?dashboard=5084698770407424  
> >    >>
> >   - There is a notable degradation in Spark runner on 10/5 for    many 
> > queries.    https://apache-beam-testing.appsp
> > ot.com/explore?dashboard=5138380291571712     >>  - Something minor 
> > happened for Dataflow around 10/1:    https://ap
> > ache-beam-testing.appspot.com/explore?dashboard=5670405876482048     >>  - 
> > Flink runner seems to have had some
> > fantastic improvements    :-)    
> > https://apache-beam-testing.appspot.com/explore?dashboard=56992575877283844 
> >     >> 
> >     >> So if there is a blocker it would really be the Spark runner    perf 
> > changes. Of course, all these except
> > Dataflow are using local    instances so may not be representative of 
> > larger scale AFAIK.     >>     >>
> > Kenn     >>     >> On Wed, Oct 24, 2018 at 9:48 AM Maximilian Michels    
> > <m...@apache.org <mailto:m...@apache.org>>
> > wrote:     >>>     >>> I've run WordCount using Quickstart with the 
> > FlinkRunner    (locally and     >>> against a
> > Flink cluster).     >>>     >>> Would give a +1 but waiting what Kenn 
> > finds.     >>>     >>> -Max     >>>     >>> On
> > 23.10.18 07:11, Ahmet Altay wrote:     >>> >     >>> >     >>> > On Mon, 
> > Oct 22, 2018 at 10:06 PM, Kenneth
> > Knowles    <k...@apache.org <mailto:k...@apache.org>     >>> > 
> > <mailto:k...@apache.org <mailto:k...@apache.org>>>
> > wrote:     >>> >     >>> >     You two did so much verification I had a 
> > hard time    finding something     >>> >   
> >  where my help was meaningful! :-)     >>> >     >>> >     I did run the 
> > Nexmark suite on the DirectRunner
> > against    2.7.0 and     >>> >     2.8.0 following     >>> >    
> > https://beam.apache.org/documentation/sdks/java/nexm
> > ark/#running-smoke-suite-on-the-directrunner-locall     >>> >         
> > <https://beam.apache.org/documentation/sdks/ja
> > va/nexmark/#running-smoke-suite-on-the-directrunner-local>.     >>> >     
> > >>> >     It is admittedly a very silly
> > test - the instructions leave     >>> >     immutability enforcement on, 
> > etc. But it does appear that    there is
> > a     >>> >     30% degradation in query 8 and 15% in query 9. These are    
> > the pure     >>> >     Java tests, not
> > the SQL variants. The rest of the queries    are close     >>> >     enough 
> > that differences are not
> > meaningful.     >>> >     >>> >     >>> > (It would be a good improvement 
> > for us to have alerts on daily     >>> >
> > benchmarks if we do not have such a concept already.)     >>> >     >>> >   
> >   >>> >     I would ask a little more
> > time to see what is going on    here - is it     >>> >     a real 
> > performance issue or an artifact of how the tests
> > are     >>> >     invoked, or ...?     >>> >     >>> >     >>> > Thank you! 
> > Much appreciated. Please let us know
> > when you are    done with     >>> > your investigation.     >>> >     >>> > 
> >     >>> >     Kenn     >>> >     >>> > 
> >    On Mon, Oct 22, 2018 at 6:20 PM Ahmet Altay    <al...@google.com 
> > <mailto:al...@google.com>     >>> >   
> >  <mailto:al...@google.com <mailto:al...@google.com>>> wrote:     >>> >     
> > >>> >         Hi all,     >>> >     >>>
> > >         Did you have a chance to review this RC? Between me    and Robert 
> > >     >>> >         we ran a significant
> > chunk of the validations. Let me    know if     >>> >         you have any 
> > questions.     >>> >     >>> >       
> >  Ahmet     >>> >     >>> >         On Thu, Oct 18, 2018 at 5:26 PM, Ahmet 
> > Altay    <al...@google.com
> > <mailto:al...@google.com>     >>> >         <mailto:al...@google.com 
> > <mailto:al...@google.com>>>    wrote:     >>>
> > >     >>> >             Hi everyone,     >>> >     >>> >             Please 
> > > review and vote on the release
> > candidate    #1 for the     >>> >             version 2.8.0, as follows:    
> >  >>> >             [ ] +1, Approve the
> > release     >>> >             [ ] -1, Do not approve the release (please    
> > provide specific     >>> >           
> >  comments)     >>> >     >>> >             The complete staging area is 
> > available for your    review,     >>> >     
> >        which includes:     >>> >             * JIRA release notes [1],     
> > >>> >             * the official Apache
> > source release to be    deployed to     >>> > dist.apache.org 
> > <http://dist.apache.org>    <http://dist.apache.org>
> > [2], which is     >>> >             signed with the key with fingerprint 
> > 6096FA00 [3],     >>> >             * all
> > artifacts to be deployed to the Maven Central     >>> >             
> > Repository [4],     >>> >             * source
> > code tag "v2.8.0-RC1" [5],     >>> >             * website pull request 
> > listing the release
> > and    publishing     >>> >             the API reference manual [6].     
> > >>> >             * Python artifacts are
> > deployed along with the source     >>> >             release to the 
> > dist.apache.org    <http://dist.apache.org> <htt
> > p://dist.apache.org> [2].     >>> >             * Validation sheet with a 
> > tab for 2.8.0 release    to help
> > with     >>> >             validation [7].     >>> >     >>> >             
> > The vote will be open for at least 72
> > hours. It    is adopted     >>> >             by majority approval, with at 
> > least 3 PMC    affirmative
> > votes.     >>> >     >>> >             Thanks,     >>> >             Ahmet  
> >    >>> >     >>> >           
> >  [1]     >>> >    
> > https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12319527&version=123439855
> >      >>
> > > >                 
> > > > <https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12319527&version=12343985>
> > > >     
> >  >>> >             [2] https://dist.apache.org/repos/dist/dev/beam/2.8.00   
> >   >>> >             <https://dist.apache
> > .org/repos/dist/dev/beam/2.8.0>     >>> >             [3] 
> > https://dist.apache.org/repos/dist/dev/beam/KEYSS     >>>
> > >             <https://dist.apache.org/repos/dist/dev/beam/KEYS>     >>> >  
> > >            [4]     >>> >    https://repo
> > sitory.apache.org/content/repositories/orgapachebeam-1049//     >>> >       
> >           <https://repository.apache.org
> > /content/repositories/orgapachebeam-1049/>     >>> >             [5] 
> > https://github.com/apache/beam/tree/v2.8.0-RC11
> >      >>> >             <https://github.com/apache/beam/tree/v2.8.0-RC1>     
> > >>> >             [6] https://github.com
> > /apache/beam-site/pull/5833     >>> >             
> > <https://github.com/apache/beam-site/pull/583> and     >>> > https
> > ://github.com/apache/beam/pull/67455     >>> >             
> > <https://github.com/apache/beam/pull/6745>     >>> >     
> >        [7]     >>> >    
> > https://docs.google.com/spreadsheets/d/1qk-N5vjXvbcEk68GjbkSZTR8AGqyNUM-oLFo_ZXBpJw/edit#gid
> > =18547128166     >>> >                 
> > <https://docs.google.com/spreadsheets/d/1qk-N5vjXvbcEk68GjbkSZTR8AGqyNUM-oLFo
> > _ZXBpJw/edit#gid=1854712816>     >>> >     >>> >     >>> >

Reply via email to