Update: Implemented and deployed.

On 02/08/2019 12:11, Jark Wu wrote:
Wow. That's great! Thanks Chesnay.

On Fri, 2 Aug 2019 at 17:50, Chesnay Schepler <ches...@apache.org <mailto:ches...@apache.org>> wrote:

    I'm currently modifying the cibot to do this automatically; should be
    finished until Monday.

    On 02/08/2019 07:41, Jark Wu wrote:
    > Hi Chesnay,
    >
    > Can we assign Flink Committers the permission of flink-ci/flink
    repo?
    > Several times, when I pushed some new commits, the old build
    jobs are still
    > in pending and not canceled.
    > Before we fix that, we can manually cancel some old jobs to save
    build
    > resource.
    >
    > Best,
    > Jark
    >
    >
    > On Wed, 10 Jul 2019 at 16:17, Chesnay Schepler
    <ches...@apache.org <mailto:ches...@apache.org>> wrote:
    >
    >> Your best bet would be to check the first commit in the PR and
    check the
    >> parent commit.
    >>
    >> To re-run things, you will have to rebase the PR on the latest
    master.
    >>
    >> On 10/07/2019 03:32, Kurt Young wrote:
    >>> Thanks for all your efforts Chesnay, it indeed improves a lot
    for our
    >>> develop experience. BTW, do you know how to find the master branch
    >>> information which the CI runs with?
    >>>
    >>> For example, like this one:
    >>> https://travis-ci.com/flink-ci/flink/jobs/214542568
    >>> It shows pass with the commits, which rebased on the master
    when the CI
    >>> is triggered. But it's both possible that the master branch CI
    runs on is
    >>> the
    >>> same or different with current master. If it's the same, I can
    simply
    >> rely
    >>> on the
    >>> passed information to push commits, but if it's not, I think i
    should
    >> find
    >>> another
    >>> way to re-trigger tests based on the newest master.
    >>>
    >>> Do you know where can I get such information?
    >>>
    >>> Best,
    >>> Kurt
    >>>
    >>>
    >>> On Tue, Jul 9, 2019 at 3:27 AM Chesnay Schepler
    <ches...@apache.org <mailto:ches...@apache.org>>
    >> wrote:
    >>>> The kinks have been worked out; the bot is running again and
    pr builds
    >>>> are yet again no longer running on ASF resources.
    >>>>
    >>>> PRs are mirrored to: https://github.com/flink-ci/flink
    >>>> Bot source: https://github.com/flink-ci/ci-bot
    >>>>
    >>>> On 08/07/2019 17:14, Chesnay Schepler wrote:
    >>>>> I have temporarily re-enabled running PR builds on the ASF
    account;
    >>>>> migrating to the Travis subscription caused some issues in
    the bot
    >>>>> that I have to fix first.
    >>>>>
    >>>>> On 07/07/2019 23:01, Chesnay Schepler wrote:
    >>>>>> The vote has passed unanimously in favor of migrating to a
    separate
    >>>>>> Travis account.
    >>>>>>
    >>>>>> I will now set things up such that no PullRequest is no
    longer run on
    >>>>>> the ASF servers.
    >>>>>> This is a major setup in reducing our usage of ASF resources.
    >>>>>> For the time being we'll use free Travis plan for flink-ci
    (i.e. 5
    >>>>>> workers, which is the same the ASF gives us). Over the
    course of the
    >>>>>> next week we'll setup the Ververica subscription to
    increase this
    >> limit.
    >>>>>>   From now now, a bot will mirror all new and updated
    PullRequests to a
    >>>>>> mirror repository (https://github.com/flink-ci/flink-ci)
    and write an
    >>>>>> update into the PR once the build is complete.
    >>>>>> I have ran the bots for the past 3 days in parallel to our
    existing
    >>>>>> Travis and it was working without major issues.
    >>>>>>
    >>>>>> The biggest change that contributors will see is that
    there's no
    >>>>>> longer a icon next to each commit. We may revisit this in
    the future.
    >>>>>>
    >>>>>> I'll setup a repo with the source of the bot later.
    >>>>>>
    >>>>>> On 04/07/2019 10:46, Chesnay Schepler wrote:
    >>>>>>> I've raised a JIRA
    >>>>>>> <https://issues.apache.org/jira/browse/INFRA-18703>with
    INFRA to
    >>>>>>> inquire whether it would be possible to switch to a
    different Travis
    >>>>>>> account, and if so what steps would need to be taken.
    >>>>>>> We need a proper confirmation from INFRA since we are not
    in full
    >>>>>>> control of the flink repository (for example, we cannot
    access the
    >>>>>>> settings page).
    >>>>>>>
    >>>>>>> If this is indeed possible, Ververica is willing sponsor a
    Travis
    >>>>>>> account for the Flink project.
    >>>>>>> This would provide us with more than enough resources than
    we need.
    >>>>>>>
    >>>>>>> Since this makes the project more reliant on resources
    provided by
    >>>>>>> external companies I would like to vote on this.
    >>>>>>>
    >>>>>>> Please vote on this proposal, as follows:
    >>>>>>> [ ] +1, Approve the migration to a Ververica-sponsored Travis
    >>>>>>> account, provided that INFRA approves
    >>>>>>> [ ] -1, Do not approach the migration to a Ververica-sponsored
    >>>>>>> Travis account
    >>>>>>>
    >>>>>>> The vote will be open for at least 24h, and until we have
    >>>>>>> confirmation from INFRA. The voting period may be shorter
    than the
    >>>>>>> usual 3 days since our current is effectively not working.
    >>>>>>>
    >>>>>>> On 04/07/2019 06:51, Bowen Li wrote:
    >>>>>>>> Re: > Are they using their own Travis CI pool, or did the
    switch to
    >>>>>>>> an entirely different CI service?
    >>>>>>>>
    >>>>>>>> I reached out to Wes and Krisztián from Apache Arrow PMC.
    They are
    >>>>>>>> currently moving away from ASF's Travis to their own
    in-house metal
    >>>>>>>> machines at [1] with custom CI application at [2].
    They've seen
    >>>>>>>> significant improvement w.r.t both much higher
    performance and
    >>>>>>>> basically no resource waiting time, "night-and-day"
    difference
    >>>>>>>> quoting Wes.
    >>>>>>>>
    >>>>>>>> Re: > If we can just switch to our own Travis pool, just
    for our
    >>>>>>>> project, then this might be something we can do fairly
    quickly?
    >>>>>>>>
    >>>>>>>> I believe so, according to [3] and [4]
    >>>>>>>>
    >>>>>>>>
    >>>>>>>> [1] https://ci.ursalabs.org/ <https://ci.ursalabs.org/#/>
    >>>>>>>> [2] https://github.com/ursa-labs/ursabot
    >>>>>>>> [3]
    >>>>>>>>
    >>
    https://docs.travis-ci.com/user/migrate/open-source-repository-migration
    >>>>>>>> [4]
    >>>>>>>>
    >>
    https://docs.travis-ci.com/user/migrate/open-source-on-travis-ci-com
    >>>>>>>>
    >>>>>>>>
    >>>>>>>> On Wed, Jul 3, 2019 at 12:01 AM Chesnay Schepler
    >>>>>>>> <ches...@apache.org <mailto:ches...@apache.org>
    <mailto:ches...@apache.org <mailto:ches...@apache.org>>> wrote:
    >>>>>>>>
    >>>>>>>>       Are they using their own Travis CI pool, or did the
    switch to
    >> an
    >>>>>>>>       entirely different CI service?
    >>>>>>>>
    >>>>>>>>       If we can just switch to our own Travis pool, just
    for our
    >>>>>>>>       project, then
    >>>>>>>>       this might be something we can do fairly quickly?
    >>>>>>>>
    >>>>>>>>       On 03/07/2019 05:55, Bowen Li wrote:
    >>>>>>>>       > I responded in the INFRA ticket [1] that I
    believe they are
    >>>>>>>>       using a wrong
    >>>>>>>>       > metric against Flink and the total build time is
    a completely
    >>>>>>>>       different
    >>>>>>>>       > thing than guaranteed build capacity.
    >>>>>>>>       >
    >>>>>>>>       > My response:
    >>>>>>>>       >
    >>>>>>>>       > "As mentioned above, since I started to pay
    attention to
    >> Flink's
    >>>>>>>>       build
    >>>>>>>>       > queue a few tens of days ago, I'm in Seattle and
    I saw no
    >> build
    >>>>>>>>       was kicking
    >>>>>>>>       > off in PST daytime in weekdays for Flink. Our
    teammates in
    >> China
    >>>>>>>>       and Europe
    >>>>>>>>       > have also reported similar observations. So we
    need to
    >> evaluate
    >>>>>>>>       how the
    >>>>>>>>       > large total build time came from - if 1) your
    number and 2)
    >> our
    >>>>>>>>       > observations from three locations that cover
    pretty much a
    >> full
    >>>>>>>>       day, are
    >>>>>>>>       > all true, I **guess** one reason can be that -
    highly likely
    >> the
    >>>>>>>>       extra
    >>>>>>>>       > build time came from weekends when other Apache
    projects may
    >> be
    >>>>>>>>       idle and
    >>>>>>>>       > Flink just drains hard its congested queue.
    >>>>>>>>       >
    >>>>>>>>       > Please be aware of that we're not complaining
    about the lack
    >> of
    >>>>>>>>       resources
    >>>>>>>>       > in general, I'm complaining about the lack of
    **stable,
    >>>>>>>> dedicated**
    >>>>>>>>       > resources. An example for the latter one is,
    currently even
    >> if
    >>>>>>>>       no build is
    >>>>>>>>       > in Flink's queue and I submit a request to be the
    queue head
    >>>>>>>> in PST
    >>>>>>>>       > morning, my build won't even start in 6-8+h. That
    is an
    >> absurd
    >>>>>>>>       amount of
    >>>>>>>>       > waiting time.
    >>>>>>>>       >
    >>>>>>>>       > That's saying, if ASF INFRA decides to adopt a
    quota system
    >> and
    >>>>>>>>       grants
    >>>>>>>>       > Flink five DEDICATED servers that runs all the
    time only for
    >>>>>>>>       Flink, that'll
    >>>>>>>>       > be PERFECT and can totally solve our problem now.
    >>>>>>>>       >
    >>>>>>>>       > Please be aware of that we're not complaining
    about the lack
    >> of
    >>>>>>>>       resources
    >>>>>>>>       > in general, I'm complaining about the lack of
    **stable,
    >>>>>>>> dedicated**
    >>>>>>>>       > resources. An example for the latter one is,
    currently even
    >> if
    >>>>>>>>       no build is
    >>>>>>>>       > in Flink's queue and I submit a request to be the
    queue head
    >>>>>>>> in PST
    >>>>>>>>       > morning, my build won't even start in 6-8+h. That
    is an
    >> absurd
    >>>>>>>>       amount of
    >>>>>>>>       > waiting time.
    >>>>>>>>       >
    >>>>>>>>       >
    >>>>>>>>       > That's saying, if ASF INFRA decides to adopt a
    quota system
    >> and
    >>>>>>>>       grants
    >>>>>>>>       > Flink five DEDICATED servers that runs all the
    time only for
    >>>>>>>>       Flink, that'll
    >>>>>>>>       > be PERFECT and can totally solve our problem now.
    >>>>>>>>       >
    >>>>>>>>       > I feel what's missing in the ASF INFRA's Travis
    resource
    >> pool is
    >>>>>>>>       some level
    >>>>>>>>       > of build capacity SLAs and certainty"
    >>>>>>>>       >
    >>>>>>>>       >
    >>>>>>>>       > Again, I believe there are differences in nature
    of these two
    >>>>>>>>       problems,
    >>>>>>>>       > long build time v.s. lack of dedicated build
    resource. That's
    >>>>>>>>       saying,
    >>>>>>>>       > shortening build time may relieve the situation,
    and may not.
    >>>>>>>>       I'm sightly
    >>>>>>>>       > negative on disabling IT cases for PRs, due to
    the downside
    >> is
    >>>>>>>>       that we are
    >>>>>>>>       > at risk of any potential bugs in PR that UTs
    doesn't catch,
    >> and
    >>>>>>>>       may cost a
    >>>>>>>>       > lot more to fix and if it slows others down or
    even block
    >>>>>>>>       others, but am
    >>>>>>>>       > open to others opinions on it.
    >>>>>>>>       >
    >>>>>>>>       > AFAICT from INFRA ticket[1], donating to ASF
    INFRA won't be
    >>>>>>>>       feasible to
    >>>>>>>>       > solve our problem since INFRA's pool is fully
    shared and they
    >>>>>>>>       have no
    >>>>>>>>       > control and finer insights over resource
    allocation to a
    >>>>>>>>       specific Apache
    >>>>>>>>       > project. As mentioned in [1], Apache Arrow is
    moving away
    >> from
    >>>>>>>>       ASF INFRA
    >>>>>>>>       > Travis pool (they are actually surprised Flink
    hasn't plan
    >> to do
    >>>>>>>>       so). I
    >>>>>>>>       > know that Spark is on its own build infra. If we
    all agree
    >> that
    >>>>>>>>       funding our
    >>>>>>>>       > own build infra, I'd be glad to help investigate any
    >> potential
    >>>>>>>>       options
    >>>>>>>>       > after releasing 1.9 since I'm super busy with 1.9
    now.
    >>>>>>>>       >
    >>>>>>>>       > [1] https://issues.apache.org/jira/browse/INFRA-18533
    >>>>>>>>       >
    >>>>>>>>       >
    >>>>>>>>       >
    >>>>>>>>       > On Tue, Jul 2, 2019 at 4:46 AM Chesnay Schepler
    >>>>>>>>       <ches...@apache.org <mailto:ches...@apache.org>
    <mailto:ches...@apache.org <mailto:ches...@apache.org>>> wrote:
    >>>>>>>>       >
    >>>>>>>>       >> As a short-term stopgap, since we can assume
    this issue to
    >>>>>>>>       become much
    >>>>>>>>       >> worse in the following days/weeks, we could
    disable IT
    >> cases in
    >>>>>>>>       PRs and
    >>>>>>>>       >> only run them on master.
    >>>>>>>>       >>
    >>>>>>>>       >> On 02/07/2019 12:03, Chesnay Schepler wrote:
    >>>>>>>>       >>> People really have to stop thinking that just
    because
    >>>>>>>>       something works
    >>>>>>>>       >>> for us it is also a good solution.
    >>>>>>>>       >>> Also, please remember that our builds run for
    2h from
    >> start to
    >>>>>>>>       finish,
    >>>>>>>>       >>> and not the 14 _minutes_ it takes for zeppelin.
    >>>>>>>>       >>> We are dealing with an entirely different scale
    here, both
    >> in
    >>>>>>>>       terms of
    >>>>>>>>       >>> build times and number of builds.
    >>>>>>>>       >>>
    >>>>>>>>       >>> In this very thread people have been
    complaining about long
    >>>>>>>> queue
    >>>>>>>>       >>> times for their builds. Surprise, other Apache
    projects
    >>>>>>>> have been
    >>>>>>>>       >>> suffering the very same thing due to us not
    controlling our
    >>>>>>>> build
    >>>>>>>>       >>> times. While switching services (be it Jenkins,
    CircleCI or
    >>>>>>>>       whatever)
    >>>>>>>>       >>> will possibly work for us (and these options
    are actually
    >>>>>>>>       attractive,
    >>>>>>>>       >>> like CircleCI's proper support for build
    artifacts), it
    >>>>>>>> will also
    >>>>>>>>       >>> result in us likely negatively affecting other
    projects in
    >>>>>>>>       significant
    >>>>>>>>       >>> ways.
    >>>>>>>>       >>>
    >>>>>>>>       >>> Sure, the Jenkins setup has a good user
    experience for us,
    >> at
    >>>>>>>>       the cost
    >>>>>>>>       >>> of blocking Jenkins workers for a _lot_ of
    time. Right now
    >> we
    >>>>>>>>       have 25
    >>>>>>>>       >>> PR's in our queue; that's possibly 50h we'd
    consume of
    >> Jenkins
    >>>>>>>>       >>> resources, and the European contributors
    haven't even
    >> really
    >>>>>>>>       started yet.
    >>>>>>>>       >>>
    >>>>>>>>       >>> FYI, the latest INFRA response from INFRA-18533:
    >>>>>>>>       >>>
    >>>>>>>>       >>> "Our rough metrics shows that Flink used over
    5800 hours of
    >>>>>>>>       build time
    >>>>>>>>       >>> last month. That is equal to EIGHT servers
    running 24/7 for
    >>>>>>>>       the ENTIRE
    >>>>>>>>       >>> MONTH. EIGHT. nonstop.
    >>>>>>>>       >>> When we discovered this last night, we
    discussed it some
    >> and
    >>>>>>>>       are going
    >>>>>>>>       >>> to tune down Flink to allow only five executors
    maximum. We
    >>>>>>>> cannot
    >>>>>>>>       >>> allow Flink to consume so much of a Foundation
    shared
    >>>>>>>> resource."
    >>>>>>>>       >>>
    >>>>>>>>       >>> So yes, we either
    >>>>>>>>       >>> a) have to heavily reduce our CI usage or
    >>>>>>>>       >>> b) fund our own, either maintaining it ourselves or
    >> donating
    >>>>>>>>       to Apache.
    >>>>>>>>       >>>
    >>>>>>>>       >>> On 02/07/2019 05:11, Bowen Li wrote:
    >>>>>>>>       >>>> By looking at the git history of the Jenkins
    script, its
    >> core
    >>>>>>>>       part
    >>>>>>>>       >>>> was finished in March 2017 (and only two minor
    update in
    >>>>>>>>       2017/2018),
    >>>>>>>>       >>>> so it's been running for over two years now
    and feels like
    >>>>>>>>       Zepplin
    >>>>>>>>       >>>> community has been quite happy with it. @Jeff
    Zhang
    >>>>>>>>       >>>> <mailto:zjf...@gmail.com
    <mailto:zjf...@gmail.com> <mailto:zjf...@gmail.com
    <mailto:zjf...@gmail.com>>> can
    >> you
    >>>>>>>>       share your insights and user
    >>>>>>>>       >>>> experience with the Jenkins+Travis approach?
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> Things like:
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> - has the approach completely solved the
    resource capacity
    >>>>>>>>       problem
    >>>>>>>>       >>>> for Zepplin community? is Zepplin community
    happy with the
    >>>>>>>>       result?
    >>>>>>>>       >>>> - is the whole configuration chain stable
    (e.g. uptime)
    >>>>>>>> enough?
    >>>>>>>>       >>>> - how often do you need to maintain the
    Jenkins infra? how
    >>>>>>>> many
    >>>>>>>>       >>>> people are usually involved in maintenance and
    bug-fixes?
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> The downside of this approach seems mostly to
    be on the
    >>>>>>>>       maintenance
    >>>>>>>>       >>>> to me - maintain the script and Jenkins infra.
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> ** Having Our Own Travis-CI.com Account **
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> Another alternative I've been thinking of is
    to have our
    >> own
    >>>>>>>>       >>>> travis-ci.com <http://travis-ci.com>
    <http://travis-ci.com> <
    >> http://travis-ci.com>
    >>>>>>>>       account with paid dedicated
    >>>>>>>>       >>>> resources. Note travis-ci.org
    <http://travis-ci.org> <http://travis-ci.org>
    >>>>>>>>       <http://travis-ci.org> is the free
    >>>>>>>>       >>>> version and travis-ci.com
    <http://travis-ci.com> <http://travis-ci.com>
    >>>>>>>>       <http://travis-ci.com> is the commercial
    >>>>>>>>       >>>> version. We currently use a shared resource
    pool managed
    >> by
    >>>>>>>>       ASK INFRA
    >>>>>>>>       >>>> team on travis-ci.org <http://travis-ci.org>
    <http://travis-ci.org>
    >>>>>>>>       <http://travis-ci.org>, but we have no control
    >>>>>>>>       >>>> over it - we can't see how it's configured,
    how much
    >>>>>>>>       resources are
    >>>>>>>>       >>>> available, how resources are allocated among
    Apache
    >> projects,
    >>>>>>>>       etc.
    >>>>>>>>       >>>> The nice thing about having an account on
    travis-ci.com <http://travis-ci.com>
    >>>>>>>>       <http://travis-ci.com>
    >>>>>>>>       >>>> <http://travis-ci.com> are:
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> - relatively low cost with much better
    resource guarantee
    >>>>>>>>       than what
    >>>>>>>>       >>>> we currently have [1]: $249/month with 5 dedicated
    >>>>>>>> concurrency,
    >>>>>>>>       >>>> $489/month with 10 concurrency
    >>>>>>>>       >>>> - low maintenance work compared to using Jenkins
    >>>>>>>>       >>>> - (potentially) no migration cost according to
    Travis's
    >>>>>>>> doc [2]
    >>>>>>>>       >>>> (pending verification)
    >>>>>>>>       >>>> - full control over the build
    capacity/configuration
    >>>>>>>> compared to
    >>>>>>>>       >>>> using ASF INFRA's pool
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> I'd be surprised if we as such a vibrant
    community cannot
    >>>>>>>>       find and
    >>>>>>>>       >>>> fund $249*12=$2988 a year in exchange for a
    much better
    >>>>>>>> developer
    >>>>>>>>       >>>> experience and much higher productivity.
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> [1] https://travis-ci.com/plans
    >>>>>>>>       >>>> [2]
    >>>>>>>>       >>>>
    >>>>>>>>       >>
    >>>>>>>>
    >>
    https://docs.travis-ci.com/user/migrate/open-source-repository-migration
    >>>>>>>>       >>>> On Sat, Jun 29, 2019 at 8:39 AM Chesnay Schepler
    >>>>>>>>       <ches...@apache.org <mailto:ches...@apache.org>
    <mailto:ches...@apache.org <mailto:ches...@apache.org>>
    >>>>>>>>       >>>> <mailto:ches...@apache.org
    <mailto:ches...@apache.org> <mailto:ches...@apache.org
    <mailto:ches...@apache.org>>>>
    >>>>>>>> wrote:
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> So yes, the Jenkins job keeps pulling the
    state from
    >>>>>>>>       Travis until it
    >>>>>>>>       >>>> finishes.
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> Note sure I'm comfortable with the idea of using
    >> Jenkins
    >>>>>>>>       workers
    >>>>>>>>       >>>> just to
    >>>>>>>>       >>>> idle for a several hours.
    >>>>>>>>       >>>>
    >>>>>>>>       >>>> On 29/06/2019 14:56, Jeff Zhang wrote:
    >>>>>>>>       >>>> > Here's what zeppelin community did, we make a
    >> python
    >>>>>>>>       script to
    >>>>>>>>       >>>> check the
    >>>>>>>>       >>>> > build status of pull request.
    >>>>>>>>       >>>> > Here's script:
    >>>>>>>>       >>>> >
    >>>>>>>>
    https://github.com/apache/zeppelin/blob/master/travis_check.py
    >>>>>>>>       >>>> >
    >>>>>>>>       >>>> > And this is the script we used in Jenkins build
    >> job.
    >>>>>>>>       >>>> >
    >>>>>>>>       >>>> > if [ -f "travis_check.py" ]; then
    >>>>>>>>       >>>> >    git log -n 1
    >>>>>>>>       >>>> >    STATUS=$(curl -s $BUILD_URL | grep -e "GitHub
    >> pull
    >>>>>>>>       >>>> request.*from.*" | sed
    >>>>>>>>       >>>> > 's/.*GitHub pull request <a
    >>>>>>>>       >>>> >
    >> href=\"\(https[^"]*\).*from[^"]*.\(https[^"]*\).*/\1
    >>>>>>>>       \2/g')
    >>>>>>>>       >>>> >    AUTHOR=$(echo $STATUS | sed
    >> 's/.*[/]\(.*\)$/\1/g')
    >>>>>>>>       >>>> >    PR=$(echo $STATUS | awk '{print $1}' | sed
    >>>>>>>>       >>>> 's/.*[/]\(.*\)$/\1/g')
    >>>>>>>>       >>>> >    #COMMIT=$(git log -n 1 | grep "^Merge:" | awk
    >>>>>>>>       '{print $3}')
    >>>>>>>>       >>>> >    #if [ -z $COMMIT ]; then
    >>>>>>>>       >>>> >    #  COMMIT=$(curl -s
    >>>>>>>>       >>>>
    https://api.github.com/repos/apache/zeppelin/pulls/$PR
    >>>>>>>>       >>>> > | grep -e "\"label\":" -e "\"ref\":" -e
    "\"sha\":"
    >> |
    >>>>>>>>       tr '\n' ' '
    >>>>>>>>       >>>> | sed
    >>>>>>>>       >>>> > 's/\(.*sha[^,]*,\)\(.*ref.*\)/\1 = \2/g' | tr =
    >> '\n' |
    >>>>>>>>       grep -v
    >>>>>>>>       >>>> "apache:" |
    >>>>>>>>       >>>> > sed 's/.*sha.[^"]*["]\([^"]*\).*/\1/g')
    >>>>>>>>       >>>> >    #fi
    >>>>>>>>       >>>> >
    >>>>>>>>       >>>> >    # get commit hash from PR
    >>>>>>>>       >>>> >    COMMIT=$(curl -s
    >>>>>>>>       >>>>
    https://api.github.com/repos/apache/zeppelin/pulls/$PR |
    >>>>>>>>       >>>> > grep -e "\"label\":" -e "\"ref\":" -e
    "\"sha\":" |
    >> tr
    >>>>>>>>       '\n' ' '
    >>>>>>>>       >>>> | sed
    >>>>>>>>       >>>> > 's/\(.*sha[^,]*,\)\(.*ref.*\)/\1 = \2/g' | tr =
    >> '\n' |
    >>>>>>>>       grep -v
    >>>>>>>>       >>>> "apache:" |
    >>>>>>>>       >>>> > sed 's/.*sha.[^"]*["]\([^"]*\).*/\1/g')
    >>>>>>>>       >>>> >    sleep 30 # sleep few moment to wait travis
    >> starts
    >>>>>>>>       the build
    >>>>>>>>       >>>> >    RET_CODE=0
    >>>>>>>>       >>>> >    python ./travis_check.py ${AUTHOR}
    ${COMMIT} ||
    >>>>>>>>       RET_CODE=$?
    >>>>>>>>       >>>> >    if [ $RET_CODE -eq 2 ]; then # try with
    >> repository
    >>>>>>>>       name when
    >>>>>>>>       >>>> travis-ci is
    >>>>>>>>       >>>> > not available in the account
    >>>>>>>>       >>>> >      RET_CODE=0
    >>>>>>>>       >>>> >      AUTHOR=$(curl -s
    >>>>>>>>       >>>>
    https://api.github.com/repos/apache/zeppelin/pulls/$PR
    >>>>>>>>       >>>> > | grep '"full_name":' | grep -v
    "apache/zeppelin" |
    >>>>>>>> sed
    >>>>>>>>       >>>> > 's/.*[:][^"]*["]\([^/]*\).*/\1/g')
    >>>>>>>>       >>>> >    python ./travis_check.py ${AUTHOR}
    ${COMMIT} ||
    >>>>>>>>       RET_CODE=$?
    >>>>>>>>       >>>> >    fi
    >>>>>>>>       >>>> >
    >>>>>>>>       >>>> >    if [ $RET_CODE -eq 2 ]; then # fail with
    can't
    >> find
    >>>>>>>>       build
    >>>>>>>>       >>>> information in
    >>>>>>>>       >>>> > the travis
    >>>>>>>>       >>>> >      set +x
    >>>>>>>>       >>>> >      echo
    >>>>>>>>  "-----------------------------------------------------"
    >>>>>>>>       >>>> >      echo "Looks like travis-ci is not
    configured
    >> for
    >>>>>>>>       your fork."
    >>>>>>>>       >>>> >      echo "Please setup by swich on 'zeppelin'
    >>>>>>>>       repository at
    >>>>>>>>       >>>> > https://travis-ci.org/profile and travis-ci."
    >>>>>>>>       >>>> >      echo "And then make sure 'Build branch
    >> updates'
    >>>>>>>>       option is
    >>>>>>>>       >>>> enabled in
    >>>>>>>>       >>>> > the settings
    >>>>>>>> https://travis-ci.org/${AUTHOR}/zeppelin/settings
    <https://travis-ci.org/$%7BAUTHOR%7D/zeppelin/settings>
    >>>>>>>> <https://travis-ci.org/$%7BAUTHOR%7D/zeppelin/settings>
    >>>>>>>>       >>>>
    <https://travis-ci.org/$%7BAUTHOR%7D/zeppelin/settings>."
    >>>>>>>>       >>>> >      echo ""
    >>>>>>>>       >>>> >      echo "To trigger CI after setup, you
    will need
    >>>>>>>>       ammend your
    >>>>>>>>       >>>> last commit
    >>>>>>>>       >>>> > with"
    >>>>>>>>       >>>> >      echo "git commit --amend"
    >>>>>>>>       >>>> >      echo "git push your-remote HEAD --force"
    >>>>>>>>       >>>> >      echo ""
    >>>>>>>>       >>>> >      echo "See
    >>>>>>>>       >>>> >
    >>>>>>>>       >>>>
    >>>>>>>>       >>
    >>>>>>>>
    >>
    
http://zeppelin.apache.org/contribution/contributions.html#continuous-integration
    >>>>>>>>       >>>> > ."
    >>>>>>>>       >>>> >    fi
    >>>>>>>>       >>>> >
    >>>>>>>>       >>>> >    exit $RET_CODE
    >>>>>>>>       >>>> > else
    >>>>>>>>       >>>> >    set +x
    >>>>>>>>       >>>> >    echo "travis_check.py does not exists"
    >>>>>>>>       >>>> >    exit 1
    >>>>>>>>       >>>> > fi
    >>>>>>>>       >>>> >
    >>>>>>>>       >>>> > Chesnay Schepler <ches...@apache.org
    <mailto:ches...@apache.org>
    >>>>>>>>       <mailto:ches...@apache.org <mailto:ches...@apache.org>>
    >>>>>>>>       >>>> <mailto:ches...@apache.org
    <mailto:ches...@apache.org> <mailto:
    >> ches...@apache.org <mailto:ches...@apache.org>
    >>>>>>>>       于2019年6月29日周六 下午3:17写道:
    >>>>>>>>       >>>> >
    >>>>>>>>       >>>> >> Does this imply that a Jenkins job is active as
    >> long
    >>>>>>>>       as the
    >>>>>>>>       >>>> Travis build
    >>>>>>>>       >>>> >> runs?
    >>>>>>>>       >>>> >>
    >>>>>>>>       >>>> >> On 26/06/2019 21:28, Bowen Li wrote:
    >>>>>>>>       >>>> >>> Hi,
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>> @Dawid, I think the "long test running" as I
    >>>>>>>>       mentioned in the
    >>>>>>>>       >>>> first
    >>>>>>>>       >>>> >> email,
    >>>>>>>>       >>>> >>> also as you guys said, belongs to "a big
    effort
    >>>>>>>>       which is much
    >>>>>>>>       >>>> harder to
    >>>>>>>>       >>>> >>> accomplish in a short period of time and may
    >> deserve
    >>>>>>>>       its own
    >>>>>>>>       >>>> separate
    >>>>>>>>       >>>> >>> discussion". Thus I didn't include it in
    what we
    >> can
    >>>>>>>>       do in a
    >>>>>>>>       >>>> foreseeable
    >>>>>>>>       >>>> >>> short term.
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>> Besides, I don't think that's the ultimate
    reason
    >>>>>>>>       for lack of
    >>>>>>>>       >>>> build
    >>>>>>>>       >>>> >>> resources. Even if the build is shortened to
    >>>>>>>>       something like
    >>>>>>>>       >>>> 2h, the
    >>>>>>>>       >>>> >>> problems of no build machine works about 6 or
    >> more
    >>>>>>>>       hours in
    >>>>>>>>       >>>> PST daytime
    >>>>>>>>       >>>> >>> that I described will still happen, because no
    >>>>>>>>       machine from
    >>>>>>>>       >>>> ASF INFRA's
    >>>>>>>>       >>>> >>> pool is allocated to Flink. As I have paid
    close
    >>>>>>>>       attention to
    >>>>>>>>       >>>> the build
    >>>>>>>>       >>>> >>> queue in the past few weekdays, it's a pretty
    >> clear
    >>>>>>>>       pattern now.
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>> **The ultimate root cause** for that is - we
    >> don't
    >>>>>>>>       have any
    >>>>>>>>       >>>> **dedicated**
    >>>>>>>>       >>>> >>> build resources that we can stably rely
    on. I'm
    >>>>>>>>       actually ok to
    >>>>>>>>       >>>> wait for a
    >>>>>>>>       >>>> >>> long time if there are build requests
    running, it
    >>>>>>>>       means at
    >>>>>>>>       >>>> least we are
    >>>>>>>>       >>>> >>> making progress. But I'm not ok with no build
    >>>>>>>>       resource. A
    >>>>>>>>       >>>> better place I
    >>>>>>>>       >>>> >>> think we should aim at in short term is to
    always
    >>>>>>>>       have at
    >>>>>>>>       >>>> least a central
    >>>>>>>>       >>>> >>> pool (can be 3 or 5) of machines dedicated to
    >> build
    >>>>>>>>       Flink at
    >>>>>>>>       >>>> any time, or
    >>>>>>>>       >>>> >>> maybe use users resources.
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>> @Chesnay @Robert I synced with Jeff
    offline that
    >>>>>>>>       Zeppelin
    >>>>>>>>       >>>> community is
    >>>>>>>>       >>>> >>> using a Jenkins job to automatically build on
    >> users'
    >>>>>>>>       travis
    >>>>>>>>       >>>> account and
    >>>>>>>>       >>>> >>> link the result back to github PR. I guess the
    >>>>>>>>       Jenkins job
    >>>>>>>>       >>>> would fetch
    >>>>>>>>       >>>> >>> latest upstream master and build the PR
    against
    >> it.
    >>>>>>>>       Jeff has
    >>>>>>>>       >>>> filed
    >>>>>>>>       >>>> >> tickets
    >>>>>>>>       >>>> >>> to learn and get access to the Jenkins infra.
    >> It'll
    >>>>>>>>       better to
    >>>>>>>>       >>>> fully
    >>>>>>>>       >>>> >>> understand it first before judging this
    approach.
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>> I also heard good things about CircleCI,
    and ASF
    >>>>>>>>       INFRA seems
    >>>>>>>>       >>>> to have a
    >>>>>>>>       >>>> >> pool
    >>>>>>>>       >>>> >>> of build capacity there too. Can be an
    >> alternative
    >>>>>>>>       to consider.
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>> On Wed, Jun 26, 2019 at 12:44 AM Dawid
    >> Wysakowicz <
    >>>>>>>>       >>>> >> dwysakow...@apache.org
    <mailto:dwysakow...@apache.org>
    >>>>>>>>       <mailto:dwysakow...@apache.org
    <mailto:dwysakow...@apache.org>> <mailto:dwysakow...@apache.org
    <mailto:dwysakow...@apache.org>
    >>>>>>>>       <mailto:dwysakow...@apache.org
    <mailto:dwysakow...@apache.org>>>>
    >>>>>>>>       >>>> >>> wrote:
    >>>>>>>>       >>>> >>>
    >>>>>>>>       >>>> >>>> Sorry to jump in late, but I think Bowen
    missed
    >> the
    >>>>>>>>       most
    >>>>>>>>       >>>> important point
    >>>>>>>>       >>>> >>>> from Chesnay's previous message in the
    summary.
    >> The
    >>>>>>>>       ultimate
    >>>>>>>>       >>>> reason for
    >>>>>>>>       >>>> >>>> all the problems is that the tests take close
    >> to 2
    >>>>>>>>       hours to
    >>>>>>>>       >>>> run already.
    >>>>>>>>       >>>> >>>> I fully support this claim: "Unless
    people start
    >>>>>>>>       caring about
    >>>>>>>>       >>>> test times
    >>>>>>>>       >>>> >>>> before adding them, this issue cannot be
    solved"
    >>>>>>>>       >>>> >>>>
    >>>>>>>>       >>>> >>>> This is also another reason why using user's
    >> Travis
    >>>>>>>>       account
    >>>>>>>>       >>>> won't help.
    >>>>>>>>       >>>> >>>> Every few weeks we reach the user's time
    limit
    >> for
    >>>>>>>>       a single
    >>>>>>>>       >>>> profile.
    >>>>>>>>       >>>> >>>> This makes the user's builds simply fail,
    until
    >> we
    >>>>>>>>       either
    >>>>>>>>       >>>> properly
    >>>>>>>>       >>>> >>>> decrease the time the tests take (which I
    am not
    >>>>>>>>       sure we ever
    >>>>>>>>       >>>> did) or
    >>>>>>>>       >>>> >>>> postpone the problem by splitting into more
    >>>>>>>>       profiles. (Note
    >>>>>>>>       >>>> that the ASF
    >>>>>>>>       >>>> >>>> Travis account has higher time limits)
    >>>>>>>>       >>>> >>>>
    >>>>>>>>       >>>> >>>> Best,
    >>>>>>>>       >>>> >>>>
    >>>>>>>>       >>>> >>>> Dawid
    >>>>>>>>       >>>> >>>>
    >>>>>>>>       >>>> >>>> On 26/06/2019 09:36, Robert Metzger wrote:
    >>>>>>>>       >>>> >>>>> Do we know if using "the best" available
    >> hardware
    >>>>>>>>       would
    >>>>>>>>       >>>> improve the
    >>>>>>>>       >>>> >> build
    >>>>>>>>       >>>> >>>>> times?
    >>>>>>>>       >>>> >>>>> Imagine we would run the build on
    machines with
    >>>>>>>>       plenty of
    >>>>>>>>       >>>> main memory
    >>>>>>>>       >>>> >> to
    >>>>>>>>       >>>> >>>>> mount everything to ramdisk + the latest CPU
    >>>>>>>>       architecture?
    >>>>>>>>       >>>> >>>>>
    >>>>>>>>       >>>> >>>>> Throwing hardware at the problem could help
    >> reduce
    >>>>>>>>       the time
    >>>>>>>>       >>>> of an
    >>>>>>>>       >>>> >>>>> individual build, and using our own
    >> infrastructure
    >>>>>>>>       would
    >>>>>>>>       >>>> remove our
    >>>>>>>>       >>>> >>>>> dependency on Apache's Travis account
    (with the
    >>>>>>>>       obvious
    >>>>>>>>       >>>> downside of
    >>>>>>>>       >>>> >>>> having
    >>>>>>>>       >>>> >>>>> to maintain the infrastructure)
    >>>>>>>>       >>>> >>>>> We could use an open source travis
    >> alternative, to
    >>>>>>>>       have a
    >>>>>>>>       >>>> similar
    >>>>>>>>       >>>> >>>>> experience and make the migration easy.
    >>>>>>>>       >>>> >>>>>
    >>>>>>>>       >>>> >>>>>
    >>>>>>>>       >>>> >>>>> On Wed, Jun 26, 2019 at 9:34 AM Chesnay
    >> Schepler
    >>>>>>>>       >>>> <ches...@apache.org
    <mailto:ches...@apache.org> <mailto:ches...@apache.org
    <mailto:ches...@apache.org>>
    >>>>>>>>       <mailto:ches...@apache.org
    <mailto:ches...@apache.org> <mailto:ches...@apache.org
    <mailto:ches...@apache.org>>>>
    >>>>>>>>       >>>> >>>> wrote:
    >>>>>>>>       >>>> >>>>>> >From what I gathered, there's no special
    >>>>>>>>       sauce that the
    >>>>>>>>       >>>> Zeppelin
    >>>>>>>>       >>>> >>>>>> project uses which actually integrates
    a users
    >>>>>>>> Travis
    >>>>>>>>       >>>> account into the
    >>>>>>>>       >>>> >>>> PR.
    >>>>>>>>       >>>> >>>>>> They just disabled Travis for PRs. And
    that's
    >>>>>>>>       kind of it.
    >>>>>>>>       >>>> >>>>>>
    >>>>>>>>       >>>> >>>>>> Naturally we can do this (duh) and safe the
    >> ASF a
    >>>>>>>>       fair
    >>>>>>>>       >>>> amount of
    >>>>>>>>       >>>> >>>>>> resources, but there are downsides:
    >>>>>>>>       >>>> >>>>>>
    >>>>>>>>       >>>> >>>>>> The discoverability of the Travis check
    takes
    >> a
    >>>>>>>>       nose-dive.
    >>>>>>>>       >>>> Either we
    >>>>>>>>       >>>> >>>>>> require every contributor to always, an
    every
    >>>>>>>>       commit, also
    >>>>>>>>       >>>> post a
    >>>>>>>>       >>>> >> Travis
    >>>>>>>>       >>>> >>>>>> build, or we have the reviewer sift through
    >> the
    >>>>>>>>       >>>> contributors account
    >>>>>>>>       >>>> >> to
    >>>>>>>>       >>>> >>>>>> find it.
    >>>>>>>>       >>>> >>>>>>
    >>>>>>>>       >>>> >>>>>> This is rather cumbersome.
    Additionally, it's
    >>>>>>>>       also not
    >>>>>>>>       >>>> equivalent to
    >>>>>>>>       >>>> >>>>>> having a PR build.
    >>>>>>>>       >>>> >>>>>>
    >>>>>>>>       >>>> >>>>>> A normal branch build takes a branch as
    is and
    >>>>>>>>       tests it. A
    >>>>>>>>       >>>> PR build
    >>>>>>>>       >>>> >>>>>> merges the branch into master, and then
    runs
    >> it.
    >>>>>>>>       (Fun fact:
    >>>>>>>>       >>>> This is
    >>>>>>>>       >>>> >> why
    >>>>>>>>       >>>> >>>>>> a PR without merge conflicts is not
    being run
    >> on
    >>>>>>>>       Travis.)
    >>>>>>>>       >>>> >>>>>>
    >>>>>>>>       >>>> >>>>>> And ultimately, everyone can already
    make use
    >>>>>>>> of this
    >>>>>>>>       >>>> approach anyway.
    >>>>>>>>       >>>> >>>>>>
    >>>>>>>>       >>>> >>>>>> On 25/06/2019 08:02, Jark Wu wrote:
    >>>>>>>>       >>>> >>>>>>> Hi Jeff,
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>> Thanks for sharing the Zeppelin
    approach. I
    >>>>>>>>       think it's a
    >>>>>>>>       >>>> good idea to
    >>>>>>>>       >>>> >>>>>>> leverage user's travis account.
    >>>>>>>>       >>>> >>>>>>> In this way, we can have almost unlimited
    >>>>>>>>       concurrent build
    >>>>>>>>       >>>> jobs and
    >>>>>>>>       >>>> >>>>>>> developers can restart build by themselves
    >>>>>>>>       (currently only
    >>>>>>>>       >>>> committers
    >>>>>>>>       >>>> >>>>>>> can restart PR's build).
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>> But I'm still not very clear how to
    integrate
    >>>>>>>> user's
    >>>>>>>>       >>>> travis build
    >>>>>>>>       >>>> >> into
    >>>>>>>>       >>>> >>>>>>> the Flink pull request's build
    automatically.
    >>>>>>>>       Can you
    >>>>>>>>       >>>> explain more in
    >>>>>>>>       >>>> >>>>>>> detail?
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>> Another question: does travis only build
    >>>>>>>>       branches for user
    >>>>>>>>       >>>> account?
    >>>>>>>>       >>>> >>>>>>> My concern is that builds for PRs will
    rebase
    >>>>>>>> user's
    >>>>>>>>       >>>> commits against
    >>>>>>>>       >>>> >>>>>>> current master branch.
    >>>>>>>>       >>>> >>>>>>> This will help us to find problems before
    >>>>>>>>       merge.  Builds
    >>>>>>>>       >>>> for branches
    >>>>>>>>       >>>> >>>>>>> will lose the impact of new commits in
    >> master.
    >>>>>>>>       >>>> >>>>>>> How does Zeppelin solve this problem?
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>> Thanks again for sharing the idea.
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>> Regards,
    >>>>>>>>       >>>> >>>>>>> Jark
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>> On Tue, 25 Jun 2019 at 11:01, Jeff Zhang
    >>>>>>>>       <zjf...@gmail.com <mailto:zjf...@gmail.com>
    <mailto:zjf...@gmail.com <mailto:zjf...@gmail.com>>
    >>>>>>>>       >>>> <mailto:zjf...@gmail.com
    <mailto:zjf...@gmail.com> <mailto:zjf...@gmail.com
    <mailto:zjf...@gmail.com>>>
    >>>>>>>>       >>>> >>>>>>> <mailto:zjf...@gmail.com
    <mailto:zjf...@gmail.com>
    >>>>>>>>       <mailto:zjf...@gmail.com <mailto:zjf...@gmail.com>>
    <mailto:zjf...@gmail.com <mailto:zjf...@gmail.com>
    >>>>>>>>       <mailto:zjf...@gmail.com
    <mailto:zjf...@gmail.com>>>>> wrote:
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>>  Hi Folks,
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>>  Zeppelin meet this kind of issue
    before, we
    >>>>>>>> solve
    >>>>>>>>       >>>> it by
    >>>>>>>>       >>>> >> delegating
    >>>>>>>>       >>>> >>>>>>>  each
    >>>>>>>>       >>>> >>>>>>>  one's PR build to his travis account
    >>>>>>>>       (Everyone can
    >>>>>>>>       >>>> have 5 free
    >>>>>>>>       >>>> >>>>>>>  slot for
    >>>>>>>>       >>>> >>>>>>>  travis build).
    >>>>>>>>       >>>> >>>>>>>  Apache account travis build is only
    >>>>>>>> triggered when
    >>>>>>>>       >>>> PR is merged.
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>>  Kurt Young <ykt...@gmail.com
    <mailto:ykt...@gmail.com>
    >>>>>>>>       <mailto:ykt...@gmail.com <mailto:ykt...@gmail.com>>
    >>>>>>>>       >>>> <mailto:ykt...@gmail.com
    <mailto:ykt...@gmail.com> <mailto:ykt...@gmail.com
    <mailto:ykt...@gmail.com>>>
    >>>>>>>>       <mailto:ykt...@gmail.com <mailto:ykt...@gmail.com>
    <mailto:ykt...@gmail.com <mailto:ykt...@gmail.com>>
    >>>>>>>>       >>>> <mailto:ykt...@gmail.com
    <mailto:ykt...@gmail.com> <mailto:ykt...@gmail.com
    <mailto:ykt...@gmail.com>
    >>>>>>>>       >>>> >>>>>>>  于2019年6月25日周二 上午10:16写道:
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>>  > (Forgot to cc George)
    >>>>>>>>       >>>> >>>>>>>  >
    >>>>>>>>       >>>> >>>>>>>  > Best,
    >>>>>>>>       >>>> >>>>>>>  > Kurt
    >>>>>>>>       >>>> >>>>>>>  >
    >>>>>>>>       >>>> >>>>>>>  >
    >>>>>>>>       >>>> >>>>>>>  > On Tue, Jun 25, 2019 at 10:16 AM Kurt
    >> Young
    >>>>>>>>       >>>> <ykt...@gmail.com <mailto:ykt...@gmail.com>
    <mailto:ykt...@gmail.com <mailto:ykt...@gmail.com>>
    >>>>>>>>       <mailto:ykt...@gmail.com <mailto:ykt...@gmail.com>
    <mailto:ykt...@gmail.com <mailto:ykt...@gmail.com>>>
    >>>>>>>>       >>>> >>>>>>> <mailto:ykt...@gmail.com
    <mailto:ykt...@gmail.com>
    >>>>>>>>       <mailto:ykt...@gmail.com <mailto:ykt...@gmail.com>>
    <mailto:ykt...@gmail.com <mailto:ykt...@gmail.com>
    >>>>>>>>       <mailto:ykt...@gmail.com <mailto:ykt...@gmail.com>>>>>
    >>>>>>>>       >>>> wrote:
    >>>>>>>>       >>>> >>>>>>>  >
    >>>>>>>>       >>>> >>>>>>>  > > Hi Bowen,
    >>>>>>>>       >>>> >>>>>>>  > >
    >>>>>>>>       >>>> >>>>>>>  > > Thanks for bringing this up. We
    >>>>>>>>       actually have
    >>>>>>>>       >>>> discussed
    >>>>>>>>       >>>> >> about
    >>>>>>>>       >>>> >>>>>>>  this, and I
    >>>>>>>>       >>>> >>>>>>>  > > think Till and George have
    >>>>>>>>       >>>> >>>>>>>  > > already spend sometime investigating
    >>>>>>>>       it. I have
    >>>>>>>>       >>>> cced both of
    >>>>>>>>       >>>> >>>>>>>  them, and
    >>>>>>>>       >>>> >>>>>>>  > > maybe they can share
    >>>>>>>>       >>>> >>>>>>>  > > their findings.
    >>>>>>>>       >>>> >>>>>>>  > >
    >>>>>>>>       >>>> >>>>>>>  > > Best,
    >>>>>>>>       >>>> >>>>>>>  > > Kurt
    >>>>>>>>       >>>> >>>>>>>  > >
    >>>>>>>>       >>>> >>>>>>>  > >
    >>>>>>>>       >>>> >>>>>>>  > > On Tue, Jun 25, 2019 at 10:08 AM
    Jark Wu
    >>>>>>>>       >>>> <imj...@gmail.com <mailto:imj...@gmail.com>
    <mailto:imj...@gmail.com <mailto:imj...@gmail.com>>
    >>>>>>>>       <mailto:imj...@gmail.com <mailto:imj...@gmail.com>
    <mailto:imj...@gmail.com <mailto:imj...@gmail.com>>>
    >>>>>>>>       >>>> >>>>>>> <mailto:imj...@gmail.com
    <mailto:imj...@gmail.com>
    >>>>>>>>       <mailto:imj...@gmail.com <mailto:imj...@gmail.com>>
    <mailto:imj...@gmail.com <mailto:imj...@gmail.com>
    >>>>>>>>       <mailto:imj...@gmail.com <mailto:imj...@gmail.com>>>>>
    >>>>>>>>       >>>> wrote:
    >>>>>>>>       >>>> >>>>>>>  > >
    >>>>>>>>       >>>> >>>>>>>  > >> Hi Bowen,
    >>>>>>>>       >>>> >>>>>>>  > >>
    >>>>>>>>       >>>> >>>>>>>  > >> Thanks for bringing this. We also
    >>>>>>>>       suffered from
    >>>>>>>>       >>>> the long
    >>>>>>>>       >>>> >>>>>>>  build time.
    >>>>>>>>       >>>> >>>>>>>  > >> I agree that we should focus on
    >>>>>>>>       solving build
    >>>>>>>>       >>>> capacity
    >>>>>>>>       >>>> >>>>>>>  problem in the
    >>>>>>>>       >>>> >>>>>>>  > >> thread.
    >>>>>>>>       >>>> >>>>>>>  > >>
    >>>>>>>>       >>>> >>>>>>>  > >> My observation is there is only one
    >>>>>>>>       build is
    >>>>>>>>       >>>> running, all
    >>>>>>>>       >>>> >> the
    >>>>>>>>       >>>> >>>>>>>  others
    >>>>>>>>       >>>> >>>>>>>  > >> (other
    >>>>>>>>       >>>> >>>>>>>  > >> PRs, master) are pending.
    >>>>>>>>       >>>> >>>>>>>  > >> The pricing plan[1] of travis shows
    >>>>>>>>       it can
    >>>>>>>>       >>>> support
    >>>>>>>>       >>>> >> concurrent
    >>>>>>>>       >>>> >>>>>>>  build
    >>>>>>>>       >>>> >>>>>>>  > jobs.
    >>>>>>>>       >>>> >>>>>>>  > >> But I don't know which plan we are
    >>>>>>>>       using, might
    >>>>>>>>       >>>> be the free
    >>>>>>>>       >>>> >>>>>>>  plan for
    >>>>>>>>       >>>> >>>>>>>  > open
    >>>>>>>>       >>>> >>>>>>>  > >> source.
    >>>>>>>>       >>>> >>>>>>>  > >>
    >>>>>>>>       >>>> >>>>>>>  > >> I cc-ed Chesnay who may have some
    >>>>>>>>       experience on
    >>>>>>>>       >>>> Travis.
    >>>>>>>>       >>>> >>>>>>>  > >>
    >>>>>>>>       >>>> >>>>>>>  > >> Regards,
    >>>>>>>>       >>>> >>>>>>>  > >> Jark
    >>>>>>>>       >>>> >>>>>>>  > >>
    >>>>>>>>       >>>> >>>>>>>  > >> [1]: https://travis-ci.com/plans
    >>>>>>>>       >>>> >>>>>>>  > >>
    >>>>>>>>       >>>> >>>>>>>  > >> On Tue, 25 Jun 2019 at 08:11,
    Bowen Li
    >> <
    >>>>>>>>       >>>> >> bowenl...@gmail.com
    <mailto:bowenl...@gmail.com> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>>
    >>>>>>>>       <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>>>
    >>>>>>>>       >>>> >>>>>>> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>
    >>>>>>>>       <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>>
    >>>>>>>>       >>>> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>
    >>>>>>>>       <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>>>>> wrote:
    >>>>>>>>       >>>> >>>>>>>  > >>
    >>>>>>>>       >>>> >>>>>>>  > >> > Hi Steven,
    >>>>>>>>       >>>> >>>>>>>  > >> >
    >>>>>>>>       >>>> >>>>>>>  > >> > I think you may not read what I
    >>>>>>>>       wrote. The
    >>>>>>>>       >>>> discussion is
    >>>>>>>>       >>>> >>>> about
    >>>>>>>>       >>>> >>>>>>>  > "unstable
    >>>>>>>>       >>>> >>>>>>>  > >> > build **capacity**", in
    another word
    >>>>>>>>       >>>> "unstable / lack of
    >>>>>>>>       >>>> >>>> build
    >>>>>>>>       >>>> >>>>>>>  > >> resources",
    >>>>>>>>       >>>> >>>>>>>  > >> > not "unstable build".
    >>>>>>>>       >>>> >>>>>>>  > >> >
    >>>>>>>>       >>>> >>>>>>>  > >> > On Mon, Jun 24, 2019 at 4:40 PM
    >>>>>>>>       Steven Wu
    >>>>>>>>       >>>> >>>>>>>  <stevenz...@gmail.com
    <mailto:stevenz...@gmail.com>
    >>>>>>>>       <mailto:stevenz...@gmail.com
    <mailto:stevenz...@gmail.com>> <mailto:stevenz...@gmail.com
    <mailto:stevenz...@gmail.com>
    >>>>>>>>       <mailto:stevenz...@gmail.com
    <mailto:stevenz...@gmail.com>>>
    >>>>>>>>       >>>> <mailto:stevenz...@gmail.com
    <mailto:stevenz...@gmail.com>
    >>>>>>>>       <mailto:stevenz...@gmail.com
    <mailto:stevenz...@gmail.com>> <mailto:stevenz...@gmail.com
    <mailto:stevenz...@gmail.com>
    >>>>>>>>       <mailto:stevenz...@gmail.com
    <mailto:stevenz...@gmail.com>>>>>
    >>>>>>>>       >>>> >>>>>>>  > wrote:
    >>>>>>>>       >>>> >>>>>>>  > >> >
    >>>>>>>>       >>>> >>>>>>>  > >> > > long and sometimes unstable
    build
    >> is
    >>>>>>>>       >>>> definitely a pain
    >>>>>>>>       >>>> >>>>>> point.
    >>>>>>>>       >>>> >>>>>>>  > >> > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > I suspect the build failure
    here in
    >>>>>>>>       >>>> >> flink-connector-kafka
    >>>>>>>>       >>>> >>>>>>>  is not
    >>>>>>>>       >>>> >>>>>>>  > >> related
    >>>>>>>>       >>>> >>>>>>>  > >> > to
    >>>>>>>>       >>>> >>>>>>>  > >> > > my change. but there is no easy
    >>>>>>>>       re-run the
    >>>>>>>>       >>>> build on
    >>>>>>>>       >>>> >>>>>>>  travis UI.
    >>>>>>>>       >>>> >>>>>>>  > Google
    >>>>>>>>       >>>> >>>>>>>  > >> > > search showed a trick of
    >>>>>>>>       close-and-open the
    >>>>>>>>       >>>> PR will
    >>>>>>>>       >>>> >>>>>>>  trigger rebuild.
    >>>>>>>>       >>>> >>>>>>>  > >> but
    >>>>>>>>       >>>> >>>>>>>  > >> > > that could add noises to the PR
    >>>>>>>>       activities.
    >>>>>>>>       >>>> >>>>>>>  > >> > >
    >>>>>>>>       >>>> https://travis-ci.org/apache/flink/jobs/545555519
    >>>>>>>>       >>>> >>>>>>>  > >> > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > travis-ci for my personal repo
    >>>>>>>>       often failed
    >>>>>>>>       >>>> with
    >>>>>>>>       >>>> >>>>>>>  exceeding time
    >>>>>>>>       >>>> >>>>>>>  > limit
    >>>>>>>>       >>>> >>>>>>>  > >> > after
    >>>>>>>>       >>>> >>>>>>>  > >> > > 4+ hours.
    >>>>>>>>       >>>> >>>>>>>  > >> > > The job exceeded the maximum
    time
    >>>>>>>>       limit for
    >>>>>>>>       >>>> jobs, and
    >>>>>>>>       >>>> >> has
    >>>>>>>>       >>>> >>>>>>>  been
    >>>>>>>>       >>>> >>>>>>>  > >> > terminated.
    >>>>>>>>       >>>> >>>>>>>  > >> > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > On Mon, Jun 24, 2019 at 4:15 PM
    >>>>>>>>       Bowen Li
    >>>>>>>>       >>>> >>>>>>>  <bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>
    >>>>>>>>       <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>
    >>>>>>>>       <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>>>
    >>>>>>>>       >>>> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com> <mailto:
    >> bowenl...@gmail.com <mailto:bowenl...@gmail.com>
    >>>>>>>>       <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>>>>>
    >>>>>>>>       >>>> >>>>>>>  > wrote:
    >>>>>>>>       >>>> >>>>>>>  > >> > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > >
    >>>>>>>>       >>>>
    https://travis-ci.org/apache/flink/builds/549681530
    >>>>>>>>       >>>> >>>>>>>  This build
    >>>>>>>>       >>>> >>>>>>>  > >> > request
    >>>>>>>>       >>>> >>>>>>>  > >> > > > has
    >>>>>>>>       >>>> >>>>>>>  > >> > > > been sitting at **HEAD of the
    >>>>>>>>       queue**
    >>>>>>>>       >>>> since I first
    >>>>>>>>       >>>> >> saw
    >>>>>>>>       >>>> >>>>>>>  it at PST
    >>>>>>>>       >>>> >>>>>>>  > >> > 10:30am
    >>>>>>>>       >>>> >>>>>>>  > >> > > > (not sure how long it's been
    >>>>>>>>       there before
    >>>>>>>>       >>>> 10:30am).
    >>>>>>>>       >>>> >>>>>>>  It's PST
    >>>>>>>>       >>>> >>>>>>>  > 4:12pm
    >>>>>>>>       >>>> >>>>>>>  > >> now
    >>>>>>>>       >>>> >>>>>>>  > >> > > and
    >>>>>>>>       >>>> >>>>>>>  > >> > > > it hasn't started yet.
    >>>>>>>>       >>>> >>>>>>>  > >> > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > On Mon, Jun 24, 2019 at
    2:48 PM
    >>>>>>>>       Bowen Li
    >>>>>>>>       >>>> >>>>>>>  <bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>
    >>>>>>>>       <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>
    >>>>>>>>       <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>>>
    >>>>>>>>       >>>> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com> <mailto:
    >> bowenl...@gmail.com <mailto:bowenl...@gmail.com>
    >>>>>>>>       <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com> <mailto:bowenl...@gmail.com
    <mailto:bowenl...@gmail.com>>>>>
    >>>>>>>>       >>>> >>>>>>>  > >> wrote:
    >>>>>>>>       >>>> >>>>>>>  > >> > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > Hi devs,
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > I've been experiencing
    the pain
    >>>>>>>>       >>>> resulting from lack
    >>>>>>>>       >>>> >>>>>>>  of stable
    >>>>>>>>       >>>> >>>>>>>  > >> build
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > capacity on Travis for Flink
    >>>>>>>>       PRs [1].
    >>>>>>>>       >>>> >> Specifically, I
    >>>>>>>>       >>>> >>>>>>>  noticed
    >>>>>>>>       >>>> >>>>>>>  > >> often
    >>>>>>>>       >>>> >>>>>>>  > >> > > that
    >>>>>>>>       >>>> >>>>>>>  > >> > > > no
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > build in the queue is making
    >> any
    >>>>>>>>       >>>> progress for
    >>>>>>>>       >>>> >> hours,
    >>>>>>>>       >>>> >>>> and
    >>>>>>>>       >>>> >>>>>>>  > suddenly
    >>>>>>>>       >>>> >>>>>>>  > >> 5
    >>>>>>>>       >>>> >>>>>>>  > >> > or
    >>>>>>>>       >>>> >>>>>>>  > >> > > 6
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > builds kick off all together
    >>>>>>>>       after the
    >>>>>>>>       >>>> long pause.
    >>>>>>>>       >>>> >>>>>>>  I'm at PST
    >>>>>>>>       >>>> >>>>>>>  > >> > (UTC-08)
    >>>>>>>>       >>>> >>>>>>>  > >> > > > time
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > zone, and I've seen
    pause can
    >>>>>>>>       be as
    >>>>>>>>       >>>> long as 6 hours
    >>>>>>>>       >>>> >>>>>>>  from PST 9am
    >>>>>>>>       >>>> >>>>>>>  > >> to
    >>>>>>>>       >>>> >>>>>>>  > >> > 3pm
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > (let alone the time
    needed to
    >>>>>>>>       drain the
    >>>>>>>>       >>>> queue
    >>>>>>>>       >>>> >>>>>>>  afterwards).
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > I think this has greatly
    >>>>>>>>       impacted our
    >>>>>>>>       >>>> productivity.
    >>>>>>>>       >>>> >>>> I've
    >>>>>>>>       >>>> >>>>>>>  > >> experienced
    >>>>>>>>       >>>> >>>>>>>  > >> > > that
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > PRs submitted in the early
    >>>>>>>>       morning of
    >>>>>>>>       >>>> PST time zone
    >>>>>>>>       >>>> >>>>>>>  won't finish
    >>>>>>>>       >>>> >>>>>>>  > >> > their
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > build until late night
    of the
    >>>>>>>>       same day.
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > So my questions are:
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > - Has anyone else
    experienced
    >>>>>>>>       the same
    >>>>>>>>       >>>> problem or
    >>>>>>>>       >>>> >>>>>>>  have similar
    >>>>>>>>       >>>> >>>>>>>  > >> > > > observation
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > on TravisCI? (I suspect it
    >>>>>>>>       has things
    >>>>>>>>       >>>> to do with
    >>>>>>>>       >>>> >> time
    >>>>>>>>       >>>> >>>>>>>  zone)
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > - What pricing plan of
    >>>>>>>>       TravisCI is
    >>>>>>>>       >>>> Flink currently
    >>>>>>>>       >>>> >>>>>>>  using? Is it
    >>>>>>>>       >>>> >>>>>>>  > >> the
    >>>>>>>>       >>>> >>>>>>>  > >> > > free
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > plan for open source
    >>>>>>>>       projects? What
    >>>>>>>>       >>>> are the
    >>>>>>>>       >>>> >>>>>>>  guaranteed build
    >>>>>>>>       >>>> >>>>>>>  > >> capacity
    >>>>>>>>       >>>> >>>>>>>  > >> > > of
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > the current plan?
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > - If the current pricing
    plan
    >>>>>>>>       (either
    >>>>>>>>       >>>> free or paid)
    >>>>>>>>       >>>> >>>>>> can't
    >>>>>>>>       >>>> >>>>>>>  > provide
    >>>>>>>>       >>>> >>>>>>>  > >> > > stable
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > build capacity, can we
    >>>>>>>>       upgrade to a
    >>>>>>>>       >>>> higher priced
    >>>>>>>>       >>>> >>>>>>>  plan with
    >>>>>>>>       >>>> >>>>>>>  > larger
    >>>>>>>>       >>>> >>>>>>>  > >> > and
    >>>>>>>>       >>>> >>>>>>>  > >> > > > more
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > stable build capacity?
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > BTW, another factor that
    >>>>>>>>       contribute to
    >>>>>>>>       >>>> the
    >>>>>>>>       >>>> >>>>>>>  productivity problem
    >>>>>>>>       >>>> >>>>>>>  > is
    >>>>>>>>       >>>> >>>>>>>  > >> > that
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > our build is slow - we run
    >>>>>>>>       full build
    >>>>>>>>       >>>> for every PR
    >>>>>>>>       >>>> >>>> and a
    >>>>>>>>       >>>> >>>>>>>  > >> successful
    >>>>>>>>       >>>> >>>>>>>  > >> > > full
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > build takes ~5h. We
    >>>>>>>>       definitely have
    >>>>>>>>       >>>> more options to
    >>>>>>>>       >>>> >>>>>>>  solve it,
    >>>>>>>>       >>>> >>>>>>>  > for
    >>>>>>>>       >>>> >>>>>>>  > >> > > > instance,
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > modularize the build graphs
    >>>>>>>>       and reuse
    >>>>>>>>       >>>> artifacts
    >>>>>>>>       >>>> >> from
    >>>>>>>>       >>>> >>>> the
    >>>>>>>>       >>>> >>>>>>>  > previous
    >>>>>>>>       >>>> >>>>>>>  > >> > > build.
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > But I think that can be
    a big
    >>>>>>>>       effort
    >>>>>>>>       >>>> which is much
    >>>>>>>>       >>>> >>>>>>>  harder to
    >>>>>>>>       >>>> >>>>>>>  > >> > accomplish
    >>>>>>>>       >>>> >>>>>>>  > >> > > > in
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > a short period of time and
    >>>>>>>>       may deserve
    >>>>>>>>       >>>> its own
    >>>>>>>>       >>>> >>>> separate
    >>>>>>>>       >>>> >>>>>>>  > >> discussion.
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > > [1]
    >>>>>>>>       >>>> >>
    https://travis-ci.org/apache/flink/pull_requests
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > > >
    >>>>>>>>       >>>> >>>>>>>  > >> > >
    >>>>>>>>       >>>> >>>>>>>  > >> >
    >>>>>>>>       >>>> >>>>>>>  > >>
    >>>>>>>>       >>>> >>>>>>>  > >
    >>>>>>>>       >>>> >>>>>>>  >
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>>  --
    >>>>>>>>       >>>> >>>>>>>  Best Regards
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>>>>>>  Jeff Zhang
    >>>>>>>>       >>>> >>>>>>>
    >>>>>>>>       >>>> >>
    >>>>>>>>       >>>>
    >>>>>>>>       >>>
    >>>>>>>>       >>
    >>>>>>>>
    >>


Reply via email to