Thanks for the active discussion on the document for the new CI for MXNet.
Now that many of you have reviewed it, do you think I should start a vote
on which framework the community wants to move forward with ?

Thanks,
Meghna

On Mon, Nov 6, 2017 at 6:59 PM, Chris Olivier <cjolivie...@gmail.com> wrote:

> After a decision is reached, i am willing to add tasks to Apache MXNet JIRA
>
> On Mon, Nov 6, 2017 at 6:15 AM, Pedro Larroy <pedro.larroy.li...@gmail.com
> >
> wrote:
>
> > Thanks for setting up the document guys, looks like a solid basis to
> > start to work on!
> >
> > Marco, Kellen and I have already added some comments.
> >
> > Pedro
> >
> >
> > On Sun, Nov 5, 2017 at 3:43 AM, Meghna Baijal
> > <meghnabaijal2...@gmail.com> wrote:
> > > Kellen, Thank you for your comments in the doc.
> > > Sure Steffen, I will continue to merge everyone’s comments into the doc
> > and
> > > work with Pedro to finalize it.
> > > And then we can vote on the options.
> > >
> > > Thanks,
> > > Meghna Baijal
> > >
> > >
> > > On Sat, Nov 4, 2017 at 6:34 AM, Steffen Rochel <
> steffenroc...@gmail.com>
> > > wrote:
> > >
> > >> Sandeep and Meghna have been working in background collecting input
> and
> > >> preparing a doc. I suggest to drive discussion forward and would like
> to
> > >> ask everybody to contribute to
> > >> https://docs.google.com/document/d/17PEasQ2VWrXi2Cf7IGZSWGZMawxDk
> > >> dlavUDASzUmLjk/edit?usp=sharing
> > >>
> > >> Lets converge on requirements and architecture, so we can move forward
> > with
> > >> implementation.
> > >>
> > >> I would like to suggest for Pedro  and Meghna to lead the discussion
> and
> > >> help to resolve suggestions.
> > >>
> > >> I assume we need a vote once we are converged on a good draft to call
> > it a
> > >> plan and move forward with implementation. As we all are unhappy with
> > the
> > >> current CI situation I would also suggest a phased approach, so we can
> > get
> > >> back to reliable and efficient basic CI quickly and add advanced
> > >> capabilities over time.
> > >>
> > >> Steffen
> > >>
> > >> On Wed, Nov 1, 2017 at 1:14 PM kellen sunderland <
> > >> kellen.sunderl...@gmail.com> wrote:
> > >>
> > >> > Hey Henri, I think that's what a few of us are advocating.  Running
> a
> > set
> > >> > of quick tests as part of the PR process, and then a more detailed
> > >> > regression test suite periodically (say every 4 hours). This fits
> > nicely
> > >> > into a tagging or 2 branch development system.  Commits will be
> tagged
> > >> (or
> > >> > merged into a stable branch) as soon as they pass the detailed
> > regression
> > >> > testing.
> > >> >
> > >> > On Wed, Nov 1, 2017 at 9:07 PM, Hen <bay...@apache.org> wrote:
> > >> >
> > >> > > Random question - can the CI be split such that the Apache CI is
> > doing
> > >> a
> > >> > > basic set of checks on that hardware, and is hooked to a PR, while
> > >> there
> > >> > is
> > >> > > a larger "Is trunk good for release?" test that is running
> > periodically
> > >> > > rather than on every PR?
> > >> > >
> > >> > > ie: do we need each PR to be run on varied hardware, or can we
> have
> > >> this
> > >> > > two tier approach?
> > >> > >
> > >> > > Hen
> > >> > >
> > >> > > On Fri, Oct 20, 2017 at 1:01 PM, sandeep krishnamurthy <
> > >> > > sandeep.krishn...@gmail.com> wrote:
> > >> > >
> > >> > > > Hello all,
> > >> > > >
> > >> > > > I am hereby opening up a discussion thread on how we can
> stabilize
> > >> > Apache
> > >> > > > MXNet CI build system.
> > >> > > >
> > >> > > > Problems:
> > >> > > >
> > >> > > > ========
> > >> > > >
> > >> > > > Recently, we have seen following issues with Apache MXNet CI
> build
> > >> > > systems:
> > >> > > >
> > >> > > >    1. Apache Jenkins master is overloaded and we see issues
> like -
> > >> > unable
> > >> > > >    to trigger builds, difficult to load and view the blue ocean
> > and
> > >> > other
> > >> > > >    Jenkins build status page.
> > >> > > >    2. We are generating too many request/interaction on Apache
> > Infra
> > >> > > team.
> > >> > > >       1. Addition/deletion of new slave: Caused from scaling
> > >> activity,
> > >> > > >       recycling, troubleshooting or any actions leading to
> change
> > of
> > >> > > slave
> > >> > > >       machines.
> > >> > > >       2. Plugins / other Jenkins Master configurations.
> > >> > > >       3. Experimentation on CI pipelines.
> > >> > > >    3. Harder to debug and resolve issues - Since access to
> master
> > and
> > >> > > slave
> > >> > > >    is not with the same community, it requires Infra and
> > community to
> > >> > > dive
> > >> > > >    deep together on all action items.
> > >> > > >
> > >> > > > Possible Solutions:
> > >> > > >
> > >> > > > ==============
> > >> > > >
> > >> > > >    1. Can we set up a separate Jenkins CI build system for
> Apache
> > >> MXNet
> > >> > > >    outside Apache Infra?
> > >> > > >    2. Can we have a separate Jenkins Master in Apache Infra for
> > >> MXNet?
> > >> > > >    3. Review design of current setup, refine and fill the gaps.
> > >> > > >
> > >> > > > @ Mentors/Infra team/Community:
> > >> > > >
> > >> > > > ==========================
> > >> > > >
> > >> > > > Please provide your suggestions on how we can proceed further
> and
> > >> work
> > >> > on
> > >> > > > stabilizing the CI build systems for MXNet.
> > >> > > >
> > >> > > > Also, if the community decides on separate Jenkins CI build
> > system,
> > >> > what
> > >> > > > important points should be taken care of apart from the below:
> > >> > > >
> > >> > > >    1. Community being able to access the build page for build
> > >> statuses.
> > >> > > >    2. Committers being able to login with apache credentials.
> > >> > > >    3. Hook setup from apache/incubator-mxnet repo to Jenkins
> > master.
> > >> > > >
> > >> > > >
> > >> > > > Irrespective of the solution we come up, I think we should
> > initiate a
> > >> > > > technical design discussion on how to setup the CI build system.
> > >> > > Probably 1
> > >> > > > or 2 pager documents with the architecture and review with Infra
> > and
> > >> > > > community members.
> > >> > > >
> > >> > > > ***There were few proposal and discussion on the slack channel,
> to
> > >> > reach
> > >> > > > wider community members, moving that discussion formally to this
> > >> list.
> > >> > > >
> > >> > > >
> > >> > > > My Proposal: Option 1 - Set up separate Jenkins CI build system.
> > >> > > >
> > >> > > > Thanks,
> > >> > > >
> > >> > > > Sandeep
> > >> > > >
> > >> > > >
> > >> > > >
> > >> > > > --
> > >> > > > Sandeep Krishnamurthy
> > >> > > >
> > >> > >
> > >> >
> > >>
> >
>

Reply via email to