I started a PR to see what it would take to upgrade to Gradle 5.0:
https://github.com/apache/beam/pull/7402

It seems the main blocker is gogradle plugin compatibility for the Go SDK. The gogradle project is actively working on compatibility, so perhaps we could check back in a month or so: https://github.com/gogradle/gogradle/pull/270

I can see from the Gradle build scan that our Java pre-commit is using parallel build but not the build cache: https://scans.gradle.com/s/eglskrakojhrm/switches. I don't think it would be unreasonable to disable parallel build for code coverage if necessary, perhaps in a separate Jenkins job if it significantly increases job time.
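
For a separate coverage job, the Gradle invocation could be as simple as the sketch below (the task names are my assumption of what we would run; --no-parallel and --no-build-cache are standard Gradle command-line flags):

    ./gradlew javaPreCommit jacocoTestReport --no-parallel --no-build-cache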
On Mon, Jan 7, 2019 at 9:56 AM Kenneth Knowles <k...@apache.org> wrote:
I am pretty sure the build cache is not in use on Jenkins. It would have to live somewhere persisted across runs, which I don't think is default. And we'd be seeing much lower build times.
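
If we ever wanted one, the standard mechanism would be a remote cache configured in settings.gradle, along these lines (a sketch only; the endpoint is hypothetical, nothing like it exists today):

    buildCache {
        remote(HttpBuildCache) {
            url = 'https://example.org/gradle-build-cache/'  // hypothetical endpoint
            push = System.getenv('JENKINS_HOME') != null     // only CI populates the cache
        }
    }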
Kenn
On Mon, Jan 7, 2019 at 9:42 AM Mikhail Gryzykhin <mig...@google.com> wrote:
@Jason
I believe that we want to get some coverage report as a PR pre-commit check, so that we can utilize it as part of code review. Removing parallelization increases pre-commit duration greatly (see the big drop on this graph <http://104.154.241.245/d/_TNndF2iz/pre-commit-test-latency?orgId=1&from=now-6M&to=now>), and I believe that's what Michael is worried about.

If we want to run coverage jobs in the background on master, then a separate job is possible.
--Mikhail
Have feedback <http://go/migryz-feedback>?
On Mon, Jan 7, 2019 at 9:33 AM Jason Kuster <jasonkus...@google.com> wrote:
Michael, could we solve that problem by adding an additional build with those features disabled, specifically for generating coverage data over time? Any tradeoffs to doing something like that?
On Mon, Jan 7, 2019 at 8:23 AM Michael Luckey <adude3...@gmail.com> wrote:
Unfortunately I am a bit worried that enabling code coverage will not come for free at the moment. As far as I understand the current state of the build system, we would have to disable either parallel builds or the build cache, both of which I assume are currently enabled on pre-commit runs. I am trying to get some numbers here, but it might well be that someone has already done that evaluation, so I'd be happy to get further insights if someone with deeper knowledge of the setup could jump in here.

This situation will probably be resolved after an upgrade to Gradle 5, but unfortunately that does not seem easily doable right now.
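
If anyone wants to reproduce the numbers, comparing profiled runs along these lines should give a rough idea (the task name is an assumption on my part; --profile writes an HTML timing report under build/reports/profile):

    ./gradlew clean javaPreCommit --parallel --profile
    ./gradlew clean javaPreCommit --no-parallel --profile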
michel
On Mon, Jan 7, 2019 at 4:21 PM Maximilian Michels <m...@apache.org> wrote:
Code coverage would be very useful, regardless of what tool we use to measure it. Especially when reviewing PRs, because it provides a quantitative measure which can help reviewer and contributor decide whether testing is sufficient.

I'd prefer specific tools over a one-size-fits-all tool, but I'm open to whatever works best. The process of setting up the tooling is likely going to be iterative.

@Mikhail If you want to set up anything and present it here, that would be awesome. (Of course discussion can continue here.)
On 03.01.19 19:34, Heejong Lee wrote:
> I think we should also consider the false positive ratio of the tool. Oftentimes deeper analysis easily produces tons of false positives, which makes people less interested in static analysis results because of triaging overheads.
>
> On Thu, Jan 3, 2019 at 4:18 PM Scott Wegner <sc...@apache.org> wrote:
>
> Discussion on software engineering practices and tools tends to gather many opinions :) I suggest breaking this out into a doc to keep the discussion organized.
>
> I appreciate that you've started with a list of requirements. I would add a few:
>
> 6. Analysis results should be integrated into the code review workflow.
> 7. It should also be possible to run analysis and evaluate results locally.
> 8. Analysis rules and thresholds should be easily configurable.
>
> And some thoughts on the previous requirements:
>
> > 2. Tool should keep history of reports.
>
> Seems nice-to-have but not required. I believe the most value is in viewing the delta during code review, and maybe also a snapshot of the overall state of master. If we want trends, we could also import data into s.apache.org/beam-community-metrics
>
> > 4. Tool should incorporate code coverage and static analysis reports. (Or more if applicable)
>
> Is the idea to have a single tool responsible for all code analysis? We currently have a variety of tools running in our build. It would be challenging to find a single tool that aggregates all current (and future) analysis, especially considering the different language ecosystems. Having targeted tools responsible for different pieces allows us to pick and choose what works best for Beam.
>
>
> On Thu, Jan 3, 2019 at 3:43 PM Mikhail Gryzykhin <mig...@google.com> wrote:
>
> Let me summarize and answer the main questions that I see:
> 1. It seems that we do want to have some statistics on coverage and integrate automatic requirements into our build system.
> 2. Implementation is still to be discussed.
>
> Let's talk about implementation further.
>
> My requirements for the choice are:
> 1. Tool should give us an option to deep-dive into findings.
> 2. Tool should keep history of reports.
> 3. Tool should give an option to break the build (allow for hardcoded requirements).
> 4. Tool should incorporate code coverage and static analysis reports. (Or more if applicable)
> 5. Tool should support most or all languages we utilize in Beam.
>
> Let me dive into SonarQube a bit first. (All up to my understanding of how it works.)
> It hits most of the points, potentially with some tweaks. This tool relies on reports generated by common tools. It also tracks history of builds and allows navigating it. Multi-language. I'm still working on figuring out how to configure it, though.
>
> Common thresholds/checks that are suggested by SonarQube:
> Many checks can be applied to new code only. This allows us to leave legacy code alone, but keep all new additions clean and neat(ish).
> Test coverage by line/branch: relies on a Cobertura report. Usually coverage by branch is suggested (all "if" lines should be tested with positive and negative condition results).
> Method complexity: the number of different paths/conditions a method can be invoked with. The suggested maximum is 15. Generally describes how easy it is to test/understand a method.
> Bugs/vulnerabilities: generally the output of FindBugs. Reflects commonly vulnerable/dangerous code that might cause errors, or just errors in code. I believe Sonar allows for custom code analysis as well, but that is not required.
> Technical debt: estimates of how much time it will take to clean up code to make it shiny. Includes code duplication, commented-out code, not following naming conventions, long methods, ifs that can be inverted, public methods that can be private, etc. I'm not familiar with the explicit list, but in my experience the suggestions are usually relevant.
> More on metrics can be found here: https://docs.sonarqube.org/latest/user-guide/metric-definitions/
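>
> For the Gradle side, the wiring would look roughly like this sketch (the org.sonarqube plugin and its property names are from its public docs; the host URL and project key below are placeholders, not a working config):
>
>     // build.gradle
>     plugins {
>         id 'org.sonarqube' version '2.7'
>     }
>
>     sonarqube {
>         properties {
>             property 'sonar.host.url', 'https://builds.apache.org/analysis'  // assumed server
>             property 'sonar.projectKey', 'apache_beam'                       // hypothetical key
>         }
>     }
>
> Running ./gradlew sonarqube would then analyze and upload the reports to the server.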
>
> Suggested alternatives:
>
> https://scan.coverity.com/
> This tool looks great and I'll check more on it. But it has a restriction of 14 or 7 builds per week (not sure how they will classify our project). Also, I'm not sure if we can break pre-commit based on a report from Coverity. Looks good for generating historical data.
>
> https://docs.codecov.io/docs/browser-extension
> I'll check more on this one. Looks great to have it integrated in PRs, although it requires plugin installation by each developer. I don't think it allows breaking builds, and it only does coverage. Am I correct?
>
> Regards,
> --Mikhail
>
> Have feedback <http://go/migryz-feedback>?
>
> On Thu, Jan 3, 2019 at 2:18 PM Kenneth Knowles <k...@apache.org> wrote:
>
> It would be very useful to have line and/or branch coverage visible. These are both very weak proxies for quality or reliability, so IMO strict thresholds are not helpful. One thing that is super useful is to integrate line coverage into code review, like this: https://docs.codecov.io/docs/browser-extension. It is very easy to notice major missing tests.
>
> We have never really used SonarQube. It was turned on as a possibility in the early days but never worked on past that point. Could be nice. I suspect there's a lot to be gained by just finding very low numbers and improving them. So just running JaCoCo's offline HTML generation would do it (this also integrates with Jenkins). I tried this the other day and discovered that our gradle config is broken and does not wire tests and coverage reporting together properly. Last thing: how is "technical debt" measured? I'm skeptical of quantitative measures for qualitative notions.
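>
> For reference, the missing wiring would be something like this per-module sketch (standard jacoco plugin API, nothing Beam-specific; untested against our build):
>
>     apply plugin: 'jacoco'
>
>     jacocoTestReport {
>         dependsOn test               // run the tests before generating the report
>         reports {
>             html.enabled = true      // offline HTML output under build/reports/jacoco
>         }
>     }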
>
> Kenn
>
> On Thu, Jan 3, 2019 at 1:58 PM Heejong Lee <heej...@google.com> wrote:
>
> I don't have any experience using SonarQube, but Coverity worked well for me. Looks like it already has a beam repo: https://scan.coverity.com/projects/11881
>
> On Thu, Jan 3, 2019 at 1:27 PM Reuven Lax <re...@google.com> wrote:
>
> checkstyle and findbugs are already run as precommit checks, are they not?
>
> On Thu, Jan 3, 2019 at 7:19 PM Mikhail Gryzykhin <mig...@google.com> wrote:
>
> Hi everyone,
>
> In our current builds we (can) run multiple code quality check tools like checkstyle, findbugs, and code test coverage via Cobertura. However, we do not utilize many of those signals.
>
> I suggest adding requirements to code based on those tools. Specifically, I suggest adding pre-commit checks that will require PRs to conform to some quality checks.
>
> We can see a good example of thresholds to add in the Apache SonarQube provided default quality gate config <https://builds.apache.org/analysis/quality_gates/show/1>:
> 80% test coverage on new code,
> 5% technical debt on new code,
> No bugs/vulnerabilities added.
>
> As another part of this proposal, I want to suggest the use of SonarQube for tracking code statistics and as an agent for enforcing code quality thresholds. It is an Apache-provided tool that has integration with Jenkins or Gradle via plugins.
>
> I believe some reporting to SonarQube was configured for mvn builds of some Beam sub-projects, but it was lost during the migration to gradle.
>
> I was looking for other options, but so far I have found only general configs for gradle builds that fail the build if code coverage for the project is too low. Such an approach would force us to backfill tests for all existing code, which can be tedious and demands learning all the legacy code that might not be part of current work.
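>
> (For concreteness, the kind of gradle config I mean is the jacoco plugin's verification rule, roughly like the sketch below; the branch counter and 80% threshold are example values, not a proposal:
>
>     jacocoTestCoverageVerification {
>         violationRules {
>             rule {
>                 limit {
>                     counter = 'BRANCH'
>                     minimum = 0.80   // fails the build below 80% branch coverage
>                 }
>             }
>         }
>     }
>     check.dependsOn jacocoTestCoverageVerification
>
> As written this applies to the whole project at once, which is exactly the backfill problem described above.)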
>
> I suggest we discuss and come to a conclusion on two points in this thread:
>
> 1. Do we want to add code quality checks to our pre-commit jobs and require them to pass before a PR is merged?
>
>     Suggested: Add the code quality checks listed above at first, and adjust them as we see fit in the future.
>
> 2. What tools do we want to utilize for analyzing code quality?
>
>     Under discussion. Suggested: SonarQube, but it will depend on the functionality level we want to achieve.
>
>
> Regards,
> --Mikhail
>
>
>
> --
>
> Got feedback? tinyurl.com/swegner-feedback
>
--
-------
Jason Kuster
Apache Beam / Google Cloud Dataflow
See something? Say something. go/jasonkuster-feedback
--
Got feedback? tinyurl.com/swegner-feedback