Re: On wearing multiple hats

2016-10-28 Thread Shawn Heisey
On 10/25/2016 1:50 AM, Isabel Drost-Fromm wrote:
> Longer version: Every now and then I hear people complain either
> privately or publicly [1] that people working on Apache projects who
> are not paid to do that work and don't have the luxury to
> participate full-time are facing a hard time getting into our
> communities. 

I'm a committer on the Lucene-Solr project, working primarily on the
Solr part.  I've had this role for about 3.5 years.  I am not a member
of the PMC.

The committer invitation came completely out of the blue.  Before that,
I had contributed a few patches via Jira, and some of them had even been
committed, but my biggest participation is being active on the Solr
mailing list and IRC channel.  I maintain Solr installations as part of
my job, but nobody has ever paid me for the work I do on the project,
and my employer has never made any demands of me in my role as
committer.  I definitely cannot work on Solr full-time.  I enjoy
participating, and I like to think that I'm part of a good open source
community.

I think I can safely say that our project has several people who are not
paid for their project work, and do not have significant spare time to
work on the project.  There are also a number of committers who DO have
jobs where I believe they are effectively paid to improve the project,
even if it's not a full job description.  It's hard to say whether those
relationships represent conflicts of interest regarding the health of
the project.  My cautious point of view is that there's no *immediate*
cause for concern with Lucene.

At least one of our committers knows almost nothing about Java, which is
significant because Lucene-Solr is a Java codebase. That person obtained
the role because of a strong willingness to help in other areas -- they
are active on the mailing lists, and they almost single-handedly
contributed a vastly superior Solr web interface before being invited as
a committer, using HTML, CSS, and JavaScript.

I'm not sure which Apache projects might fit the description you have
provided.  I am subscribed to a few other Apache project mailing lists,
for other Java technologies that Solr includes as dependencies.  Aside
from being far less active than the Solr community, those also appear to
work properly in the Apache Way like (IMHO) Lucene-Solr does.

Even if there are projects that work the way you have described, I'm
reluctant to endorse having the foundation "help" (read: interfere) with
their operation unless the project or its community specifically
requests it.  That should be reserved for projects that are completely
broken, not projects that have a few internal issues to work out.  If a
particular community feels that they have issues, I think it's mostly up
to that community (the PMC in particular) to make that determination and
deal with the problem.

All that said... there's likely room for improvement in some
projects/communities, even some that you'd say are healthy.

Thanks,
Shawn


-
To unsubscribe, e-mail: dev-unsubscr...@community.apache.org
For additional commands, e-mail: dev-h...@community.apache.org



Re: Adding some statistics to projects.a.o?

2016-10-28 Thread Hervé BOUTEMY
On Friday, 28 October 2016, 10:41:37 CEST, Daniel Gruno wrote:
> > For Maven, the only option I see is pom.xml files: how can we confirm
> > this?
> > And confirm if language breakdowns counts files only, or weighted with
> > file size or with another weight?
> 
> It uses the same heuristics as CLoC (with a few modifications for
> increased stability), so you could run that locally and see why it does
> what it does.
OK, then here is the doc I'll add a pointer to:
https://github.com/AlDanial/cloc#Languages

> language analysers are never 100% accurate, OpenHub's
> analyser is famous for making odd claims about Forth, and GitHub's
> downplays Python in many projects, the list goes on :)
> 
> Counts are lines of code, there is no weighting going on there. it's
> just raw figures.
> 
> > Is Snoot open sourced somewhere?
> 
> It's about as open as GitHub, OpenHub, Masterbranch etc :) The interface
> for the system is public and documented (under documentation/exports),
> but the internal systems on the boxes are proprietary for the most part.
Great, all-sources is the API I needed to integrate content into the
projects.a.o build. We'll need to sort out which token to use, but at least
I can start working on interpreting results fetched by hand.

https://api.snoot.io/api/3/api-docs#all-sources


> 
> With regards,
> Daniel.
> 
> > Regards,
> > 
> > Hervé
> > 






Re: Adding some statistics to projects.a.o?

2016-10-28 Thread Hervé BOUTEMY
On Friday, 28 October 2016, 10:27:31 CEST, Daniel Gruno wrote:
> > I have a snoot account, then I could have a look at the list of repos that
> > are taken into account. I have a few questions:
> > 1. can we show the list of repos from this statistics page?
> 
> Do you mean the entire list of repos analysed? I'm sure we could, I just
> don't quite know how we'd present it :) It's a rather large list.
yes, I mean the full list (740 repositories, Snoot says :) )
I know that making it appealing will require some work, but I think that it's 
important to make that information visible for people wanting to dig into 
these stats, and even help us fix issues on Snoot config

I'm even sure we could try to add this list of repos, split by committee, to
each committee page: that would let each PMC see its Snoot config and,
once again, help fix issues

> 
> > 2. I saw that some imports are failing, because list of repos change over
> > time: how can I help fix issues?
> 
> If you're up for keeping the list updated, speak to Sally about getting
> admin privs on Snoot, I'm sure she'll be happy to have someone help out :)
Great, I'll do that, thanks.

Regards,

Hervé

> 
> With regards,
> Daniel.
> 
> > Regards,
> > 
> > Hervé
> > 
> > [1] https://projects.apache.org/projects.html?pmc
> > 
> > On Wednesday, 26 October 2016, 21:28:17 CEST, Daniel Gruno wrote:
> >> On 10/26/2016 09:06 PM, Mike Drob wrote:
> >>> A few section specific comments -
> >>> Largest/Busiest projects is difficult to make use of due to the huge
> >>> "other" section. Maybe a list makes more sense rather than a pie/circle
> >>> chart.
> >>> Email, topics and email authors, past year -- more readable as a line
> >>> chart and for a longer time span I think
> >> 
> >> Changing the email stats to lines was rather straightforward, so I've
> >> done that. I also changed it to just show stats for user/dev lists,
> >> leaving out the issues/commit lists which are rather chatty but not
> >> representative of email-based discussions. Changing the top repos by
> >> sloc/commits will require some time, as I'll have to write some custom
> >> representation for that.
> >> 
> >> With regards,
> >> Daniel.
> >> 
> >>> On Wed, Oct 26, 2016 at 1:07 PM, Daniel Gruno wrote:
> >>>> I added an initial stats page at
> >>>> https://projects.apache.org/statistics.html - assuming no one objects,
> >>>> I'll add it to the top menu of the other pages in a day or so.
> >>>>
> >>>> Do peruse - anything we need to add/edit?
> >>>>
> >>>> With regards,
> >>>> Daniel.
> >>>>
> >>>> On 10/26/2016 01:07 PM, Daniel Gruno wrote:
> >>>>> Hi folks,
> >>>>> I was wondering, since we have full access to Snoot for the ASF, why
> >>>>> not take advantage of that and add a statistics page to
> >>>>> projects.apache.org, showing the various live stats available (no. of
> >>>>> commits/committers, largest repos by size/commits, proper language
> >>>>> breakdown, relationship mapping, mail stats etc).
> >>>>>
> >>>>> I was inclined to JFDI, but I'd love to hear what others think about
> >>>>> this. If I don't hear any loud objections, I'll add a stats page
> >>>>> today, and we can see if it's of any use :)
> >>>>>
> >>>>> Comments? Suggestions? :)
> >>>>>
> >>>>> With regards,
> >>>>> Daniel.
> >>>>>






Re: Adding some statistics to projects.a.o?

2016-10-28 Thread Daniel Gruno
On 10/28/2016 10:30 AM, Hervé BOUTEMY wrote:
> On Wednesday, 26 October 2016, 23:12:44 CEST, Daniel Gruno wrote:
>> On 10/26/2016 10:56 PM, Phil Steitz wrote:
>>> On 10/26/16 11:07 AM, Daniel Gruno wrote:
>>>> I added an initial stats page at
>>>> https://projects.apache.org/statistics.html - assuming no one objects,
>>>> I'll add it to the top menu of the other pages in a day or so.
>>>>
>>>> Do peruse - anything we need to add/edit?
>>>
>>> Maven is not a programming language.  What exactly is the
>>> denominator on that stat?  Number of files?  Lines of code?
>>> Projects primarily using?
>>
>> I suspect it's scripts specifically for maven it's counting. the
>> denominator is lines of functional code (101 million in total, not
>> counting blanks and comments which take us to 150M total).
> For Maven, the only option I see is pom.xml files: how can we confirm this?
> And can we confirm whether the language breakdown counts files only, or is
> weighted by file size or by another weight?

It uses the same heuristics as CLoC (with a few modifications for
increased stability), so you could run that locally and see why it does
what it does. Language analysers are never 100% accurate: OpenHub's
analyser is famous for making odd claims about Forth, and GitHub's
downplays Python in many projects, the list goes on :)

Counts are lines of code; there is no weighting going on there. It's
just raw figures.
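
To illustrate what an unweighted "lines of code" breakdown means, here is a
rough sketch. It is not Snoot's or CLoC's actual implementation: the
extension table, the function names, and the rule of skipping only blank
lines and whole-line comments are all assumptions made for this example.

```python
from collections import Counter

# Hypothetical mapping: file extension -> (language name, line-comment prefix).
# Real tools such as CLoC use far richer per-language rules.
LANGUAGES = {
    ".py": ("Python", "#"),
    ".java": ("Java", "//"),
    ".sh": ("Shell", "#"),
}


def count_code_lines(text, comment_prefix):
    """Count lines that are neither blank nor whole-line comments."""
    count = 0
    for line in text.splitlines():
        stripped = line.strip()
        if stripped and not stripped.startswith(comment_prefix):
            count += 1
    return count


def language_breakdown(files):
    """Sum raw code-line counts per language, with no weighting.

    `files` maps a file name to its text contents; files with an
    extension missing from LANGUAGES are ignored.
    """
    totals = Counter()
    for name, text in files.items():
        for ext, (language, prefix) in LANGUAGES.items():
            if name.endswith(ext):
                totals[language] += count_code_lines(text, prefix)
                break
    return totals
```

Each language's share is then simply its total divided by the sum over all
languages, which is why one large repo can dominate the figures.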

> Is Snoot open sourced somewhere?

It's about as open as GitHub, OpenHub, Masterbranch etc :) The interface
for the system is public and documented (under documentation/exports),
but the internal systems on the boxes are proprietary for the most part.

With regards,
Daniel.

> 
> Regards,
> 
> Hervé
> 





Re: Adding some statistics to projects.a.o?

2016-10-28 Thread Hervé BOUTEMY
On Wednesday, 26 October 2016, 23:12:44 CEST, Daniel Gruno wrote:
> On 10/26/2016 10:56 PM, Phil Steitz wrote:
> > On 10/26/16 11:07 AM, Daniel Gruno wrote:
> >> I added an initial stats page at
> >> https://projects.apache.org/statistics.html - assuming no one objects,
> >> I'll add it to the top menu of the other pages in a day or so.
> >> 
> >> Do peruse - anything we need to add/edit?
> > 
> > Maven is not a programming language.  What exactly is the
> > denominator on that stat?  Number of files?  Lines of code?
> > Projects primarily using?
> 
> I suspect it's scripts specifically for maven it's counting. the
> denominator is lines of functional code (101 million in total, not
> counting blanks and comments which take us to 150M total).
For Maven, the only option I see is pom.xml files: how can we confirm this?
And can we confirm whether the language breakdown counts files only, or is
weighted by file size or by another weight?
Is Snoot open sourced somewhere?

Regards,

Hervé




Re: Adding some statistics to projects.a.o?

2016-10-28 Thread Daniel Gruno
On 10/28/2016 10:17 AM, Hervé BOUTEMY wrote:
> IIUC, this "Largest/Busiest projects" statistic is neither per project
> nor per committee (or PMC), but per repo
> 
> notice: 1 committee (or PMC) = n projects [1]
> and 1 committee may have many repos
> 
> I'll update the title to "Largest/Busiest repos", that will be less 
> misleading.

Thanks!
> 
> 
> I have a Snoot account, so I could have a look at the list of repos that
> are taken into account. I have a few questions:
> 1. can we show the list of repos from this statistics page?

Do you mean the entire list of repos analysed? I'm sure we could, I just
don't quite know how we'd present it :) It's a rather large list.

> 2. I saw that some imports are failing, because list of repos change over 
> time: how can I help fix issues?

If you're up for keeping the list updated, speak to Sally about getting
admin privs on Snoot, I'm sure she'll be happy to have someone help out :)

With regards,
Daniel.

> 
> Regards,
> 
> Hervé
> 
> [1] https://projects.apache.org/projects.html?pmc
> 
> On Wednesday, 26 October 2016, 21:28:17 CEST, Daniel Gruno wrote:
>> On 10/26/2016 09:06 PM, Mike Drob wrote:
>>> A few section specific comments -
>>> Largest/Busiest projects is difficult to make use of due to the huge
>>> "other" section. Maybe a list makes more sense rather than a pie/circle
>>> chart.
>>> Email, topics and email authors, past year -- more readable as a line
>>> chart and for a longer time span I think
>>
>> Changing the email stats to lines was rather straightforward, so I've
>> done that. I also changed it to just show stats for user/dev lists,
>> leaving out the issues/commit lists which are rather chatty but not
>> representative of email-based discussions. Changing the top repos by
>> sloc/commits will require some time, as I'll have to write some custom
>> representation for that.
>>
>> With regards,
>> Daniel.
>>
>>> On Wed, Oct 26, 2016 at 1:07 PM, Daniel Gruno wrote:
>>>> I added an initial stats page at
>>>> https://projects.apache.org/statistics.html - assuming no one objects,
>>>> I'll add it to the top menu of the other pages in a day or so.
>>>>
>>>> Do peruse - anything we need to add/edit?
>>>>
>>>> With regards,
>>>> Daniel.
>>>>
>>>> On 10/26/2016 01:07 PM, Daniel Gruno wrote:
>>>>> Hi folks,
>>>>> I was wondering, since we have full access to Snoot for the ASF, why not
>>>>> take advantage of that and add a statistics page to projects.apache.org,
>>>>> showing the various live stats available (no. of commits/committers,
>>>>> largest repos by size/commits, proper language breakdown, relationship
>>>>> mapping, mail stats etc).
>>>>>
>>>>> I was inclined to JFDI, but I'd love to hear what others think about
>>>>> this. If I don't hear any loud objections, I'll add a stats page today,
>>>>> and we can see if it's of any use :)
>>>>>
>>>>> Comments? Suggestions? :)
>>>>>
>>>>> With regards,
>>>>> Daniel.
>>>>>





Re: Adding some statistics to projects.a.o?

2016-10-28 Thread Hervé BOUTEMY
IIUC, this "Largest/Busiest projects" statistic is neither per project
nor per committee (or PMC), but per repo

notice: 1 committee (or PMC) = n projects [1]
and 1 committee may have many repos

I'll update the title to "Largest/Busiest repos", that will be less 
misleading.


I have a Snoot account, so I could have a look at the list of repos that are
taken into account. I have a few questions:
1. can we show the list of repos from this statistics page?
2. I saw that some imports are failing, because list of repos change over 
time: how can I help fix issues?

Regards,

Hervé

[1] https://projects.apache.org/projects.html?pmc

On Wednesday, 26 October 2016, 21:28:17 CEST, Daniel Gruno wrote:
> On 10/26/2016 09:06 PM, Mike Drob wrote:
> > A few section specific comments -
> > Largest/Busiest projects is difficult to make use of due to the huge
> > "other" section. Maybe a list makes more sense rather than a pie/circle
> > chart.
> > Email, topics and email authors, past year -- more readable as a line
> > chart and for a longer time span I think
> 
> Changing the email stats to lines was rather straightforward, so I've
> done that. I also changed it to just show stats for user/dev lists,
> leaving out the issues/commit lists which are rather chatty but not
> representative of email-based discussions. Changing the top repos by
> sloc/commits will require some time, as I'll have to write some custom
> representation for that.
> 
> With regards,
> Daniel.
> 
> > On Wed, Oct 26, 2016 at 1:07 PM, Daniel Gruno wrote:
> >> I added an initial stats page at
> >> https://projects.apache.org/statistics.html - assuming no one objects,
> >> I'll add it to the top menu of the other pages in a day or so.
> >> 
> >> Do peruse - anything we need to add/edit?
> >> 
> >> With regards,
> >> Daniel.
> >> 
> >> On 10/26/2016 01:07 PM, Daniel Gruno wrote:
> >>> Hi folks,
> >>> I was wondering, since we have full access to Snoot for the ASF, why not
> >>> take advantage of that and add a statistics page to projects.apache.org,
> >>> showing the various live stats available (no. of commits/committers,
> >>> largest repos by size/commits, proper language breakdown, relationship
> >>> mapping, mail stats etc).
> >>> 
> >>> I was inclined to JFDI, but I'd love to hear what others think about
> >>> this. If I don't hear any loud objections, I'll add a stats page today,
> >>> and we can see if it's of any use :)
> >>> 
> >>> Comments? Suggestions? :)
> >>> 
> >>> With regards,
> >>> Daniel.
> >>> 


