Re: counting downloads

2007-05-06 Thread Xavier Hanin

FYI, Vadim is already providing stats on some (non incubating) projects:
http://people.apache.org/~vgritsenko/stats/projects/index.html

I don't know if it's easy for him to add incubating projects, and how
he deals with mirrors. In a sense even if the download counter is not
accurate, I agree with eelco to say that the trend can still be
interesting.

Xavier

On 5/2/07, Marshall Schor [EMAIL PROTECTED] wrote:

We're curious to see how many downloads we're getting, perhaps sorted by
ip number or who's downloading (I realize that would need be
volunteered information).
Do other projects have a good way to track this? I know we could pull
the logs for the p.a.o webserver and grep through them looking for
things, but I'm wondering if there's something we can put on our
download page that users would click on that would count things?

-Marshall Schor

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]





--
Xavier Hanin - Independent Java Consultant
Manage your dependencies with Ivy!
http://incubator.apache.org/ivy/

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-06 Thread robert burrell donkin

On 5/3/07, Daniel Kulp [EMAIL PROTECTED] wrote:


Noel,

On Thursday 03 May 2007 03:59, Noel J. Bergman wrote:
  Except you forget one detail: incubator artifacts don't go to the
  mirror system.

 And that's changing/changed.

Can I ask when?The vote that was called in mid march [1] never had
a result posted.   The discussion [2] that started that vote also
didn't seem to reach any type of concensus.


we've now had a clear call from infra

(incubator works within broader apache policy)

i've done this kind of infrastructure work before so i'll pull
together a plan in JIRA and run it past people on this list and infra


I guess my major concern right now is the CXF/Wicket releases that have
gone out in the lasts couple days.   Did we do the right thing just
putting them in /www/people.apache.org/dist/incubator or should they
have gone to the mirrors?   There isn't
a /www/www.apache.org/dist/incubator directory. Anyway, I'd just
like to make sure we didn't do something wrong due to a policy change I
missed.   (I could easily have missed it, email overload and all)


don't worry - IIRC the policy hasn't changed yet

IMHO this is something that needs to be fixed by the IPMC. it's going
to take a little while to pull everything together. we have some
scripts in jakarta that generate the download pages from meta-data and
that's the direction i think we need to move in. (they also have some
cool RSS/news stuff for releases is cool)

i'd rather have CXF and wicket in the old places and have the time to
do this properly than delay the releases or rush the move

- robert

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-06 Thread robert burrell donkin

On 5/3/07, Danny Angus [EMAIL PROTECTED] wrote:

On 5/2/07, Garrett Rooney [EMAIL PROTECTED] wrote:

  Do other projects have a good way to track this? I know we could pull
  the logs for the p.a.o webserver and grep through them looking for
  things, but I'm wondering if there's something we can put on our
  download page that users would click on that would count things?

James uses google analytics. We currently count the hits to the
download cgi page, which is probably over generous, but you can put
google javascript on the links to the files themselves, we just
haven't done it.


i'm going to need to create download pages for the incubator. this
isn't one of my personal itches and i would need to research the
analytics-foo. but if someone wants to create a JIRA with the required
javascripts, i'll take a look at including it.

any volunteers?

- robert

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-06 Thread Daniel Kulp

Thanks for the clarification Robert.   I'm glad we didn't mess something 
up.  :-)

I'll definitely keep an eye on the various threads for information about 
the changes.   It all sounds very good to me.  :-)

Dan


On Sunday 06 May 2007 06:55, robert burrell donkin wrote:
 On 5/3/07, Daniel Kulp [EMAIL PROTECTED] wrote:
  Noel,
 
  On Thursday 03 May 2007 03:59, Noel J. Bergman wrote:
Except you forget one detail: incubator artifacts don't go to
the mirror system.
  
   And that's changing/changed.
 
  Can I ask when?The vote that was called in mid march [1] never
  had a result posted.   The discussion [2] that started that vote
  also didn't seem to reach any type of concensus.

 we've now had a clear call from infra

 (incubator works within broader apache policy)

 i've done this kind of infrastructure work before so i'll pull
 together a plan in JIRA and run it past people on this list and infra

  I guess my major concern right now is the CXF/Wicket releases that
  have gone out in the lasts couple days.   Did we do the right thing
  just putting them in /www/people.apache.org/dist/incubator or should
  they have gone to the mirrors?   There isn't
  a /www/www.apache.org/dist/incubator directory. Anyway, I'd just
  like to make sure we didn't do something wrong due to a policy
  change I missed.   (I could easily have missed it, email overload
  and all)

 don't worry - IIRC the policy hasn't changed yet

 IMHO this is something that needs to be fixed by the IPMC. it's going
 to take a little while to pull everything together. we have some
 scripts in jakarta that generate the download pages from meta-data and
 that's the direction i think we need to move in. (they also have some
 cool RSS/news stuff for releases is cool)

 i'd rather have CXF and wicket in the old places and have the time to
 do this properly than delay the releases or rush the move

 - robert

 -
 To unsubscribe, e-mail: [EMAIL PROTECTED]
 For additional commands, e-mail: [EMAIL PROTECTED]

-- 
J. Daniel Kulp
Principal Engineer
IONA
P: 781-902-8727C: 508-380-7194
[EMAIL PROTECTED]
http://www.dankulp.com/blog

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-05 Thread Marshall Schor

Noel J. Bergman wrote:

Garrett Rooney wrote:

  

The mirror system makes this an essentially unsolvable problem.



And in addition to all of your other valid points, there is the problem of
having a a farm of caching proxies on the ISP side of the net, leading to
the seemingly odd case that the more popular a download, the less likely you
are to get a good count, because the more likely it is to be served from the
caching proxies.
--- Noel


  


One confusion I have: If we count clicks on the download link, it 
seems that even if that link led to a mirror page, it would count 
pretty accurately (except of course if a person clicked to download, and 
then didn't bother going through with it).  I guess it would also miss 
the case where people somehow found their way to a download mirror 
*without* going through the project download page.  Did I miss 
something, or would this give a reasonable estimate of downloads?


-Marshall

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-05 Thread Garrett Rooney

On 5/5/07, Marshall Schor [EMAIL PROTECTED] wrote:


One confusion I have: If we count clicks on the download link, it
seems that even if that link led to a mirror page, it would count
pretty accurately (except of course if a person clicked to download, and
then didn't bother going through with it).  I guess it would also miss
the case where people somehow found their way to a download mirror
*without* going through the project download page.  Did I miss
something, or would this give a reasonable estimate of downloads?


Unless they don't download it via your link, or they download more
than once (getting copies on multiple machines?), or any number of
other things that can throw your numbers off.  It's a losing battle
for statistics that IMO aren't very useful anyway.  All download
counts are good for is ego stroking, there are better ways to spend
time and energy.

-garrett

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-05 Thread Eelco Hillenius

Unless they don't download it via your link, or they download more
than once (getting copies on multiple machines?), or any number of
other things that can throw your numbers off.  It's a losing battle
for statistics that IMO aren't very useful anyway.  All download
counts are good for is ego stroking, there are better ways to spend
time and energy.


I agree with most of the comments here. But one thing download
statistic imho *are* useful for is measuring progress. Even if the
absolute numbers aren't useful, and trends are not reliable, it
definitively gives an idea of whether a project is gaining attention
or not.

Another thing is that even if statistics aren't reliable, people still
often want to have them. Matt Raible's talk on ApacheCon listed job
counts per framework on Dice for instance. Completely useless, but
people seem to love to hear it, even at Apache. Or there is my
publisher asking me for download statistics of the project I'm
involved in and using Amazon's ranking of a competing book to
'determine' the market potential for the topic.

Imho, it's enough to say that we/ Apache doesn't want to spend energy
on something that is technically hard or impossible and that in the
opinion of the people who have to support it serves little value.

Eelco

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



RE: counting downloads

2007-05-03 Thread Noel J. Bergman
 Except you forget one detail: incubator artifacts don't go to the mirror 
 system.

And that's changing/changed.

--- Noel


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-03 Thread Daniel Kulp

Noel,

On Thursday 03 May 2007 03:59, Noel J. Bergman wrote:
  Except you forget one detail: incubator artifacts don't go to the
  mirror system.

 And that's changing/changed.

Can I ask when?The vote that was called in mid march [1] never had 
a result posted.   The discussion [2] that started that vote also 
didn't seem to reach any type of concensus.

I guess my major concern right now is the CXF/Wicket releases that have 
gone out in the lasts couple days.   Did we do the right thing just 
putting them in /www/people.apache.org/dist/incubator or should they 
have gone to the mirrors?   There isn't 
a /www/www.apache.org/dist/incubator directory. Anyway, I'd just 
like to make sure we didn't do something wrong due to a policy change I 
missed.   (I could easily have missed it, email overload and all)


[1]http://mail-archives.apache.org/mod_mbox/incubator-general/200703.mbox/31cc37360703151823u2099483dn95aeaed8c6752245%40mail.gmail.com
[2]http://mail-archives.apache.org/mod_mbox/incubator-general/200703.mbox/31cc37360703140920h6a601b57gbb86ff0cd5e8049b%40mail.gmail.com

-- 
J. Daniel Kulp
Principal Engineer
IONA
P: 781-902-8727C: 508-380-7194
[EMAIL PROTECTED]
http://www.dankulp.com/blog

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-03 Thread Danny Angus

On 5/2/07, Garrett Rooney [EMAIL PROTECTED] wrote:


 Do other projects have a good way to track this? I know we could pull
 the logs for the p.a.o webserver and grep through them looking for
 things, but I'm wondering if there's something we can put on our
 download page that users would click on that would count things?


James uses google analytics. We currently count the hits to the
download cgi page, which is probably over generous, but you can put
google javascript on the links to the files themselves, we just
haven't done it.

d.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-02 Thread Luciano Resende

Yes, I'd be interested in this information for the Apache Tuscany downloads
as well...

On 5/2/07, Marshall Schor [EMAIL PROTECTED] wrote:


We're curious to see how many downloads we're getting, perhaps sorted by
ip number or who's downloading (I realize that would need be
volunteered information).
Do other projects have a good way to track this? I know we could pull
the logs for the p.a.o webserver and grep through them looking for
things, but I'm wondering if there's something we can put on our
download page that users would click on that would count things?

-Marshall Schor

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]





--
Luciano Resende
http://people.apache.org/~lresende


Re: counting downloads

2007-05-02 Thread Garrett Rooney

On 5/2/07, Marshall Schor [EMAIL PROTECTED] wrote:

We're curious to see how many downloads we're getting, perhaps sorted by
ip number or who's downloading (I realize that would need be
volunteered information).
Do other projects have a good way to track this? I know we could pull
the logs for the p.a.o webserver and grep through them looking for
things, but I'm wondering if there's something we can put on our
download page that users would click on that would count things?


The mirror system makes this an essentially unsolvable problem.  Not
to mention the fact that you have no clue if people are actually
getting the code from one of our mirrors at all, they could get it
from a linux distribution, or any number of other repackagers.  On top
of that, you can't correlate back to an individual person, so you
don't know if you're getting one guy downloading it more than once
because he's accidentally deleted it...

Get used to the idea that counting downloads is an inherently poor way
to judge if your software is being used.  More interesting metrics
might be numbers of posts on mailing lists, bugs filed, etc.

-garrett

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



RE: counting downloads

2007-05-02 Thread Noel J. Bergman
Garrett Rooney wrote:

 The mirror system makes this an essentially unsolvable problem.

And in addition to all of your other valid points, there is the problem of
having a a farm of caching proxies on the ISP side of the net, leading to
the seemingly odd case that the more popular a download, the less likely you
are to get a good count, because the more likely it is to be served from the
caching proxies.

--- Noel



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-02 Thread Daniel Kulp
On Wednesday 02 May 2007 18:43, Garrett Rooney wrote:
 The mirror system makes this an essentially unsolvable problem.  Not
 to mention the fact that you have no clue if people are actually
 getting the code from one of our mirrors at all, they could get it
 from a linux distribution, or any number of other repackagers.  On top
 of that, you can't correlate back to an individual person, so you
 don't know if you're getting one guy downloading it more than once
 because he's accidentally deleted it...

Except you forget one detail: incubator artifacts don't go to the mirror 
system.   (Which, again, I think is silly.  Part of being in the 
incubator is to learn about apache processes.   One of those should be 
the mirror system.)

To answer the question: all the logs are available on 
people.apache.org:/x1/logarchive/people
You can parse them an pull out any information you want.   Just keep in 
mind that after you graduate, all warnings about the mirrors do apply, 
so don't get used to having the download information.


-- 
J. Daniel Kulp
Principal Engineer
IONA
P: 781-902-8727C: 508-380-7194
[EMAIL PROTECTED]
http://www.dankulp.com/blog

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: counting downloads

2007-05-02 Thread Luciano Resende

Thanks Dan, indeed, as Tuscany is still under incubation, mirrors and other
things won't apply.
Did anybody ever created a script to parse the logs and provide totals ?

On 5/2/07, Daniel Kulp [EMAIL PROTECTED] wrote:


On Wednesday 02 May 2007 18:43, Garrett Rooney wrote:
 The mirror system makes this an essentially unsolvable problem.  Not
 to mention the fact that you have no clue if people are actually
 getting the code from one of our mirrors at all, they could get it
 from a linux distribution, or any number of other repackagers.  On top
 of that, you can't correlate back to an individual person, so you
 don't know if you're getting one guy downloading it more than once
 because he's accidentally deleted it...

Except you forget one detail: incubator artifacts don't go to the mirror
system.   (Which, again, I think is silly.  Part of being in the
incubator is to learn about apache processes.   One of those should be
the mirror system.)

To answer the question: all the logs are available on
people.apache.org:/x1/logarchive/people
You can parse them an pull out any information you want.   Just keep in
mind that after you graduate, all warnings about the mirrors do apply,
so don't get used to having the download information.


--
J. Daniel Kulp
Principal Engineer
IONA
P: 781-902-8727C: 508-380-7194
[EMAIL PROTECTED]
http://www.dankulp.com/blog

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]





--
Luciano Resende
http://people.apache.org/~lresende