Re: Standards for mail archive statistics gathering?

2015-05-07 Thread Steve Blackmon
I'd like to submit to this group that no batch job is necessary to compute many useful statistics - rather with a suitable representation an indexed event stream of mailing list messages, commits, releases, etc... can be searched aggregated and visualized in real-time. Hacked a bit this afternoon

Re: Standards for mail archive statistics gathering?

2015-05-07 Thread Andrea Pescetti
Betty James wrote: How do I get off this thread. don't know how I got on Just follow the instructions others have already provided. I'm putting you in CC in case you have already unsubscribed from this discussion list. but I am just a totally ignorant individual using Open Office and tryi

Re: Standards for mail archive statistics gathering?

2015-05-06 Thread Hervé BOUTEMY
Le mercredi 6 mai 2015 12:48:34 Steve Blackmon a écrit : > > For visualization, for sure, json is the current natural format when data > > is consumed from the browser. > > I don't have great experience on this, and what I'm missing with json > > currently is a common practice on documenting a stru

Re: Standards for mail archive statistics gathering?

2015-05-06 Thread Niclas Hedhman
If you want to unsubscribe, please find instructions at http://apache.org/foundation/mailinglists.html And the name of this list is dev@community.apache.org Cheers Niclas On Thu, May 7, 2015 at 7:48 AM, Betty James wrote: > Oh my gosh. How do I get off this thread. don't know how I got on, b

Re: Standards for mail archive statistics gathering?

2015-05-06 Thread Betty James
Oh my gosh. How do I get off this thread. don't know how I got on, but I am just a totally ignorant individual using Open Office and trying to donate (which doesn't sound necessary anymore)so unless you are in good shape and in your 70's try to figure out how I can get off the list! Betty B.

Re: Standards for mail archive statistics gathering?

2015-05-06 Thread Steve Blackmon
> For visualization, for sure, json is the current natural format when data is > consumed from the browser. > I don't have great experience on this, and what I'm missing with json > currently is a common practice on documenting a structure: are there common > practices? In podling streams [0], we

Re: Standards for mail archive statistics gathering?

2015-05-06 Thread Boris Baldassari
Hi all, Le 06/05/2015 03:26, Shane Curcuru a écrit : Drat. Metrics Grimoire looks pretty nifty - essentially a set of frameworks for extracting metadata from a bunch of sources - but it's GPL, so personally I have no interest in working on it. If someone else uses it to generate datasets that

Re: Standards for mail archive statistics gathering?

2015-05-05 Thread Hervé BOUTEMY
Le mardi 5 mai 2015 21:26:36 Shane Curcuru a écrit : > On 5/5/15 7:33 AM, Boris Baldassari wrote: > > Hi Folks, > > > > Sorry for the late answer on this thread. Don't know what has been done > > since then, but I've some experience to share on this, so here are my 2c.. > > No, more input is alwa

Re: Standards for mail archive statistics gathering?

2015-05-05 Thread Shane Curcuru
On 5/5/15 7:33 AM, Boris Baldassari wrote: > Hi Folks, > > Sorry for the late answer on this thread. Don't know what has been done > since then, but I've some experience to share on this, so here are my 2c.. No, more input is always appreciated! Hervé is doing some centralization of the projects

Re: Standards for mail archive statistics gathering?

2015-05-05 Thread Louis Suárez-Potts
> On 05 May 2015, at 07:33, Boris Baldassari > wrote: > > Hi Folks, > > Sorry for the late answer on this thread. Don't know what has been done since > then, but I've some experience to share on this, so here are my 2c.. > > * Parsing dates and time zones: > If you are to use Perl, the Date:

Re: Standards for mail archive statistics gathering?

2015-05-05 Thread Boris Baldassari
Hi Folks, Sorry for the late answer on this thread. Don't know what has been done since then, but I've some experience to share on this, so here are my 2c.. * Parsing dates and time zones: If you are to use Perl, the Date::Parse module handles dates and time zones pretty well. As for Python I

Re: Standards for mail archive statistics gathering?

2015-04-28 Thread Rich Bowen
On 04/27/2015 09:36 AM, Shane Curcuru wrote: I'm interested in working on some visualizations of mailing list activity over time, in particular some simple analyses, like thread length/participants and the like. Given that the raw data can all be precomputed from mbox archives, is there any se

Re: Standards for mail archive statistics gathering?

2015-04-28 Thread Shane Curcuru
On 4/27/15 2:29 PM, Rob Weir wrote: > On Mon, Apr 27, 2015 at 9:36 AM, Shane Curcuru wrote: ... > If you do Python, you might take a look at > https://svn.apache.org/repos/asf/openoffice/devtools/list-stats/ for a > simple program that could be adapted easily enough. It uses the > Python mailbox

Re: Standards for mail archive statistics gathering?

2015-04-27 Thread Rob Weir
On Mon, Apr 27, 2015 at 9:36 AM, Shane Curcuru wrote: > I'm interested in working on some visualizations of mailing list > activity over time, in particular some simple analyses, like thread > length/participants and the like. Given that the raw data can all be > precomputed from mbox archives, i

Standards for mail archive statistics gathering?

2015-04-27 Thread Shane Curcuru
I'm interested in working on some visualizations of mailing list activity over time, in particular some simple analyses, like thread length/participants and the like. Given that the raw data can all be precomputed from mbox archives, is there any semi-standard way to distill and save metadata abou