[ 
https://issues.apache.org/jira/browse/COMDEV-295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17688183#comment-17688183
 ] 

Daniel Gruno commented on COMDEV-295:
-------------------------------------

The mail stats will, as alluded to on the dev list, need a complete rework.
I don't have cycles for it right now, but the basic premise would be:

for each project:
  grab the last six months of stats from lists.apache.org for each of its lists
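
As a rough illustration only, the loop above could be sketched in Python like this. The stats interface of lists.apache.org is not described in this thread, so the lookup is passed in as a hypothetical callable (fetch_list_stats), and the shape of the projects mapping is likewise an assumption.

```python
from datetime import date


def last_six_months(today):
    """Return the six most recent (year, month) pairs, oldest first,
    ending with the month that contains `today`."""
    months = []
    year, month = today.year, today.month
    for _ in range(6):
        months.append((year, month))
        month -= 1
        if month == 0:  # roll back over the year boundary
            year, month = year - 1, 12
    return list(reversed(months))


def collect_stats(projects, fetch_list_stats, today=None):
    """Gather per-list monthly stats for every project.

    projects: dict of project name -> list of list names (assumed shape).
    fetch_list_stats: hypothetical callable (list_name, year, month) -> stats;
        a real version would query lists.apache.org.
    """
    months = last_six_months(today or date.today())
    stats = {}
    for project, list_names in projects.items():
        stats[project] = {
            name: {f"{y:04d}-{m:02d}": fetch_list_stats(name, y, m)
                   for (y, m) in months}
            for name in list_names
        }
    return stats
```

Passing the fetcher in keeps the sketch independent of whatever interface the rework ends up using.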


> Consider whether to extract mod_mbox statistics locally
> -------------------------------------------------------
>
>                 Key: COMDEV-295
>                 URL: https://issues.apache.org/jira/browse/COMDEV-295
>             Project: Community Development
>          Issue Type: Improvement
>          Components: Reporter Tool
>            Reporter: Sebb
>            Priority: Major
>
> At present the reporter tool uses the mailglomper script to extract 
> information about the number and timing of mails sent to each list.
> To do this, it has to download the entire mailbox, even though it only needs 
> the ^From_ lines.
> Whilst the code attempts to avoid unnecessary downloads (by checking dates 
> and sizes), for busy mbox files there can be a lot of network traffic.
> It might make sense for some initial data extraction to be done locally and 
> cached in a file.
> This could be done as part of updating the mod_mbox indexes.
> For example, the script could extract just the ^From_ lines into a separate 
> file and store that alongside the .mbox file. Mailglomper would read the much 
> abbreviated file instead.
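
A minimal sketch of the local extraction step described in the issue, assuming the archives are standard mbox files in which every message starts with a separator line beginning "From " and body lines that would otherwise match are escaped (for example as ">From "). The function name and the paths are hypothetical; this is not the mailglomper or mod_mbox code.

```python
def extract_from_lines(mbox_path, out_path):
    """Copy only the mbox separator lines (those starting with b"From ")
    from mbox_path into out_path and return how many were written."""
    count = 0
    # Binary mode avoids guessing at the character encoding of the mail.
    with open(mbox_path, "rb") as src, open(out_path, "wb") as dst:
        for line in src:
            if line.startswith(b"From "):
                dst.write(line)
                count += 1
    return count
```

The output holds one line per message, so a consumer would only need to download that much smaller file to get the number and timing of the mails.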



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@community.apache.org
For additional commands, e-mail: dev-h...@community.apache.org
