Dear All, 

I have loaded two dumps of the English Wikipedia, listed below, and counted 
the revisions per year in each. Could you please let me know which one is 
complete and can be analyzed? I am also confused about why the counts for 
2001-2009 differ between the two dumps. 
Thanks very much!

SELECT count(1), to_char(rev_timestamp, 'YYYY')
FROM enwiki.revision
GROUP BY to_char(rev_timestamp, 'YYYY')
ORDER BY to_char(rev_timestamp, 'YYYY');


Source:
http://download.wikimedia.org/enwiki/20100130/enwiki-20100130-stub-meta-history.xml.gz

+----------+---------------------+
| count(1) | year(rev_timestamp) |
+----------+---------------------+
|    57559 |                2001 |
|   616878 |                2002 |
|  1598363 |                2003 |
|  6999869 |                2004 |
| 20697477 |                2005 |
| 57214741 |                2006 |
| 75235972 |                2007 |
| 74757575 |                2008 |
| 70600627 |                2009 |
|  6017974 |                2010 |
+----------+---------------------+


 
Source:
http://download.wikimedia.org/enwiki/20101011/enwiki-20101011-stub-meta-history.xml.gz

>    64305  2001
>   616257  2002
>  1596612  2003
>  6979494  2004
> 20642853  2005
> 57043694  2006
> 74936692  2007
> 74387391  2008
> 70085652  2009
> 53054853  2010
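
To make the discrepancy easier to see, the per-year difference between the 
two listings can be computed directly. The following is only a minimal 
sketch in Python: the counts are copied from the two tables above, and the 
names dump_20100130, dump_20101011 and yearly_diff are made up for this 
illustration. It shows how large the differences are, not what causes them.

```python
# Per-year revision counts copied from the two listings above.
dump_20100130 = {
    2001: 57559, 2002: 616878, 2003: 1598363, 2004: 6999869,
    2005: 20697477, 2006: 57214741, 2007: 75235972, 2008: 74757575,
    2009: 70600627, 2010: 6017974,
}
dump_20101011 = {
    2001: 64305, 2002: 616257, 2003: 1596612, 2004: 6979494,
    2005: 20642853, 2006: 57043694, 2007: 74936692, 2008: 74387391,
    2009: 70085652, 2010: 53054853,
}


def yearly_diff(old, new):
    """Return {year: new_count - old_count} for years present in both."""
    return {year: new[year] - old[year] for year in sorted(old) if year in new}


diffs = yearly_diff(dump_20100130, dump_20101011)
for year, delta in diffs.items():
    # Relative change with respect to the older (20100130) dump.
    pct = 100.0 * delta / dump_20100130[year]
    print(f"{year}  {delta:+12d}  {pct:+8.2f}%")
```

With these numbers, the later dump has more revisions for 2001 and 2010 and 
fewer for every year from 2002 to 2009.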

---------------------
>Wikitech-l mailing list
>Wikitech-l@lists.wikimedia.org
>https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>





