Dear All,
I have used two dumps from english Wikipedia as below, the count results
turn out like this, Would you please let me know which one is completed and
can be analyzed? and I am confused why the 2001-2009 had different number?
Thanks very much !!!!!!
select count (1), to_char(rev_timestamp,'YYYY') from enwiki.revision group
by to_char(rev_timestamp,'YYYY') order by (to_char(rev_timestamp,'YYYY'))
resource is :
http://download.wikimedia.org/enwiki/20100130/enwiki-20100130-stub-meta-his…
+----------+---------------------+
| count(1) | year(rev_timestamp) |
+----------+---------------------+
| 57559 | 2001 |
| 616878 | 2002 |
| 1598363 | 2003 |
| 6999869 | 2004 |
| 20697477 | 2005 |
| 57214741 | 2006 |
| 75235972 | 2007 |
| 74757575 | 2008 |
| 70600627 | 2009 |
| 6017974 | 2010 |
+----------+---------------------+
resource is :
http://download.wikimedia.org/enwiki/20101011/enwiki-20101011-stub-meta-his…
64305 2001
616257 2002
1596612 2003
6979494 2004
20642853 2005
57043694 2006
74936692 2007
74387391 2008
70085652 2009
53054853 2010
---------------------
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l