On 14 December 2010 01:57, Monica shu
<monicashu452(a)gmail.com> wrote:
Thanks Diederik and Waksman,
It seems that I need to parse the dump for article data to get this piece
of information...
Yes, that will be the last resort, but I think there may be some easier
way...
I just got home and checked the dump I've downloaded.
It was downloaded on June 10, 2010, and its size is 6117881141 bytes in bz2.
I remember that when I downloaded it, it was the latest version at that moment.
As the dumps are generated every N months, and the one I have is bigger than
the 2010-01-30 version that Waksman mentioned, my version should be from
between February and June.
A Google search hints that enwiki-20100312-pages-articles.xml.bz2
might be the one with size 6117881141.
Andrew Dunbar (hippietrail)
Does anybody remember the versions from this period, or happen to have
downloaded the same version as me?
Thanks very much for any related information!
Best regards!
Monica
On Mon, Dec 13, 2010 at 3:24 PM, Shaun Waksman <shaunwaksman(a)gmail.com> wrote:
Hi Monica,
The file sizes of the EN pages dumps that are available today are:
5204823166 enwiki-20100312-pages-articles.xml.7z
5983814213 enwiki-20100130-pages-articles.xml.bz2
Note that the former is in 7z and the latter is in bz2.
Does this help?
Shaun
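A size comparison like the one above is easy to script. The following is only a minimal sketch: the table holds just the two sizes quoted in this message (not an official or complete list), and the function names are made up for illustration.

```python
import os

# Sizes in bytes as quoted in this thread; NOT an official or complete list.
KNOWN_DUMP_SIZES = {
    5204823166: "enwiki-20100312-pages-articles.xml.7z",
    5983814213: "enwiki-20100130-pages-articles.xml.bz2",
}


def identify_dump_by_size(size_bytes, known=KNOWN_DUMP_SIZES):
    """Return the dump filename whose size matches exactly, or None."""
    return known.get(size_bytes)


def identify_local_dump(path, known=KNOWN_DUMP_SIZES):
    """Look up a local file by its size on disk (hypothetical helper)."""
    return identify_dump_by_size(os.path.getsize(path), known)
```

An exact byte match is a strong hint but not proof, and it only works for the same compression format. A size of 6117881141 matches neither entry above, so it would return None.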
On Mon, Dec 13, 2010 at 8:45 AM, Monica shu <monicashu452(a)gmail.com>
wrote:
Hi all,
I downloaded a dump several months ago.
I accidentally lost the version info for this dump, so I don't know when
it was generated.
Is there any place that lists info about past dumps (such as
size...)?
Thanks!
Monica
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
It should be trivial to add the dump date to the header of each dump
file. Since the date field of the filename itself is often replaced by
"latest", this could be very useful. It could also be
useful to include the revision ID and timestamp of the latest revision,
but I assume this would be a little more difficult. Should I file a
feature request?
Andrew Dunbar (hippietrail)
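
Until something like that exists, a dump can be inspected directly. Here is a rough sketch, assuming a .bz2 XML dump in the usual MediaWiki export layout (a <siteinfo> block near the top, one <timestamp> element per revision, each on its own line). The function names are hypothetical. The first function prints what the header currently contains; the second scans the whole file for the newest revision timestamp, which should give a lower bound on when the dump was generated.

```python
import bz2
import re

# Assumes each <timestamp> element sits on a single line, as in typical dumps.
TIMESTAMP_RE = re.compile(r"<timestamp>([^<]+)</timestamp>")


def read_dump_header(path, max_lines=200):
    """Return the lines up to and including </siteinfo> from a .bz2 XML dump."""
    header = []
    with bz2.open(path, "rt", encoding="utf-8") as f:
        for i, line in enumerate(f):
            header.append(line.rstrip("\n"))
            # Stop at the end of the header, or give up after max_lines.
            if "</siteinfo>" in line or i + 1 >= max_lines:
                break
    return header


def latest_revision_timestamp(path):
    """Scan the whole dump and return the newest <timestamp> value, or None.

    ISO 8601 UTC timestamps in the same format sort correctly as plain strings.
    """
    latest = None
    with bz2.open(path, "rt", encoding="utf-8") as f:
        for line in f:
            match = TIMESTAMP_RE.search(line)
            if match and (latest is None or match.group(1) > latest):
                latest = match.group(1)
    return latest
```

Reading the header is nearly instant, but the timestamp scan has to decompress the whole file, so on a multi-gigabyte dump expect it to take a while.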