The "latest" files are just links to files in a dump directory; this includes the md5 file. That's why you get a mismatch. I think there is a report about this in bugzilla (if not please add one). This is relatively low priority on the list of things to fix, since there is a workaround: wait a few days for the run to complete.
Ariel
Στις 09-10-2012, ημέρα Τρι, και ώρα 00:01 -0400, ο/η Zachary Harris έγραψε:
On 10/08/2012 11:36 PM, Hydriz Wikipedia wrote:
Shouldn't the new files be copied over to the latest only after the whole dump is completed?
I was actually hoping to go the other direction---keeping the September and October dumps (to use the present example) mixed together in "latest", but having the latest md5sums file track with the mixture. The above suggestion would almost certainly be an easier way to maintain consistency between the dumps and the md5sums file. My request would maintain a "hot" up-to-date "latest" directory with consistent md5sums, although admittedly it may not be worth the trouble if doing so is not straightforward.
-Zach
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l