Pada tanggal 31 Mei 2018 7.00 PM, < xmldatadumps-l-request@lists.wikimedia.org> menulis:
Send Xmldatadumps-l mailing list submissions to xmldatadumps-l@lists.wikimedia.org
To subscribe or unsubscribe via the World Wide Web, visit https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l or, via email, send a message with subject or body 'help' to xmldatadumps-l-request@lists.wikimedia.org
You can reach the person managing the list at xmldatadumps-l-owner@lists.wikimedia.org
When replying, please edit your Subject line so it is more specific than "Re: Contents of Xmldatadumps-l digest..."
Today's Topics:
- change to output file numbering of big wikis (Ariel Glenn WMF)
Message: 1 Date: Thu, 31 May 2018 14:36:51 +0300 From: Ariel Glenn WMF ariel@wikimedia.org To: Wikipedia Xmldatadumps-l Xmldatadumps-l@lists.wikimedia.org, Wikimedia developers wikitech-l@lists.wikimedia.org Subject: [Xmldatadumps-l] change to output file numbering of big wikis Message-ID: <
CALCvg_4Qmi2WsfiTFMqzHQBuX0fF0WZHhkz-YSWS7W0_zuBvZg@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
TL;DR: Scripts that reply on xml files numbered 1 through 4 should be updated to check for 1 through 6.
Explanation:
A number of wikis have stubs and page content files generated 4 parts at a time, with the appropriate number added to the filename. I'm going to be increasing that thi month to 6.
The reason for the increase is that near the end of the run there are usually just a few big wikis taking their time at completing. If they run with 6 processes at once, they'll finish up a bit sooner.
If you have scripts that rely on the number 4, just increase it to 6 and you're done.
This will go into effect for the June 1 run and all runs afterwards.
Thanks!
xmldatadumps-l@lists.wikimedia.org