I think. Well, I hope.
The whitelist at http://dumps.wikimedia.org/other/pagecounts-all-sites/README.txt claims that meta.mediawiki.org is whitelisted. As is usability.mediawiki.org. As is...you get the picture ;)
Unless I've had a stroke and am hallucinating the *.mediawiki.org, we mean wikimedia. At least it validates the people who grumble the two names are too similar? ;)
Have fun,
Hi,
On Tue, Mar 10, 2015 at 03:45:53AM -0400, Oliver Keyes wrote:
[ Typo in secondary documentation of pagecounts-all-sites ]
Thanks, fixed in HDFS.
After the next rsync (in ~1 hour) the new README should be live.
Have fun, Christian
Yay; thank you! :)
On 10 March 2015 at 12:16, Christian Aistleitner christian@quelltextlich.at wrote:
Hi,
On Tue, Mar 10, 2015 at 03:45:53AM -0400, Oliver Keyes wrote:
[ Typo in secondary documentation of pagecounts-all-sites ]
Thanks, fixed in HDFS.
After the next rsync (in ~1 hour) the new README should be live.
Have fun, Christian
-- ---- quelltextlich e.U. ---- \ ---- Christian Aistleitner ---- Companies' registry: 360296y in Linz Christian Aistleitner Kefermarkterstrasze 6a/3 Email: christian@quelltextlich.at 4293 Gutau, Austria Phone: +43 7946 / 20 5 81 Fax: +43 7946 / 20 5 81 Homepage: http://quelltextlich.at/
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Hi,
[ just to keep archives happy ]
On Tue, Mar 10, 2015 at 05:16:30PM +0100, Christian Aistleitner wrote:
After the next rsync (in ~1 hour) the new README should be live.
The new README is live now at: http://dumps.wikimedia.org/other/pagecounts-all-sites/README.txt
Have fun, Christian
Is that in public version control somewhere?
Assuming not, is there a path towards that?
While I don't mind so much the README, I'm more concerned about the landing page at http://dumps.wikimedia.org/ which is quite dated and would benefit from being in public version control so that maintainers can keep it in sync with other simple html pages we serve.
https://phabricator.wikimedia.org/T87404 https://phabricator.wikimedia.org/T54055 https://phabricator.wikimedia.org/T73152
— Timo
On 10 Mar 2015, at 17:16, Christian Aistleitner christian@quelltextlich.at wrote:
Hi,
On Tue, Mar 10, 2015 at 03:45:53AM -0400, Oliver Keyes wrote:
[ Typo in secondary documentation of pagecounts-all-sites ]
Thanks, fixed in HDFS.
After the next rsync (in ~1 hour) the new README should be live.
Have fun, Christian
-- ---- quelltextlich e.U. ---- \ ---- Christian Aistleitner ---- Companies' registry: 360296y in Linz Christian Aistleitner Kefermarkterstrasze 6a/3 Email: christian@quelltextlich.at 4293 Gutau, Austria Phone: +43 7946 / 20 5 81 Fax: +43 7946 / 20 5 81 Homepage: http://quelltextlich.at/
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Hi Timo,
On Tue, Mar 10, 2015 at 09:46:53PM +0100, Timo Tijhof wrote:
Is that in public version control somewhere?
The real documentation is under revision control (through wikitech).
As explained in the first section of the README, that README is just a pointer to the authorative Documentation on wiki:
https://wikitech.wikimedia.org/wiki/Analytics/Data/Pagecounts-all-sites
That's in the usual spot for datasets, and since it's a wikipage, everyone can be bold there :-), it's watchable, and also under revision control.
Assuming not, is there a path towards that?
For example:
hdfs://analytics-hadoop/wmf/data/archive/pagecounts-all-sites/README.txt
But the above Wikipage has details on availability of that dataset (and hence the README.txt).
While I don't mind so much the README, I'm more concerned about the landing page at http://dumps.wikimedia.org/ which is quite dated and would benefit from being in public version control so that [...]
Apergos puppetized this not too long ago.
You're probably looking for
https://git.wikimedia.org/blob/operations%2Fpuppet.git/4d8af109c86228c7ac2b5... https://git.wikimedia.org/blob/operations%2Fpuppet.git/4d8af109c86228c7ac2b5...
Have fun, Christian