Hi Timo,
On Tue, Mar 10, 2015 at 09:46:53PM +0100, Timo Tijhof wrote:
Is that in public version control somewhere?
The real documentation is under revision control (through wikitech).
As explained in the first section of the README, that README is just a pointer to the authorative Documentation on wiki:
https://wikitech.wikimedia.org/wiki/Analytics/Data/Pagecounts-all-sites
That's in the usual spot for datasets, and since it's a wikipage, everyone can be bold there :-), it's watchable, and also under revision control.
Assuming not, is there a path towards that?
For example:
hdfs://analytics-hadoop/wmf/data/archive/pagecounts-all-sites/README.txt
But the above Wikipage has details on availability of that dataset (and hence the README.txt).
While I don't mind so much the README, I'm more concerned about the landing page at http://dumps.wikimedia.org/ which is quite dated and would benefit from being in public version control so that [...]
Apergos puppetized this not too long ago.
You're probably looking for
https://git.wikimedia.org/blob/operations%2Fpuppet.git/4d8af109c86228c7ac2b5... https://git.wikimedia.org/blob/operations%2Fpuppet.git/4d8af109c86228c7ac2b5...
Have fun, Christian