Hi,
I've been trying to match edit activity with pagecounts, but I've run into a couple of problems. The amazing pagecounts dumps (https://dumps.wikimedia.org/other/pagecounts-raw/) use the page URL to identify each individual page:
fr.b Special:Recherche/Achille_Baraguey_d%5C%27Hilliers 1 624
while the stub-meta-history uses the "raw" title:
<page> <title>Wikipedia:Community Portal</title> <ns>4</ns> <id>1270</id>
so I need an easy way to map a title to its URL. I imagine there are some rules on how this "translation" is done? My Google-fu has failed to turn them up.
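To make the question concrete, here is a minimal Python sketch of the kind of mapping I have in mind. It is only a guess: it assumes spaces become underscores and that everything else is UTF-8 percent-encoded, with ":" and "/" left alone. The function names are made up for illustration, and I don't know whether this matches the rules actually used, or whether the pagecounts entries are normalized at all.

```python
from urllib.parse import quote, unquote


def title_to_pagecount_key(title: str) -> str:
    """Guess at the URL form of a raw title from stub-meta-history.

    ASSUMPTION (not confirmed): spaces become underscores and the rest is
    UTF-8 percent-encoded, leaving ':' and '/' unescaped.
    """
    return quote(title.replace(" ", "_"), safe=":/")


def pagecount_key_to_title(key: str) -> str:
    """Inverse guess: percent-decode, then turn underscores back into spaces."""
    return unquote(key).replace("_", " ")


if __name__ == "__main__":
    print(title_to_pagecount_key("Wikipedia:Community Portal"))
    print(pagecount_key_to_title("Special:Recherche/Achille_Baraguey_d%5C%27Hilliers"))
```

Going from the pagecounts side back to titles (decoding) may be the safer direction, since the same title could presumably show up under more than one encoding.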
Also, is the timezone of the timestamps in the meta-history files:
<timestamp>2006-02-18T19:29:10Z</timestamp>
the same as the one used in the pagecount filenames:
pagecounts-20140725-070000.gz
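For reference, this is roughly how I would line the two up in Python. The trailing "Z" in the dump timestamp is the ISO 8601 marker for UTC. Treating the pagecounts filename as UTC as well is exactly the assumption I am asking about, and the function names are just for illustration.

```python
import re
from datetime import datetime, timezone


def parse_dump_timestamp(ts: str) -> datetime:
    """Parse a <timestamp> value such as 2006-02-18T19:29:10Z (Z means UTC)."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def parse_pagecounts_filename(name: str) -> datetime:
    """Parse the date and hour encoded in a pagecounts-YYYYMMDD-HHMMSS.gz name.

    ASSUMPTION (this is the open question): the filename time is in UTC.
    """
    m = re.fullmatch(r"pagecounts-(\d{8})-(\d{6})\.gz", name)
    if m is None:
        raise ValueError(f"unexpected pagecounts filename: {name!r}")
    stamp = datetime.strptime(m.group(1) + m.group(2), "%Y%m%d%H%M%S")
    return stamp.replace(tzinfo=timezone.utc)


if __name__ == "__main__":
    print(parse_dump_timestamp("2006-02-18T19:29:10Z"))
    print(parse_pagecounts_filename("pagecounts-20140725-070000.gz"))
```

If the filenames turn out to use a different timezone, only the tzinfo in the second function would need to change.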
Best,
B
*******************************************
Bruno Miguel Tavares Gonçalves, PhD
Homepage: www.bgoncalves.com
Email: bgoncalves@gmail.com
*******************************************