Hi,

I've been trying to match edit activity with pagecounts, but I've run into a couple of problems. The amazing pagecounts dumps (https://dumps.wikimedia.org/other/pagecounts-raw/) use the page URL to identify each page:

      fr.b Special:Recherche/Achille_Baraguey_d%5C%27Hilliers 1 624

while the stub-meta-history uses the "raw" title:

  <page>
    <title>Wikipedia:Community Portal</title>
    <ns>4</ns>
    <id>1270</id>

so I need an easy way to map a title to its URL. I imagine there are some rules for how this "translation" is done? My google-fu has failed to find them.
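
To make the question concrete, here is a minimal sketch of the kind of mapping I have in mind. It rests on assumptions that are not confirmed anywhere above: that a URL-style title is the raw title with spaces turned into underscores and the UTF-8 bytes percent-encoded, and that ":" and "/" are left unescaped. The function names are hypothetical. Since the pagecounts lines record the URL as it was requested, the encoding may vary from line to line, so it is probably safer to decode the pagecounts field and compare in "raw title" form than to encode the dump title.

```python
from urllib.parse import quote, unquote


def title_to_url_form(title: str) -> str:
    """Raw dump title -> URL-style title.

    ASSUMPTION: spaces become underscores and everything except
    ':' and '/' (plus letters, digits and '_.-~') is percent-encoded.
    """
    return quote(title.replace(" ", "_"), safe="/:")


def normalize_pagecounts_title(page: str) -> str:
    """pagecounts page field -> raw-title form for comparison.

    Percent-decodes the field, then turns underscores back into spaces.
    Does not handle first-letter capitalisation or namespace aliases.
    """
    return unquote(page).replace("_", " ")


if __name__ == "__main__":
    print(title_to_url_form("Wikipedia:Community Portal"))
    print(normalize_pagecounts_title("Wikipedia:Community_Portal"))
```

With this, the key for joining the two datasets would be the project code plus the normalized title, rather than the URL string itself.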

Also, is the timezone used in the meta-history files:

<timestamp>2006-02-18T19:29:10Z</timestamp>

the same as the one used in the pagecounts filenames, such as the following?

pagecounts-20140725-070000.gz
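
For reference, this is roughly how I would align the two once the timezone question is settled. The trailing "Z" in the dump timestamp denotes UTC under ISO 8601. Treating the filename timestamp as UTC is only an assumption here, which is exactly what I am asking about. The function names are hypothetical.

```python
from datetime import datetime, timezone


def parse_dump_timestamp(ts: str) -> datetime:
    """Parse a <timestamp> value; the trailing 'Z' means UTC."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def parse_pagecounts_filename(name: str) -> datetime:
    """Parse the date and hour embedded in a pagecounts filename.

    ASSUMPTION: the filename timestamp is in UTC (unconfirmed).
    """
    parsed = datetime.strptime(name, "pagecounts-%Y%m%d-%H%M%S.gz")
    return parsed.replace(tzinfo=timezone.utc)


if __name__ == "__main__":
    print(parse_dump_timestamp("2006-02-18T19:29:10Z"))
    print(parse_pagecounts_filename("pagecounts-20140725-070000.gz"))
```

If the filenames turn out to use a different timezone, only the `tzinfo` in the second function would need to change.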

Best,

B

*******************************************
Bruno Miguel Tavares Gonçalves, PhD
Homepage: www.bgoncalves.com
Email: bgoncalves@gmail.com
*******************************************