Hi,
I've been using the very helpful pagecount dumps described at:
https://dumps.wikimedia.org/other/pagecounts-ez/
And it describes:
Line format:
wiki code (subproject.project)
article title
monthly total (with interpolation when data is missing)
hourly counts
In the wiki code field, the subproject is the language code (fr, el, ja, etc) or meta, commons etc.
The project is one of b (wikibooks), k (wiktionary), n (wikinews), o (wikivoyage), q (wikiquote), s (wikisource), v (wikiversity), z (wikipedia).
However, I've been coming across a large number of wiki codes "en.m". The "m" code is undocumented. It appears to be the mobile version of Wikipedia, but can anyone confirm that? Should the page be updated with this information?
Thanks, Michael
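For readers following along, here is a minimal sketch of splitting the wiki code field into its two parts as described in the message above. The function name `split_wiki_code` is hypothetical; the project table is copied from the dumps page as quoted, and "m" is deliberately left unmapped because that page does not document it.

```python
# Project codes exactly as listed on the pagecounts-ez page quoted above.
# "m" is intentionally absent: the page does not document it.
PROJECT_CODES = {
    "b": "wikibooks",
    "k": "wiktionary",
    "n": "wikinews",
    "o": "wikivoyage",
    "q": "wikiquote",
    "s": "wikisource",
    "v": "wikiversity",
    "z": "wikipedia",
}


def split_wiki_code(wiki_code):
    """Split 'subproject.project' at the first dot and look up the project.

    Returns (subproject, project_code, project_name). project_name is None
    when the code is not in the documented table (for example 'm').
    """
    subproject, _, project_code = wiki_code.partition(".")
    return subproject, project_code, PROJECT_CODES.get(project_code)


print(split_wiki_code("en.z"))  # ('en', 'z', 'wikipedia')
print(split_wiki_code("en.m"))  # ('en', 'm', None)
```

Returning None for unknown codes, rather than guessing, makes undocumented values such as "m" easy to spot and count before deciding how to treat them.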
Hi Michael!
Yes, the ".m" code can stand for either being a *.mediawiki.org project or for being a mobile wiki (you can separate both cases). See the docs here: https://wikitech.wikimedia.org/wiki/Analytics/Archive/Data/Pagecounts-all-si...
I created a task to add some more documentation to the page you linked: https://phabricator.wikimedia.org/T180452
Thanks a lot!
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Sorry that this code scheme is not so intuitive, with two meanings for 'm' depending on where it appears.
The coding system was extended several times, and Christian Aistleitner and I reluctantly prioritized backward compatibility over intuitiveness.
Erik
From: Analytics [mailto:analytics-bounces@lists.wikimedia.org] On Behalf Of Marcel Ruiz Forns
Sent: Tuesday, November 14, 2017 13:07
To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. analytics@lists.wikimedia.org
Subject: Re: [Analytics] Undocumented project code in pagecounts-ez
Michael Baldwin, 14/11/2017 04:43:
However, I've been coming across a large number of wiki codes "en.m". The "m" code is undocumented. It appears to be the mobile version of Wikipedia, but can anyone confirm that? Should the page be updated with this information?
Historically we collect most docs here:
https://en.wikipedia.org/wiki/User:Killiondude/stats
https://archive.org/details/wikipedia_visitor_stats&tab=about
Federico
Thanks, Federico.
In the docs you referenced, I can't find any mention of "en.m" as distinct from "en.z". This page https://archive.org/details/wikipedia_visitor_stats&tab=about describes codes, but they're different from the ones in pagecounts-ez https://dumps.wikimedia.org/other/pagecounts-ez/.
I've noticed the "en.m" lines only started appearing in Dec 2015.
I'm just trying to understand: if I want the most accurate pagecounts over time, should I be including the "en.m" lines on top of "en.z", or are they something different?
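One way to look into this before deciding is to tally the monthly totals per wiki code for a single article and compare them. The sketch below assumes the fields on each line are separated by whitespace in the order given on the dumps page (wiki code, article title, monthly total, hourly counts). The function name `monthly_totals_by_code` is hypothetical, and the sample lines are invented for illustration, including the placeholder hourly-count strings.

```python
from collections import defaultdict


def monthly_totals_by_code(lines, title):
    """Sum the monthly total field per wiki code for one article title.

    Assumes whitespace-separated fields: wiki code, title, monthly total,
    hourly counts. Lines that do not fit that shape are skipped.
    """
    totals = defaultdict(int)
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue
        wiki_code, article, monthly_total = fields[0], fields[1], fields[2]
        if article == title and monthly_total.isdigit():
            totals[wiki_code] += int(monthly_total)
    return dict(totals)


# Invented sample data, not taken from a real dump file.
sample = [
    "en.z Example_page 120 A5B3",
    "en.m Example_page 80 A2C1",
    "fr.z Example_page 30 B4",
]

totals = monthly_totals_by_code(sample, "Example_page")
print(totals)  # {'en.z': 120, 'en.m': 80, 'fr.z': 30}

# Adding the two is only meaningful if "en.m" is mobile traffic for the
# same wiki; that is an assumption here, not something the dumps page states.
combined = totals.get("en.z", 0) + totals.get("en.m", 0)
print(combined)  # 200
```

Keeping the per-code totals separate until the meaning of "en.m" is confirmed avoids baking an unverified assumption into the combined series.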
Maybe this doc will help?
https://wikitech.wikimedia.org/wiki/Analytics/Archive/Data/Pagecounts-all-si...
Hi, I needed to read those codes for all projects last year, and I tried to share what I learned; maybe it will help:
https://meta.wikimedia.org/wiki/Learning_patterns/Tips_for_reading_project_c...
Feel free to correct my bad English :)