@Dušan Kreheľ: I think there's a misunderstanding. I read your re-written article. In it, you say that the current format is:
domain_code page_title count_views total_response_size
As an example, you give this:
sk Kreheľ 2 0
But that format is actually deprecated. The new format is pageviews_complete, which looks like this:
sk.wikipedia Kreheľ null desktop 13 B2D2G2J2O2T1V1X1
The B2D2G2J2O2T1V1X1 is exactly the kind of encoding you're talking about, and no 0-values are present.
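To make the encoding concrete, here is a minimal sketch of how such a line could be decoded. It assumes the six space-separated fields are wiki code, page title, page id, client type, daily total, and hourly counts, and that the letters A to X stand for hours 0 to 23, each followed by the view count for that hour. The names PageviewRecord, decode_hourly, and parse_line are hypothetical, chosen only for this illustration.

```python
import re
from typing import Dict, NamedTuple


class PageviewRecord(NamedTuple):
    # Field names are assumptions made for this sketch.
    wiki: str
    title: str
    page_id: str
    client: str
    daily_total: int
    hourly: Dict[int, int]


def decode_hourly(encoded: str) -> Dict[int, int]:
    """Map each hour (A=0 ... X=23) to its view count; hours with no views are absent."""
    return {
        ord(letter) - ord("A"): int(count)
        for letter, count in re.findall(r"([A-X])(\d+)", encoded)
    }


def parse_line(line: str) -> PageviewRecord:
    """Split one line into its six fields and decode the hourly string."""
    wiki, title, page_id, client, total, hourly = line.rstrip("\n").split(" ")
    return PageviewRecord(wiki, title, page_id, client, int(total), decode_hourly(hourly))


record = parse_line("sk.wikipedia Kreheľ null desktop 13 B2D2G2J2O2T1V1X1")
print(record.hourly)                # hours 1, 3, 6, 9, 14, 19, 21, 23 only
print(sum(record.hourly.values()))  # matches the daily total field
```

For the sample line, the decoded hourly counts add up to the daily total of 13, and only the eight hours that had views appear in the result.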
You made the point that we are missing a yearly rollup in this new format. This would be quite a large file, but if there's a good use case for such a dump, filing a request in Phabricator is a good way to proceed.
On Sat, Oct 1, 2022 at 9:58 AM Dušan Kreheľ dusankrehel@gmail.com wrote:
The big update of the article is done. Please take a look.
Gergő Tisza: The current hourly format for fresh data can stay as it is. It can be converted to another format later, which would make it more suitable for other users.
2022-09-18 22:35 GMT+02:00, Dušan Kreheľ dusankrehel@gmail.com:
I have updated the document. I added the export of human pageviews for year 2021. The statistics are in the article. A download link has been added.
Dan Andreescu: There was no problem understanding you.
2022-09-05 21:48 GMT+02:00, Dan Andreescu dandreescu@wikimedia.org:
Hi Dušan,
I added the details on pageviews_complete to the talk page on your proposal <https://en.wikipedia.org/w/index.php?title=User_talk:Du%C5%A1an_Krehe%C4%BE/...>.
Please let me know if it's still confusing.
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org To unsubscribe send an email to wikitech-l-leave@lists.wikimedia.org https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/