datasets even with imperfect measurements is important.
Particularly for
longitudinal analysis.
Have in mind that maintaining these old dumps is not "free", it causes a
lot of confusion and maintenance costs to have several pageview definitions
around. We get a lot of questions about spiky-ness of old definition and we
need to maintain software that generates the old files thus, we think is
reasonable to ask our users to transition to the new definition and
eventually (in a period of months) turn off the old dumps.
On Thu, Dec 24, 2015 at 6:12 AM, Maurice Vergeer <m.vergeer(a)maw.ru.nl>
wrote:
Dear all,
As I just mentioned to Dan in a private email conversation, keeping
datasets even with imperfect measurements is important. Particularly for
longitudinal analysis.
Also, from what I understand - me being a newby here - is that the data
are stored in separate files. Dan suggested reordering the page into
categories. Maybe, another option is to create more extensive datasets with
more different measurements in a single datafile. On the other hand, the
files would become even bigger in size. Not an issue for mee, but for users
in the field accesibility (dowlnload bandwidth) could become an issue.
my two cents
Maurice
On Thu, Dec 24, 2015 at 2:58 PM, Alex Druk <alex.druk(a)gmail.com> wrote:
Nothing against this approach!
On Thu, Dec 24, 2015 at 2:55 PM, Dan Andreescu <dandreescu(a)wikimedia.org>
wrote:
On Thu, Dec 24, 2015 at 8:48 AM, Alex Druk <alex.druk(a)gmail.com> wrote:
Hi Dan,
Happy holidays!
Good idea to combine these datasets! However we have one more dataset
by Erik Zachte :
http://dumps.wikimedia.org/other/pagecounts-ez/
And that's an important one! But I was thinking we could re-organize
the page into categories. Erik's dataset could go into a "processed data"
category or something like that. The three I wanted to talk about on this
thread are just the raw data.
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
--
Thank you.
Alex Druk
alex.druk(a)gmail.com
(775) 237-8550 Google voice
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
--
________________________________________________
Maurice Vergeer
To contact me, see
http://mauricevergeer.nl/node/5
To see my publications, see
http://mauricevergeer.nl/node/1
________________________________________________
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics