Page IDs are still not coming in uniformly, but in this case there should be
enough of them to figure out what "-" is, maybe. That's a good idea.
On Saturday, January 23, 2016, Oliver Keyes <okeyes(a)wikimedia.org> wrote:
+1. Could we look at the pageIDs rather than titles? Is that being
passed through uniformly yet?
On 23 January 2016 at 13:08, Toby Negrin <tnegrin(a)wikimedia.org> wrote:
Thanks Dan -- I'm just concerned that we might be missing something (like
the central notice banners back in the day) with a fairly large magnitude.
-Toby
On Sat, Jan 23, 2016 at 3:36 AM, Dan Andreescu <dandreescu(a)wikimedia.org> wrote:
>
> Yes and no, it kind of depends whether we want to lose data. We've been
> talking about better ways to say "Unknown" but /wiki/Unknown is a page too
> :) We're just not focusing on this level of detail yet, bigger fish to fry,
> caveat emptor, etc.
>
>
> On Saturday, January 23, 2016, Toby Negrin <tnegrin(a)wikimedia.org> wrote:
>>
>> Is that a bug in the ETL?
>>
>> On Friday, January 22, 2016, Oliver Keyes <okeyes(a)wikimedia.org> wrote:
>>>
>>> Actually, "-" is Hadoop's "nothing was provided in this field!", making it
>>> doubly confusing :/
>>>
>>> On 22 January 2016 at 22:06, Dan Garry <dgarry(a)wikimedia.org> wrote:
>>> > On 22 January 2016 at 15:17, Ryan Kaldari <rkaldari(a)wikimedia.org> wrote:
>>> >>
>>> >> Any idea why the most popular article in India is "-"?
>>> >
>>> >
>>> > That specific article often sees a lot of traffic. This is normally caused
>>> > by a bot, spider, or other automaton. Unfortunately, by definition no method
>>> > of detecting automated traffic is perfect, so things like this often slip
>>> > through.
>>> >
>>> > Dan
>>> >
>>> > --
>>> > Dan Garry
>>> > Lead Product Manager, Discovery
>>> > Wikimedia Foundation
>>> >
>>> > _______________________________________________
>>> > Analytics mailing list
>>> > Analytics(a)lists.wikimedia.org
>>> > https://lists.wikimedia.org/mailman/listinfo/analytics
>>> >
>
>
>
> --
> Oliver Keyes
> Count Logula
> Wikimedia Foundation
>
--
Oliver Keyes
Count Logula
Wikimedia Foundation