Page Ids are still not coming in uniformly, but in this case should be there enough to figure out what - is, maybe.  That's a good idea.

On Saturday, January 23, 2016, Oliver Keyes <okeyes@wikimedia.org> wrote:
+1. Could we look at the pageIDs rather than titles? Is that being
passed through uniformly yet?

On 23 January 2016 at 13:08, Toby Negrin <tnegrin@wikimedia.org> wrote:
> Thanks Dan -- I'm just concerned that we might be missing something (like
> the central notice banners back in the day) with a fairly large magnitude.
>
> -Toby
>
> On Sat, Jan 23, 2016 at 3:36 AM, Dan Andreescu <dandreescu@wikimedia.org>
> wrote:
>>
>> Yes and no, it kind of depends whether we want to lose data.  We've been
>> talking about better ways to say "Unknown" but /wiki/Unknown is a page too
>> :)  We're just not focusing on this level of detail yet, bigger fish to fry,
>> caveat emptor, etc.
>>
>>
>> On Saturday, January 23, 2016, Toby Negrin <tnegrin@wikimedia.org> wrote:
>>>
>>> Is that a bug in the ETL?
>>>
>>> On Friday, January 22, 2016, Oliver Keyes <okeyes@wikimedia.org> wrote:
>>>>
>>>> Actually - is Hadoop's "nothing was provided in this field!" making it
>>>> doubly confusing :/
>>>>
>>>> On 22 January 2016 at 22:06, Dan Garry <dgarry@wikimedia.org> wrote:
>>>> > On 22 January 2016 at 15:17, Ryan Kaldari <rkaldari@wikimedia.org>
>>>> > wrote:
>>>> >>
>>>> >> Any idea why the most popular article in India is "-"?
>>>> >
>>>> >
>>>> > That specific article often sees a lot of traffic. This is normally
>>>> > caused
>>>> > by a bot, spider, or other automaton. Unfortunately, by definition no
>>>> > method
>>>> > of detecting automated traffic is perfect, so things like this often
>>>> > slip
>>>> > through.
>>>> >
>>>> > Dan
>>>> >
>>>> > --
>>>> > Dan Garry
>>>> > Lead Product Manager, Discovery
>>>> > Wikimedia Foundation
>>>> >
>>>> > _______________________________________________
>>>> > Analytics mailing list
>>>> > Analytics@lists.wikimedia.org
>>>> > https://lists.wikimedia.org/mailman/listinfo/analytics
>>>> >
>>>>
>>>>
>>>>
>>>> --
>>>> Oliver Keyes
>>>> Count Logula
>>>> Wikimedia Foundation
>>>>
>>>> _______________________________________________
>>>> Analytics mailing list
>>>> Analytics@lists.wikimedia.org
>>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>>
>> _______________________________________________
>> Analytics mailing list
>> Analytics@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>
>
> _______________________________________________
> Analytics mailing list
> Analytics@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/analytics
>



--
Oliver Keyes
Count Logula
Wikimedia Foundation

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics