Sorry, forgot to CC the list
On 10/02/13 23:36, Platonides wrote:
On 10/02/13 23:08, Robert Crowe wrote:
I'm seeing rows in the page table that have weird titles, and I'd like to be able to identify and filter them out, but I don't see properties that seem to identify them. For example:
page.page_id = 21441554 page.page_title = 4567797074e280934d6f726f63636f5f72656c6174696f6e73
What should I look for to identify pages like that?
Thanks,
Robert
Seems it's an artifact of the tool you are using to view the page table. It is showing you the title in hexadecimal, probably because you have it marked as binary. If you convert the hexadecimal, it translates to “Egypt–Morocco_relations”, which is the title of the page. The problem is in the presentation.
xmldatadumps-l@lists.wikimedia.org