On the toolserver, the pagelinks table contains lots'o'links to a blank target: select * from pagelinks where pl_title="" limit 100 ; (tried up to limit 10000, found 10000 but sloooow)
Apparently, many links to empty page titles in namespace 0. This surfaced when one of my tools spat out blank results. It wasn't there a few weeks ago (two month at most). Tested for en and de wikipedia.
Is this a toolserver replication thing, or are these things in the live database as well? It's probably not a showstopper, but we might want to get rid of both links and bug anyway.
Magnus
How do links like that even parse?
Does [[]] parse? It shouldn't, and if a title is invalid (illegal chars), then no pagelinks entry should be added.
Magnus Manske-2 wrote:
On the toolserver, the pagelinks table contains lots'o'links to a blank target: select * from pagelinks where pl_title="" limit 100 ; (tried up to limit 10000, found 10000 but sloooow)
Apparently, many links to empty page titles in namespace 0. This surfaced when one of my tools spat out blank results. It wasn't there a few weeks ago (two month at most). Tested for en and de wikipedia.
Is this a toolserver replication thing, or are these things in the live database as well? It's probably not a showstopper, but we might want to get rid of both links and bug anyway.
Magnus
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org http://lists.wikimedia.org/mailman/listinfo/wikitech-l
wikitech-l@lists.wikimedia.org