Not that this is offtopic here, but you will find probably more
knowledgeable people and probably a quicker response at the specialized
list
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
On Mon, Sep 3, 2018 at 3:06 PM BinĂ¡ris <wikiposta(a)gmail.com> wrote:
Hi,
As far as I understand, pages in an XML dump are in the order of their
original creation.
This does not correspond to the page ID, because if a page gets a new id
after deletion and restore or renaming to that title or anything, the order
still remains the original.
But this sortkey itself is not stored. In other words, a dump is not sorted
by any key one could finf in the dump, and behaves as an unosorted
structure.
Is this true? Can I use any non-linear (e.g. binary) search in a dump?
--
BinĂ¡ris
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
--
Jaime Crespo
<http://wikimedia.org>