Hi,
I would be interested to know how many pages in
enwiki-latest-pages-articles.xml . My own count gives 19,4 Mio. pages.
Can this be, at least roughly, confirmed?
In the internet I just find these numbers:
5,861,178 - I guess this are all namespace 0 pages
47,826,337 - this are all pages in all namespaces
Sigbert
--
https://hu.berlin/skhttps://hu.berlin/mmstat3
Hi,
I am trying to work with the Pagelinks file in order to extract the number of articles that link to a specific Wiki page (the same idea as can be seen in the "What-links-here" tool: https://en.wikipedia.org/wiki/Help:What_links_here).
However, when I loop over this SQL-like file, I find very weird cases of links that don't seem to exist in Wikipedia.
For example, the first line in the file indicates that there is a link between page ids 1939 and 2. PageID 2 doesn't even seem to exist.
Even when I look at pages that do exist, the link indicated in the file does not exist on the actual wiki page.
Am I missing anything?
Hi there,
I'm used to consolidate some wikidata knowledge, based on the file
latest-all.json.bz2
<https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.json.bz2>.
And when looking at the usual URL at
https://dumps.wikimedia.org/wikidatawiki/entities/,
I can see that the file is giving a very small size since February,
16th 2025 (39).
Is there an issue at your side that could help understanding why the
file is giving this small size,
or is the file no more available ?
Thanks for your answer !
Best,
Guillaume