Hello,
Just a heads-up for anyone using the HTML dumps: apart from the missing namespaces issue already mentioned on this list, entire pages also seem to be missing, and some of the included page data is outdated and does not contain the latest changes. I have no idea how many pages are affected.
Phabricator ticket with more details: https://phabricator.wikimedia.org/T305407
– Jan