Hi all,

I am trying to read the dump from

https://dumps.wikimedia.your.org/enwiki/20200701/enwiki-20200701-pages-articles-multistream.xml.bz2

using a Java XMLStreamReader but it complains about the format. It looks like the file does not contain an XML header (<?xml version="1.0"?> or such) and after unpacking and prepending the header all seems fine.

Is there a good reason why headers are missing?

Cheers,

Alex