Brion Vibber wrote:
Could be a bug in Mono's XmlWriter implementation. (The dumps from MediaWiki are filtered and split into multiple streams by a program I wrote in C# to produce full, current-only, and current-non-talk- non-userpage dumps from one run.) I'll take a look.
Now filed as http://bugzilla.ximian.com/show_bug.cgi?id=76095
Will see about fixing...
Have submitted a patch. The next dump should be correct.
Have I mentioned how much I hate UTF-16 and how a 16-bit "char" type promotes the writing of naive code that doesn't take surrogate pairs into account?
-- brion vibber (brion @ pobox.com)