Does anyone have a working definition of the characters allowed in filenames that I could apply when pre-processing the XML files? See [1] and [2] for the technical standards that apply by default.
In my NYPL uploads I have found that characters in the chosen filename like:
* ö (o umlaut)
* Æ (upper case ash / ae ligature)
caused GWT to halt the upload at that point (no warning back to me). These characters should be acceptable to the MediaWiki software.
These characters seem to be okay in the image page body, just not the filename. Other characters like é (e acute) appear to process fine. For the 18th century and earlier maps from the NYPL, this is a major time-sink. :-(
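For reference, the kind of pre-processing check I have in mind could look roughly like the Python sketch below. It is only a sketch under assumptions: `check_filename` and `FORBIDDEN` are names made up for illustration, the forbidden set is my understanding of what MediaWiki rejects in titles and upload names (it is not derived from the blacklists in [1] and [2]), and flagging every non-ASCII character is deliberately over-cautious since I do not know which ones GWT actually trips on. It also reports names that are not in Unicode NFC form, in case decomposed characters in the source metadata are part of the problem.

```python
import unicodedata

# Assumption: characters rejected in MediaWiki titles or upload names.
# This set is illustrative and is not taken from the linked blacklists.
FORBIDDEN = set('#<>[]|{}:/\\')


def check_filename(name):
    """Return (nfc_name, problems) for a proposed upload filename.

    problems is a list of human-readable strings; an empty list means
    nothing was flagged by this (hypothetical) check.
    """
    nfc = unicodedata.normalize("NFC", name)
    problems = []
    if nfc != name:
        # The source string used decomposed characters (e.g. o + combining umlaut).
        problems.append("not NFC-normalized")
    for ch in nfc:
        if ch in FORBIDDEN:
            problems.append(f"forbidden character {ch!r}")
        elif ord(ch) > 127:
            # Flag for manual review rather than reject outright.
            label = unicodedata.name(ch, "UNNAMED")
            problems.append(f"non-ASCII U+{ord(ch):04X} {label}")
    return nfc, problems


if __name__ == "__main__":
    for title in ["Map of K\u00f6ln.jpg", "\u00c6gypt 1750.jpg", "Plain name.jpg"]:
        print(title, check_filename(title)[1])
```

Running something like this over the titles extracted from the XML before handing them to GWT would at least give a list of files to rename by hand, instead of finding out when the upload stalls.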
Links
1. https://commons.wikimedia.org/wiki/MediaWiki:Filename-prefix-blacklist
2. https://commons.wikimedia.org/wiki/MediaWiki:Titleblacklist
Fae